loading page

Technical Language Processing for Telecommunications Specifications
  • Felipe A. Rodriguez Y.
Felipe A. Rodriguez Y.
Nokia Solutions and Networks Oy

Corresponding Author:[email protected]

Author Profile

Abstract

Large Language Models (LLMs) are continuously being applied in a more diverse set of con‐ texts. At their current state, however, even state‐of‐the‐art LLMs such as Generative Pre‐Trained Transformer 4 (GTP‐4) have challenges when extracting information from real‐world technical docu‐ mentation without a heavy preprocessing. One such area with real‐world technical documentation is telecommunications engineering, which could greatly benefit from domain‐specific LLMs. The unique format and overall structure of telecommunications internal specifications differs greatly from standard English and thus it is evident that the application of out‐of‐the‐box Natural Language Processing (NLP) tools is not a viable option. This article provides a brief outline of the limitations of out‐of‐the‐box NLP tools for processing technical information generated by telecommunications experts and expand the concept of Technical Language Processing (TLP) to the telecommunica‐ tions domain. Additionally, we emphasize the importance of use case definition by introducing the required information mapping from the perspective of a Q&A application that uses internal speci‐ fications as the source of knowledge. Finally, we recommend actions to mitigate the effect of the internal specifications format on information extraction, effectively achieving LLM‐friendly inter‐ nal specifications.
10 Aug 2024Submitted to Applied AI Letters
05 Sep 2024Submission Checks Completed
05 Sep 2024Assigned to Editor
09 Sep 2024Reviewer(s) Assigned
07 Oct 2024Review(s) Completed, Editorial Evaluation Pending
07 Oct 2024Editorial Decision: Revise Major