top of page

Mining and Classification of Neologisms in Persian Blogs

Abstract

The exponential growth of the Persian blogosphere and the increased number of neologisms create a major challenge in NLP applications of Persian blogs. This paper describes a method for extracting and classifying newly constructed words and borrowings from Persian blog posts. The analysis of the occurrence of neologisms across five distinct topic categories points to a correspondence between the topic domain and the type of neologism that is most commonly encountered. The results suggest that different approaches should be implemented for the automatic detection and processing of neologisms depending on the domain of application.

Public released

yes

External link

Not all documents are
available for download

@2025 website by Karine Megerdoomian. 

Catwoman logo
bottom of page