Mining and Classification of Neologisms in Persian Blogs

Abstract

The exponential growth of the Persian blogosphere and the growing number of neologisms pose a major challenge for NLP applications that process Persian blogs. This paper describes a method for extracting and classifying newly constructed words and borrowings from Persian blog posts. An analysis of the occurrence of neologisms across five distinct topic categories points to a correspondence between the topic domain and the type of neologism most commonly encountered. The results suggest that different approaches should be used for the automatic detection and processing of neologisms, depending on the application domain.

Public released

yes

@2025 website by Karine Megerdoomian. Powered by Wix.
