Department of Chinese, Translation and Linguistics
Research Degree Forum
Term Recognition Using Conditional Random Fields
Ms. ZHANG Xing
PhD candidate, Department of Chinese, Translation and Linguistics, City University of Hong Kong
Date: 15 September 2010, Wednesday
Time: 4:30 - 5:30 pm
Venue: B7603 (7/F, Blue Zone), Academic Building, CityU
A machine learning framework, Conditional Random fields (CRF), is constructed in this study, which exploits syntactic information to recognize biomedical terms. Features used in this CRF framework focus on syntactic information in different levels, including parent nodes, syntactic functions, syntactic paths and term ratios. A series of experiments have been done to study the effects of training sizes, general term recognition and novel term recognition. The experiment results show that features as syntactic paths and term ratios can achieve good precision of term recognition, including both general terms and novel terms. However, the recall of novel term recognition is still unsatisfactory, which calls for more effective features to be used. All in all, as this research studies in depth the uses of some unique syntactic features, it is innovative in respect of constructing machine learning based term recognition system.
Ms. ZHANG Xing is currently a PhD candidate in the Department of Chinese, Translation and Linguistics. Her research interests focus on corpus linguistics, mainly term recognition and extraction.
~ CTL Staff and Research Degree Students only ~