Linguistic Approaches for Automatic Term Extraction


Ananiadou, S. (1994) ‘A Methodology for Automatic Term Recognition’ Proceedings of Coling94, 1034-1038


Arppe, A. (1995) Term extraction from unrestricted text, Lingsoft Web site:


Bourigault, D. (1992) ‘Surface Grammatical Analysis for the Extraction of Terminological Noun Phrases’ Proceedings of Coling92, 977-981.


Bourigault, D., Gonzalez-Mullier, I., and Gros, C. (1996) LEXTER, a Natural Language Processing tool for Terminology Extraction, in Proceedings of the 7th Euralex International Congress, Goteborg.


Jacquemin, C. (1996) What is the tree that we see through the window: a linguistic approach to windowing and term variation. Information Processing and Management, 32(4):445-458.


Lauriston, A. (1994) Automatic recognition of complex terms: problems and the TERMINO solution. Terminology, 1(1):147-170.



Hybrid approaches combining Linguistics and Statistics for Automatic Term Extraction


Daille, B., Gaussier, E., and Lange, J. (1994) ‘Towards automatic extraction of monolingual and bilingual terminology’ Proceedings of Coling94, 515-521.


Dagan, I. and Church, K. (1995) Termight: identifying and translating technical terminology, in Proceedings of EACL’95, 34-40.


Enguehard, C. and Pantera, L. (1994) Automatic natural acquisition of a terminology. Journal of Quantitative Linguistics, 2(1), 27-32.


Frantzi, K. T. and Ananiadou, S. (1999) The C/NC value domain independent method for multi-word term extraction, Journal of Natural Language Processing, 6(3): 145-180.


Frantzi, K.T. and Ananiadou, S. (1996) ‘Extracting nested collocations’ in 16th Conference on Computational Linguistics, COLING, 41-46.


Justeson, J. and Katz, S. (1995) Technical Terminology: some linguistic properties and an algorithm for identification in text. Natural Language Engineering, 1, 9-27.


Maynard, D. and Ananiadou, S. (2000) Identifying Terms by their Family and Friends, in Proceedings of the 18th International Conference on Computational Linguistics, COLING 2000, 530-536.


Maynard, D. and Ananiadou, S. (1999) ‘Identifying contextual information for term extraction’, in Proc. of 5th International Congress on Terminology and Knowledge Engineering (TKE’99), 212-221.


Van der Eijk, P. (1993) Automating the acquisition of bilingual terminology. In Proceedings of EACL’93, 113-119.


Vivaldi, J., and Rodriguez, H. (2000) Improving term extraction by combining different techniques, in S. Ananiadou & D. Maynard (eds), Workshop on Computational Terminology for Medical and Biological Applications (NLP2000, Patras, Greece), 61-68.




General References on Term Extraction and Terminology


Boguraev, B. and Kennedy, C. (1999) Applications of term identification technology: domain description and content characterisation. Natural Language Engineering, 5(1):17-44.

Kageura, K. and Umino, B. (1996) ‘Methods of Automatic Term Recognition: A Review’ Terminology 3(2), 259-289.


Wright, S. and Budin, G. (eds) (1997) Handbook of Terminology Management, volume 1, Basic Concepts of Terminology Management, John Benjamins, Amsterdam.


Sager, J. (1990) A Practical Course in Terminology Processing. John Benjamins.


Related work from Information Retrieval (Automatic Indexing)


Bookstein, A. and Swanson, D.R. (1974) ‘Probabilistic models for automatic indexing’ Journal of the American Society for Information Science, 25(5), 312-318.


Cohen, J.D. (1995) ‘Highlights: language and domain independent automatic indexing terms for abstracting’ Journal of the American Society for Information Science, 46(3), 162-174.


Damerau, F. J. (1993) ‘Evaluating domain-oriented multi-word terms from texts’ Information Processing and Management 29(4), 433-447.


Dillon, M., and Gray, A. (1983) ‘FASIT: Fully automatic syntax-based indexing’ Journal of the American Society for Information Science, 34(2), 99-108.


Evans, D.A. and Lefferts, R.G. (1995) ‘CLARIT-TREC Experiments’ Information Processing and Management 31(3), 385-395.


Paice, C.D. and Jones, P.A. (1993) ‘The identification of important concepts in highly structured technical papers’ Proceedings of ACM-SIGIR’93, 69-77.


Salton, G. (1983). Introduction to modern information retrieval. Computer Science. McGraw-Hill.


Sparck-Jones, K. and Tait, J. (1984) Automatic search term variant generation, in Journal of Documentation, 40(1), 50-66.



Term Extraction for Biology


Andrade, M. A. and Valencia, A. (1998) Automatic extraction of keywords from scientific text: application to the knowledge domain of protein families, Bioinformatics, 14(7): 600-607



Collier, N., Nobata, C., and Tsujii, J. (2000) Extracting the Names of Genes and Gene Products with a Hidden Markov Model, Coling2000, 201-207.


Fukuda, K., Tsunoda, T., Tamura, A. and Takagi, T. (1998) Toward Information Extraction: Identifying protein names from biological papers, PSB-98, 705-716.


Gaizauskas, R., Demetriou, G., and Humphreys, K. (2000) Term Recognition in Biological Science Journal Articles, in S. Ananiadou & D. Maynard (eds), Workshop on Computational Terminology for Medical and Biological Applications (NLP2000, Patras, Greece), 37-44.


Klein, H., van den Berg, L.T.W., and Vos, R. (2000) The extraction of drug-ADR relations, in S. Ananiadou & D. Maynard (eds), Workshop on Computational Terminology for Medical and Biological Applications (NLP2000, Patras, Greece), 69-74.


Nobata, C., Collier, N. and Tsujii, J. (1999) Automatic term identification and classification in biology texts, in Proceedings of the Natural Language Pacific Rim Symposium (NLPRS’99), 369-375.


Oh, J.H., Chae, Y.S., and Choi, K.S. (2000) The Statistical Model for Automatic Recognition of Biological Terminologies, in S. Ananiadou & D. Maynard (eds), Workshop on Computational Terminology for Medical and Biological Applications (NLP2000, Patras, Greece), 29-35.


Rindflesch, T.C., Hunter, L., and Aronson, A.R. (1999) Mining molecular binding terminology from biomedical text. In Proceedings of the 1999 AMIA Annual Fall Symposium, N.M. Lorenzi (ed), 127-136.


Tersmette, K., Scott, A., Moore, G., Matheson, N., and Miller, R. (1988) Barrier word method for detecting molecular biology multiple word terms. In Greenes, R. (ed) Proc. of SCAMC, 207-211.