Selected Publications (1990-onwards)
- Ide, N. and Suderman, K. (2007). GrAF: A Graph-based Format for Linguistic Annotations. Proceedings of the Linguistic Annotation Workshop,held in conjunction with ACL 2007, Prague, June 28-29, 1-8.
- Ide, N. and Woolner, D. (2007). Historical Ontologies. In Ahmad, K, Brewster, C., and Stevenson, M. (eds.), Words and Intelligence II: Essays in Honor of Yorick Wilks, Springer, 137-152.
- Ide, N., Romary, L. (2007). Towards International Standards for Language Resources. In Dybkjaer, L., Hemsen, H., Minker, W. (Eds.), Evaluation of Text and Speech Systems, Springer, 263-84.
- Ide, N., Wilks, Y. (2006). Making Sense About Sense. In Agirre, E., Edmonds, P. (Eds.), Word Sense Disambiguation: Algorithms and Applications, Springer, 47-74..
- Ide, N. (2006). Making Senses: Bootstrapping Sense-tagged Lists of Semantically-Related Words. In Gelbukh, Alexander (Ed.), Computational Linguistics and Intelligent Text, Lecture Notes in Computer Science, Springer.
- Ide, N., Suderman, K. (2006). Integrating Linguistic Resources: The American National Corpus Model. Proceedings of the Fifth Language Resources and Evaluation Conference (LREC), Genoa, Italy.
- Ide, N., Romary, L.. (2006). Representing Linguistic Corpora and Their Annotations. Proceedings of the Fifth Language Resources and Evaluation Conference (LREC), Genoa, Italy.
- Ide, N., Suderman, K. (2006). Merging Layered Annotations Proceedings of Merging and Layering Linguistic Information, Workshop held in conjunction with LREC 2006, Genoa, Italy.
- Ide, N., Woolner, D. (2004). Exploiting Semantic Web Technologies for Intelligent Access to Historical Documents. Proceedings of the Fourth Language Resources and Evaluation Conference (LREC), Lisbon, 2177-80.
- Ide, N., Suderman, K. (2004). The American National Corpus First Release. Proceedings of the Fourth Language Resources and Evaluation Conference (LREC), Lisbon, 1681-84.
- Ide, N., Romary, L. (2004). International standard for a linguistic annotation framework. Journal of Naturaql Language Engineering, 10:3-4, 211-225.
- Tufis, D., Ion, R., Ide, N. (2004). Fine-Grained Word Sense Disambiguation Based on Parallel Corpora, Word Alignment, Word Clustering, and Aligned WordNets. Proceedings of COLING'04. Geneva.
- Ide, N., Romary, L. (2004). A Registry of Standard Data Categories for Linguistic Annotation. Proceedings of the Fourth Language Resources and Evaluation Conference (LREC), Lisbon, 135-39.
- Reppen, R., Ide, N. (2004). The American National Corpus: An overview of the First Release. Journal of English Linguistics.
- Ide, N. (2004). Preparation and Analysis of Linguistic Corpora. In Schreibman, S., Siemens, R, Unsworth, J, Eds. A Companion to Digital Humanities. Blackwell.
- Tufis, D., Ion, R., Ide, N. (2004). Fine-Grained Word Sense Disambiguation Based on Parallel Corpora, Word Alignment, Word Clustering and Aligned WordNets. Proceedings of COLING'04, Geneva.
- Ide, N., Romary, L. (2003). Encoding Syntactic Annotation. In Abeillé, Anne (ed.) Treebanks: Building and Using Parsed Corpora, Kluwer, Dordrecht, 281-96.
- Ide, N., Romary, L. (2003). Outline of the International Standard Linguistic Annotation Framework. Proceedings of ACL'03 Workshop on Linguistic Annotation: Getting the Model Right, Sapporo, 1-5.
- Ide, N., Lenci, A., Calzolari, N. (2003). RDF Instantiation of ISLE/MILE Lexical Entries. Proceedings of ACL'03 Workshop on Linguistic Annotation: Getting the Model Right, Sapporo, 30-37.
- Ide, N., Romary, L., de la Clergerie, E. (2003). International Standard for a Linguistic Annotation Framework. Proceedings of HLT-NAACL'03 Workshop on The Software Engineering and Architecture of Language Technology, Edmunton.
- Ide, N. Erjavec, T., Tufis, D. (2002). Sense Discrimination with Parallel Corpora. Proceedings of ACL'02 Workshop on Word Sense Disambiguation: Recent Successes and Future Directions, Philadelphia, 54-60.
- Ide, N., Reppen, R., Suderman, K. (2002). The American National Corpus: More Than the Web Can Provide. Proceedings of the Third Language Resources and Evaluation Conference (LREC), Las Palmas, Canary Islands, Spain, 839-44.
- Ide, N., Romary, L. (2002). Standards for Language Resources. Proceedings of the Third Language Resources and Evaluation Conference (LREC), Las Palmas, Canary Islands, Spain, 839-44.
- Ide, N. Erjavec, T., Tufis, D. (2001). Automatic Sense Tagging Using Parallel Corpora. Proceedings of the Sixth Natural Language Processing Pacific Rim Symposium, Tokyo, 83-9.
- Ide, N., Romary, L. (2001). Standards for Language Resources. Proceedings of the IRCS Workshop on Linguistic Databases, Philapdelphia, 141-9.
- Ide, N., Romary, L. (2001). A Common Framework for Syntactic Annotation. Proceedings of ACL'2001, Toulouse, 298-305.
- Ide, N., Macleod, C. (2001). The American National Corpus: A Standardized Resource of American English. Proceedings of Corpus Linguistics 2001, Lancaster UK.
- Ide, N., Cristea, D. (2000). A Hierarchical Account of Referential Accessibility. Proceedings of ACL'2000, Hong Kong, 416-24.
- Ide, N., Romary, L. (2000) XML Support for Annotated Language Resources. Proceedings of the Workshop on Web-based Language Documentation and Description, Philadelphia, 148-153.
- Ide, N., Kilgarriff, A., Romary, L. (2000). A Formal Model of Dictionary Structure and Content. Proceedings of Euralex 2000, Stuttgart, 113-126.
- Cristea, D., Ide, N., Marcu, D., Tablan, V. (2000). An Empirical Study of the Relation Between Discourse Structure and Reference. Proceedings of COLING'00, Saarbrucken, Germany, 208-214.
- Ide, N. (2000). Cross-lingual sense determination: Can it work? Computers and the Humanities, 34: 1-2, Special Issue on the Proceedings of the SIGLEX/SENSEVAL Workshop, A. Kilgarriff and M. Palmer, eds., 223-34.
- Ide, N. (2000). The XML Framework and Its Implications for the Development of Natural Language Processing Tools. Proceedings of the COLING Workshop on Using Toolsets and Architectures to Build NLP Systems, Luxembourg, 5 August 2000.
- Ide, N., Brew, C. (2000). Requirements, Tools, and Architectures for Annotated Corpora. Proceedings of Data Architectures and Software Support for Large Corpora. Paris: European Language Resources Association, 1-5.
- Ide, N., 2000. The XML Framework and Its Implications for Corpus Access and Use. Proceedings of Data Architectures and Software Support for Large Corpora. Paris: European Language Resources Association, 28-32.
- Ide, N., Bonhomme, P., Romary, L. (2000). XCES: An XML-based Standard for Linguistic Corpora.. Proceedings of the Second Language Resources and Evaluation Conference (LREC), Athens, Greece, 825-30.
- Erjavec, T., Evans, R., Ide, N., Kilgarriff, A. (2000). The CONCEDE Model for Lexical Databases.. Proceedings of the Second Language Resources and Evaluation Conference (LREC), Athens, Greece, 355-62.
- Macleod, C., Ide, N., Grishman, R. (2000). The American National Corpus: Standardized Resources for American English.. Proceedings of the Second Language Resources and Evaluation Conference (LREC), Athens, Greece, 831-36.
- Cristea, D., Ide, N., Marcu, D., Tablan, V. (1999). Discourse structure and co-reference: An empirical study. Proceedings of the ACL99 Workshop on the Relation between Discourse/Dialogue Structure and Reference, College Park, Maryland, 46-53.
- Ide, N. (1999). Parallel translations as sense discriminators. SIGLEX99: Standardizing Lexical Resources, ACL99 Workshop, College Park, Maryland, 52-61.
- Ide, N. (ed.) (1999). Methods and Techniques of Processing (html document). In Frederking, R., Hovy, E., Ide, N. (eds.) Multilingual Information Management: Current Levels and Future Abilities. National Science Foundation report. Available at http://www.cs.cmu.edu/~ref/mlim/.
- Erjavec, T. and Ide, N. (1999) Markup enhancement: Converting CEE dictionaries into TEI, and beyond. In Kiefer, F., Kiss, G. and Pajzs, J. (eds.) Papers in Computational Lexicography COMPLEX'99, Linguistics Institute, Hungarian Academy of Sciences, Budapest, 211-217.
- Welty, C. and Ide, N. (1999). Using the right tools: Enhancing retrieval from marked-up documents. Computers and the Humanities 33:1-2,Special Issue on the Tenth Anniversary of the Text Encoding Initiative, E. Mylonas and A. Renear, eds.
- Ide, N. and V éronis, J. (1998) Word Sense Disambiguation: The State of the Art. Computational Linguistics,24:1, 1-40.
- Cristea, D., Ide, N., Romary, L. (1998) Marking Up Multiple Views of a Text: Discourse and Reference. Proceedings of the First International Language Resources and Evaluation Conference,Granada, Spain, 483-88.
- Cristea, D., Ide, N., Romary, L. (1998). Veins Theory: A Model of Global Discourse Cohesion and Coherence. Proceedings of ACL/COLING98, Montreal, 281-85.
- Dimitrova, L., Erjavec, T., Ide, N. Kaalep, H., Petkevic, V., Tufis, D. (1998) Multext-East: Parallel and Comparable Corpora for Six Central and Eastern European Languages. Proceedings of ACL/COLING98, Montreal, 315-19.
- Fillmore, C., Ide, N., Jurafsky, D., and Macleod, C. (1998). An American National Corpus: A Proposal. Proceedings of the First International Language Resources and Evaluation Conference,Granada, Spain, 965-70.
- Ide, N. (1998) Encoding Linguistic Corpora. Proceedings of the Sixth Workshop on Very Large Corpora,Montreal, 9-17.
- Ide, N. (1998). Corpus Encoding Standard: SGML Guidelines for Encoding Linguistic Corpora. Proceedings of the First International Language Resources and Evaluation Conference,Granada, Spain, 463-70.
- Ide, N., McGraw, T., Welty, C. (1997) Representing TEI Documents in the CLASSIC Knowledge Representation System. Proceedings of the Tenth Anniversary Conference of the Text Encoding Initiative,Providence, RI.
- Tufis, D., Ide, N., Erjavec, T. (1998) Standardized Specifications, Development and Assessment of Large Morpho-syntactic Resources for Six Central and Eastern European Languages. Proceedings of the First International Language Resources and Evaluation Conference,Granada, Spain, 233-40.
- Ide, N. (1997) Final Report on Integration, Testing, and Refinement. Multext Deliverable D1.2F. 32 pp.
- Ide, N., Sperberg-McQueen, C.M. (1997). Toward a Unified Docuverse: Standardizing Document Markup and Access Without Procrustean Bargains. Proceedings of the American Society for Information Science Conference,Washington, D.C., 347-60.
- Barnard, D. and Ide, N. (1997) The Text Encoding Initiative: Flexible and Extensible Document Encoding. Journal of the American Society for Information Science.
- Erjavec, T., Ide, N., Petkevic, V., Véronis, J. (1996) Multext-East: Multilingual Text, Tools and Corpora for Central and Eastern European Languages. Proceedings of the First TELRI European Seminar,87-98.
- Ide, N. (1996) Encoding Standards for Linguistic Corpora. Proceedings of the First TELRI European Seminar,65-78.
- Erjavec, T. and Ide, N. (1996) Corpus Collection and Preparation. COP Project 106 MULTEXT-EAST Deliverable D2.1 F Final Report.
- Priest-Dorman, G,. Erjavec, T., Ide, N. and Petkevic, V. (1996) Corpus Markup. COP Project 106 MULTEXT-EAST Deliverable D2.3F.
- Ide, N. (Ed.) (1996). Multext-East Language-specific Resources. COP Project 106 MULTEXT-EAST Deliverable D1.2.
- Durand, D., Ide, N., LeMaitre, J., Véronis, J. (1995) Internal Standard Formats. Multext Deliverable 1.3.1B. 27 pp.
- Véronis, J., Ide, N. (1996). Considerations for the Reusability of Linguistic Software. EAGLES-MULTEXT Report.
- Véronis, J., Ide, N. (1996). Guidelines for Linguistic Software Development. EAGLES-MULTEXT Report.
- Ide, N., Le Maitre, J., Véronis, J. (1995). Outline of a Model for Lexical Databases. Current Issues in Computational Linguistics: In Honour of Don Walker. Linguistica ComputazionaleIX, X (Pisa, 1995), 283-320. [reprinted from Information Processing and Management., 29, 2, 159-186]
- Ide, N., Véronis, J. (1995). Large Neural Networks for the Resolution of Lexical Ambiguity. In Saint-Dizier, P. and Viegas, E. (Eds.) Computational Lexical Semantics.Natural Language Processing Series, Cambridge University Press, 251-270.
- Ide, N., Sperberg-McQueen, C.M. (1995). The Text Encoding Initiative: History and Context. In Ide, N., Veronis, J. (Eds.) The Text Encoding Initiative: Background and Context.Dordrecht: Kluwer Academic Publishers, 5-15.
- Ide, N., Véronis, J. (1995). Encoding dictionaries. In Ide, N., Veronis, J. (Eds.) The Text Encoding Initiative: Background and Context.Dordrecht: Kluwer Academic Publishers, 167-80.
- Ide, N., Véronis, J. (1994) Knowledge Extraction from Machine-Readable Dictionaries: An Evaluation. In Steffens, P. (Ed.) Machine Translation and the Lexicon. Springer-Verlag, 19-34.
- Ide, N., Véronis, J. (1994). A feature-based data model for lexical databases. In Hockey, S., Ide, N. Research in Humanities Computing IV,Oxford University Press , 193-206.
- Le Maitre, J., Ide, N., Véronis, J. (1994) Modélisation et interrogation de bases de données lexicales. Ingénierie des systemes d'informations,2:1, 57-82.
- Ide, N. (1994) Encoding standards for large text resources: The Text Encoding Initiative. Proceedings of the 15th International Conference on Computational Linguistics, COLING'94, Kyoto, Japan, August 1994, 574-78.
- Ide, N., Véronis, J. (1994). MULTEXT: Multilingual Text Tools and Corpora. Proceedings of the 15th International Conference on Computational Linguistics, COLING'94, Kyoto, Japan, 588-92.
- Ide, N., Véronis, J. (1994). Machine Readable Dictionaries: What have we learned, where do we go? Proceedings of the International Workshop on the Future of Lexical Research,Beijing, China, 137-46.
- Véronis, J., Hirst, D. Espesser, R., Ide, N. (1994). NL and Speech in the MULTEXT Project. Proceedings of the AIII-94 Workshop on Integration of Natural Language and Speech,Seattle, July 1994, 72-8.
- Hirst, D., Ide, N. Véronis, J. (1994) Coding fundamental frequency patterns for multi-lingual synthesis with INTSINT in the MULTEXT project. Conference Proceedings of the 2nd ESCA/IEEE Workshop on Speech Synthesis,New Paltz, NY, September 1994, 77-81.
- Ide, N., Véronis, J. (1993). Extracting knowledge bases from machine-readable dictionaries : Have we wasted our time? KB&KS'93 Workshop,Tokyo, 257-266.
- Ide, N., Véronis, J. (1993). Refining taxonomies extracted from machine-readable dictionaries. In Hockey, S., Ide, N. Research in Humanities Computing II,Oxford University Press, 145-59.
- Ide, N., Walker, D. (1993) Common methodologies in humanities computing and computational linguistics. Computers and the Humanities,Special issue on Common Methodologies in Humanities Computing and Computational Linguistics, N. Ide and D. Walker, Eds., 327-330.
- Ide, N., Véronis, J., Warwick-Armstrong, S., Calzolari, N., (1992). Principles for encoding machine readable dictionaries, EURALEX'92 Proceedings,H. Tommola, K. Varantola, T. Salmi-Tolonen, Y. Schopp, Eds., in Studia Translatologica,Ser. a, 2, Tampere, Finland, 239-246.
- Véronis, J., Ide, N. (1992). A feature-based model for lexical databases. Proceedings of the 14th International Conference on Computational Linguistics, COLING'92,Nantes, France, 588-594.
- Ide, N.M., Véronis, J. (1991). An Assessment of Semantic Information Automatically Extracted from Machine Readable Dictionaries. Fifth Conference of the European Chapter of the Association for Computational Linguistics,Berlin, 227-233.
- Ide, N.M., Véronis, J. (1990). Mapping Dictionaries: A Spreading Activation Approach. Proceedings of the 6th Annual Conference of the Centre for the New Oxford English Dictionary,Waterloo, Ontario, 52-64.
- Ide, N.M., Véronis, J. (1990). Very large neural networks for word sense disambiguation. Proceedings of the 9th European Conference on Artificial Intelligence, ECAI'90,Stockholm, 366-68.
- Ide, N.M., Véronis, J. (1990). Word Sense Disambiguation with Very Large Neural Networks Extracted from Machine Readable Dictionaries. Proceedings of the 13th International Conference on Computational Linguistics, COLING'90,Helsinki, vol. 2, 389-94.