====== Selected Publications (1990-onwards) ======

  * Ide, N. and Suderman, K. (2007). [[http://www.cs.vassar.edu/~ide/papers/LAW.pdf|GrAF: A Graph-based Format for Linguistic Annotations.]] //Proceedings of the Linguistic Annotation Workshop//, held in conjunction with ACL 2007, Prague, June 28-29, 1-8.
  * Ide, N. and Woolner, D. (2007). [[http://www.cs.vassar.edu/~ide/papers/festschrift.pdf|Historical Ontologies.]] In Ahmad, K., Brewster, C., and Stevenson, M. (eds.), //Words and Intelligence II: Essays in Honor of Yorick Wilks,// Springer, 137-152.
  * Ide, N., Romary, L. (2007). [[http://www.cs.vassar.edu/~ide/papers/Elsnet.pdf|Towards International Standards for Language Resources.]] In Dybkjaer, L., Hemsen, H., Minker, W. (Eds.), //Evaluation of Text and Speech Systems,// Springer, 263-84.
  * Ide, N., Wilks, Y. (2006). [[http://www.cs.vassar.edu/~ide/papers/WSDbook.pdf|Making Sense About Sense.]] In Agirre, E., Edmonds, P. (Eds.), //Word Sense Disambiguation: Algorithms and Applications//, Springer, 47-74.
  * Ide, N. (2006). [[http://www.cs.vassar.edu/~ide/papers/CICLing2006.pdf|Making Senses: Bootstrapping Sense-tagged Lists of Semantically-Related Words.]] In Gelbukh, Alexander (Ed.), //Computational Linguistics and Intelligent Text//, Lecture Notes in Computer Science, Springer.
  * Ide, N., Suderman, K. (2006). [[http://www.cs.vassar.edu/~ide/papers/ANC-LREC06.pdf|Integrating Linguistic Resources: The American National Corpus Model.]] //Proceedings of the Fifth Language Resources and Evaluation Conference// (LREC), Genoa, Italy.
  * Ide, N., Romary, L. (2006). [[http://www.cs.vassar.edu/~ide/papers/LAF-LREC06.pdf|Representing Linguistic Corpora and Their Annotations.]] //Proceedings of the Fifth Language Resources and Evaluation Conference// (LREC), Genoa, Italy.
  * Ide, N., Suderman, K. (2006). [[http://www.cs.vassar.edu/~ide/papers/Merging06.pdf|Merging Layered Annotations.]] //Proceedings of Merging and Layering Linguistic Information//, Workshop held in conjunction with LREC 2006, Genoa, Italy.
  * Ide, N., Woolner, D. (2004). [[http://www.cs.vassar.edu/~ide/papers/LREC2004-FDR.pdf|Exploiting Semantic Web Technologies for Intelligent Access to Historical Documents.]] //Proceedings of the Fourth Language Resources and Evaluation Conference// (LREC), Lisbon, 2177-80.
  * Ide, N., Suderman, K. (2004). [[http://www.cs.vassar.edu/~ide/papers/LREC2004-ANC.pdf|The American National Corpus First Release.]] //Proceedings of the Fourth Language Resources and Evaluation Conference// (LREC), Lisbon, 1681-84.
  * Ide, N., Romary, L. (2004). [[http://www.cs.vassar.edu/~ide/papers/JNLE-rev.pdf|International standard for a linguistic annotation framework]]. //Journal of Natural Language Engineering,// 10:3-4, 211-225.
  * Ide, N., Romary, L. (2004). [[http://www.cs.vassar.edu/~ide/papers/LREC2004-DCR.pdf|A Registry of Standard Data Categories for Linguistic Annotation]]. //Proceedings of the Fourth Language Resources and Evaluation Conference// (LREC), Lisbon, 135-39.
  * Reppen, R., Ide, N. (2004). [[http://www.cs.vassar.edu/~ide/papers/JEL.pdf|The American National Corpus: An overview of the First Release.]] //Journal of English Linguistics//.
  * Ide, N. (2004). [[http://www.cs.vassar.edu/~ide/papers/Humbook.pdf|Preparation and Analysis of Linguistic Corpora]]. In Schreibman, S., Siemens, R., Unsworth, J. (Eds.), [[http://www.blackwellpublishing.com/book.asp?ref=1405103213|A Companion to Digital Humanities]]. Blackwell.
  * Tufis, D., Ion, R., Ide, N. (2004). Fine-Grained Word Sense Disambiguation Based on Parallel Corpora, Word Alignment, Word Clustering and Aligned WordNets. //Proceedings of COLING'04//, Geneva.
  * Ide, N., Romary, L. (2003). [[http://www.cs.vassar.edu/~ide/papers/ide-romary.pdf|Encoding Syntactic Annotation.]] In Abeillé, Anne (ed.) //Treebanks: Building and Using Parsed Corpora,// Kluwer, Dordrecht, 281-96.
  * Ide, N., Romary, L. (2003). [[http://www.cs.vassar.edu/~ide/papers/ACL2003-ws-LAF.pdf|Outline of the International Standard Linguistic Annotation Framework.]] //Proceedings of ACL'03 Workshop on Linguistic Annotation: Getting the Model Right,// Sapporo, 1-5.
  * Ide, N., Lenci, A., Calzolari, N. (2003). [[http://www.cs.vassar.edu/~ide/papers/ACL2003-ws-ISLE.pdf|RDF Instantiation of ISLE/MILE Lexical Entries.]] //Proceedings of ACL'03 Workshop on Linguistic Annotation: Getting the Model Right,// Sapporo, 30-37.
  * Ide, N., Romary, L., de la Clergerie, E. (2003). [[http://www.cs.vassar.edu/~ide/papers/ide-romary-clergerie.pdf|International Standard for a Linguistic Annotation Framework.]] //Proceedings of HLT-NAACL'03 Workshop on The Software Engineering and Architecture of Language Technology//, Edmonton.
  * Ide, N., Erjavec, T., Tufis, D. (2002). [[http://www.cs.vassar.edu/~ide/papers/senseval02.pdf|Sense Discrimination with Parallel Corpora.]] //Proceedings of ACL'02 Workshop on Word Sense Disambiguation: Recent Successes and Future Directions,// Philadelphia, 54-60.
  * Ide, N., Reppen, R., Suderman, K. (2002). [[http://www.cs.vassar.edu/~ide/papers/anc-lrec02.pdf|The American National Corpus: More Than the Web Can Provide.]] //Proceedings of the Third Language Resources and Evaluation Conference// (LREC), Las Palmas, Canary Islands, Spain, 839-44.
  * Ide, N., Romary, L. (2002). [[http://www.cs.vassar.edu/~ide/papers/ide-romary-LREC02.pdf|Standards for Language Resources.]] //Proceedings of the Third Language Resources and Evaluation Conference// (LREC), Las Palmas, Canary Islands, Spain, 839-44.
  * Ide, N., Erjavec, T., Tufis, D. (2001). [[http://www.cs.vassar.edu/~ide/papers/NLPRS01.pdf|Automatic Sense Tagging Using Parallel Corpora.]] //Proceedings of the Sixth Natural Language Processing Pacific Rim Symposium,// Tokyo, 83-9.
  * Ide, N., Romary, L. (2001). [[http://www.cs.vassar.edu/~ide/papers/IRCS01.pdf|Standards for Language Resources.]] //Proceedings of the IRCS Workshop on Linguistic Databases,// Philadelphia, 141-9.
  * Ide, N., Romary, L. (2001). [[http://www.cs.vassar.edu/~ide/papers/acl2001.pdf|A Common Framework for Syntactic Annotation.]] //Proceedings of ACL'2001,// Toulouse, 298-305.
  * Ide, N., Macleod, C. (2001). [[http://www.cs.vassar.edu/~ide/papers/CorpLing01.pdf|The American National Corpus: A Standardized Resource of American English.]] //Proceedings of Corpus Linguistics 2001,// Lancaster UK.
  * Ide, N., Cristea, D. (2000). [[http://www.cs.vassar.edu/~ide/papers/ACL2000.pdf|A Hierarchical Account of Referential Accessibility.]] //Proceedings of ACL'2000,// Hong Kong, 416-24.
  * Ide, N., Romary, L. (2000). [[http://www.cs.vassar.edu/~ide/papers/Expl2000.pdf|XML Support for Annotated Language Resources.]] //Proceedings of the Workshop on Web-based Language Documentation and Description,// Philadelphia, 148-153.
  * Ide, N., Kilgarriff, A., Romary, L. (2000). [[http://www.cs.vassar.edu/~ide/papers/euralex00.final.ps|A Formal Model of Dictionary Structure and Content.]] //Proceedings of Euralex 2000,// Stuttgart, 113-126.
  * Cristea, D., Ide, N., Marcu, D., Tablan, V. (2000). [[http://www.cs.vassar.edu/~ide/papers/coling00.pdf|An Empirical Study of the Relation Between Discourse Structure and Reference.]] //Proceedings of COLING'00,// Saarbrucken, Germany, 208-214.
  * Ide, N. (2000). [[http://www.cs.vassar.edu/~ide/papers/senseval.pdf|Cross-lingual sense determination: Can it work?]] //Computers and the Humanities,// 34: 1-2, Special Issue on the Proceedings of the SIGLEX/SENSEVAL Workshop, A. Kilgarriff and M. Palmer, eds., 223-34.
  * Ide, N. (2000). [[http://www.cs.vassar.edu/~ide/papers/coling00-ws-final.pdf|The XML Framework and Its Implications for the Development of Natural Language Processing Tools.]] //Proceedings of the COLING Workshop on Using Toolsets and Architectures to Build NLP Systems//, Luxembourg, 5 August 2000.
  * Ide, N., Brew, C. (2000). [[http://www.cs.vassar.edu/~ide/papers/ide-brew-lrec2000.pdf|Requirements, Tools, and Architectures for Annotated Corpora.]] //Proceedings of Data Architectures and Software Support for Large Corpora.// Paris: European Language Resources Association, 1-5.
  * Ide, N. (2000). [[http://www.cs.vassar.edu/~ide/papers/xml-lrec00.pdf|The XML Framework and Its Implications for Corpus Access and Use.]] //Proceedings of Data Architectures and Software Support for Large Corpora.// Paris: European Language Resources Association, 28-32.
  * Ide, N., Bonhomme, P., Romary, L. (2000). [[http://www.cs.vassar.edu/~ide/papers/xces-lrec00.pdf|XCES: An XML-based Standard for Linguistic Corpora.]] //Proceedings of the Second Language Resources and Evaluation Conference// (LREC), Athens, Greece, 825-30.
  * Erjavec, T., Evans, R., Ide, N., Kilgarriff, A. (2000). [[http://www.cs.vassar.edu/~ide/papers/lrec-cnc2.pdf|The CONCEDE Model for Lexical Databases.]] //Proceedings of the Second Language Resources and Evaluation Conference// (LREC), Athens, Greece, 355-62.
  * Macleod, C., Ide, N., Grishman, R. (2000). [[http://www.cs.vassar.edu/~ide/papers/anc-lrec00.pdf|The American National Corpus: Standardized Resources for American English.]] //Proceedings of the Second Language Resources and Evaluation Conference// (LREC), Athens, Greece, 831-36.
  * Cristea, D., Ide, N., Marcu, D., Tablan, V. (1999). [[http://www.cs.vassar.edu/~ide/papers/acl99.pdf|Discourse structure and co-reference: An empirical study.]] //Proceedings of the ACL99 Workshop on the Relation between Discourse/Dialogue Structure and Reference,// College Park, Maryland, 46-53.
  * Ide, N. (1999). [[http://www.cs.vassar.edu/~ide/papers/siglex99.pdf|Parallel translations as sense discriminators.]] //SIGLEX99: Standardizing Lexical Resources//, ACL99 Workshop, College Park, Maryland, 52-61.
  * Ide, N. (ed.) (1999). [[http://www.cs.cmu.edu/~ref/mlim/chapter6.html|Methods and Techniques of Processing]] (html document). In Frederking, R., Hovy, E., Ide, N. (eds.) //Multilingual Information Management: Current Levels and Future Abilities.// National Science Foundation report. Available at http://www.cs.cmu.edu/~ref/mlim/.
  * Erjavec, T. and Ide, N. (1999). [[http://www.cs.vassar.edu/~ide/papers/complex99-final.pdf|Markup enhancement: Converting CEE dictionaries into TEI, and beyond.]] In Kiefer, F., Kiss, G. and Pajzs, J. (eds.) //Papers in Computational Lexicography COMPLEX'99,// Linguistics Institute, Hungarian Academy of Sciences, Budapest, 211-217.
  * Welty, C. and Ide, N. (1999). [[http://www.cs.vassar.edu/~ide/papers/Welty&Ide.pdf|Using the right tools: Enhancing retrieval from marked-up documents.]] //Computers and the Humanities 33:1-2,// Special Issue on the Tenth Anniversary of the Text Encoding Initiative, E. Mylonas and A. Renear, eds.
  * Ide, N. and Véronis, J. (1998). [[http://www.cs.vassar.edu/~ide/papers/idevero.pdfd|Word Sense Disambiguation: The State of the Art.]] //Computational Linguistics,// 24:1, 1-40.
  * Cristea, D., Ide, N., Romary, L. (1998). [[http://www.cs.vassar.edu/~ide/papers/discourse.granada.pdf|Marking Up Multiple Views of a Text: Discourse and Reference]]. //Proceedings of the First International Language Resources and Evaluation Conference,// Granada, Spain, 483-88.
  * Cristea, D., Ide, N., Romary, L. (1998). [[http://www.cs.vassar.edu/~ide/papers/coling98.pdf|Veins Theory: A Model of Global Discourse Cohesion and Coherence]]. //Proceedings of ACL/COLING98,// Montreal, 281-85.
  * Dimitrova, L., Erjavec, T., Ide, N., Kaalep, H., Petkevic, V., Tufis, D. (1998). [[http://www.cs.vassar.edu/~ide/papers/ME.coling.pdf|Multext-East: Parallel and Comparable Corpora for Six Central and Eastern European Languages.]] //Proceedings of ACL/COLING98,// Montreal, 315-19.
  * Fillmore, C., Ide, N., Jurafsky, D., and Macleod, C. (1998). [[http://www.cs.vassar.edu/~ide/papers/anc.final.pdf|An American National Corpus: A Proposal]]. //Proceedings of the First International Language Resources and Evaluation Conference,// Granada, Spain, 965-70.
  * Ide, N. (1998). [[http://www.cs.vassar.edu/~ide/papers/ces.wvlc.pdf|Encoding Linguistic Corpora]]. //Proceedings of the Sixth Workshop on Very Large Corpora,// Montreal, 9-17.
  * Ide, N. (1998). [[http://www.cs.vassar.edu/~ide/papers/CES.granada.pdf|Corpus Encoding Standard: SGML Guidelines for Encoding Linguistic Corpora.]] //Proceedings of the First International Language Resources and Evaluation Conference,// Granada, Spain, 463-70.
  * Ide, N., McGraw, T., Welty, C. (1997). [[http://www.cs.vassar.edu/~ide/papers/tei10.pdf|Representing TEI Documents in the CLASSIC Knowledge Representation System.]] //Proceedings of the Tenth Anniversary Conference of the Text Encoding Initiative,// Providence, RI.
  * Tufis, D., Ide, N., Erjavec, T. (1998). [[http://www.cs.vassar.edu/~ide/papers/LEX-Gran.ps|Standardized Specifications, Development and Assessment of Large Morpho-syntactic Resources for Six Central and Eastern European Languages]]. //Proceedings of the First International Language Resources and Evaluation Conference,// Granada, Spain, 233-40.
  * Ide, N. (1997). [[http://www.cs.vassar.edu/~ide/papers/MultextD1.2.ps|Final Report on Integration, Testing, and Refinement.]] Multext Deliverable D1.2F. 32 pp.
  * Ide, N., Sperberg-McQueen, C.M. (1997). [[http://www.cs.vassar.edu/~ide/papers/assisfmt.ps|Toward a Unified Docuverse: Standardizing Document Markup and Access Without Procrustean Bargains]]. //Proceedings of the American Society for Information Science Conference,// Washington, D.C., 347-60.
  * Barnard, D. and Ide, N. (1997). [[http://www.cs.vassar.edu/~ide/papers/JASIS.ps|The Text Encoding Initiative: Flexible and Extensible Document Encoding.]] //Journal of the American Society for Information Science.//
  * Erjavec, T., Ide, N., Petkevic, V., Véronis, J. (1996). [[http://www.cs.vassar.edu/~ide/papers/Tihanyi.ps|Multext-East: Multilingual Text, Tools and Corpora for Central and Eastern European Languages]]. //Proceedings of the First TELRI European Seminar,// 87-98.
  * Ide, N. (1996). [[http://www.cs.vassar.edu/~ide/papers/tihanyi.tei.pdf|Encoding Standards for Linguistic Corpora.]] //Proceedings of the First TELRI European Seminar,// 65-78.
  * Erjavec, T. and Ide, N. (1996). [[http://www.cs.vassar.edu/~ide/papers/mte-D21F.ps|Corpus Collection and Preparation.]] COP Project 106 MULTEXT-EAST Deliverable D2.1 F Final Report.
  * Priest-Dorman, G., Erjavec, T., Ide, N. and Petkevic, V. (1996). [[http://nl.ijs.si/ME/ftp/doc/mteD23/mte-D23F/|Corpus Markup.]] COP Project 106 MULTEXT-EAST Deliverable D2.3F.
  * Ide, N. (Ed.) (1996). [[http://www.cs.vassar.edu/~ide/papers/ME-D1.2.html|Multext-East Language-specific Resources.]] COP Project 106 MULTEXT-EAST Deliverable D1.2.
  * Durand, D., Ide, N., Le Maitre, J., Véronis, J. (1995). [[http://www.cs.vassar.edu/~ide/papers/MultextD1.3.1B.ps|Internal Standard Formats.]] Multext Deliverable 1.3.1B. 27 pp.
  * Véronis, J., Ide, N. (1996). [[http://www.lpl.univ-aix.fr/projects/multext/LSD/LSD1.html|Considerations for the Reusability of Linguistic Software.]] EAGLES-MULTEXT Report.
  * Véronis, J., Ide, N. (1996). [[http://www.lpl.univ-aix.fr/projects/multext/LSD/LSD2.html|Guidelines for Linguistic Software Development.]] EAGLES-MULTEXT Report.
  * Ide, N., Le Maitre, J., Véronis, J. (1995). [[http://www.cs.vassar.edu/~ide/papers/IPM.pdf|Outline of a Model for Lexical Databases.]] //Current Issues in Computational Linguistics: In Honour of Don Walker. Linguistica Computazionale// IX, X (Pisa, 1995), 283-320. [reprinted from Information Processing and Management, 29, 2, 159-186]
  * Ide, N., Véronis, J. (1995). [[http://www.cs.vassar.edu/~ide/papers/Veronis&Ide.pdf|Large Neural Networks for the Resolution of Lexical Ambiguity]]. In Saint-Dizier, P. and Viegas, E. (Eds.) //Computational Lexical Semantics.// Natural Language Processing Series, Cambridge University Press, 251-270.
  * Ide, N., Sperberg-McQueen, C.M. (1995). The Text Encoding Initiative: History and Context. In Ide, N., Véronis, J. (Eds.) //The Text Encoding Initiative: Background and Context.// Dordrecht: Kluwer Academic Publishers, 5-15.
  * Ide, N., Véronis, J. (1995). [[http://www.cs.vassar.edu/~ide/papers/Chumdict.pdf|Encoding dictionaries.]] In Ide, N., Véronis, J. (Eds.) //The Text Encoding Initiative: Background and Context.// Dordrecht: Kluwer Academic Publishers, 167-80.
  * Ide, N., Véronis, J. (1994). [[http://www.cs.vassar.edu/~ide/papers/Heidelberg.ps|Knowledge Extraction from Machine-Readable Dictionaries: An Evaluation]]. In Steffens, P. (Ed.) //Machine Translation and the Lexicon.// Springer-Verlag, 19-34.
  * Ide, N., Véronis, J. (1994). [[http://www.cs.vassar.edu/~ide/papers/COLING.pdf|A feature-based data model for lexical databases]]. In Hockey, S., Ide, N. //Research in Humanities Computing IV,// Oxford University Press, 193-206.
  * Le Maitre, J., Ide, N., Véronis, J. (1994). [[http://www.cs.vassar.edu/~ide/papers/ISI.ps|Modélisation et interrogation de bases de données lexicales]]. //Ingénierie des systèmes d'informations,// 2:1, 57-82.
  * Ide, N. (1994). [[http://www.cs.vassar.edu/~ide/papers/TEI.ps|Encoding standards for large text resources: The Text Encoding Initiative]]. //Proceedings of the 15th International Conference on Computational Linguistics, COLING'94,// Kyoto, Japan, August 1994, 574-78.
  * Ide, N., Véronis, J. (1994). [[http://www.cs.vassar.edu/~ide/papers/MULTEXT.pdf|MULTEXT: Multilingual Text Tools and Corpora]]. //Proceedings of the 15th International Conference on Computational Linguistics, COLING'94,// Kyoto, Japan, 588-92.
  * Ide, N., Véronis, J. (1994). [[http://www.cs.vassar.edu/~ide/papers/Beijing.pdf|Machine Readable Dictionaries: What have we learned, where do we go?]] //Proceedings of the International Workshop on the Future of Lexical Research,// Beijing, China, 137-46.
  * Véronis, J., Hirst, D., Espesser, R., Ide, N. (1994). [[http://www.cs.vassar.edu/~ide/papers/AAAI.pdf|NL and Speech in the MULTEXT Project]]. //Proceedings of the AAAI-94 Workshop on Integration of Natural Language and Speech,// Seattle, July 1994, 72-8.
  * Hirst, D., Ide, N., Véronis, J. (1994). Coding fundamental frequency patterns for multi-lingual synthesis with INTSINT in the MULTEXT project. //Conference Proceedings of the 2nd ESCA/IEEE Workshop on Speech Synthesis,// New Paltz, NY, September 1994, 77-81.
  * Ide, N., Véronis, J. (1993). [[http://www.cs.vassar.edu/~ide/papers/Japan.pdf|Extracting knowledge bases from machine-readable dictionaries: Have we wasted our time?]] //KB&KS'93 Workshop,// Tokyo, 257-266.
  * Ide, N., Véronis, J. (1993). [[http://www.cs.vassar.edu/~ide/papers/RHC3.pdfs|Refining taxonomies extracted from machine-readable dictionaries.]] In Hockey, S., Ide, N. //Research in Humanities Computing II,// Oxford University Press, 145-59.
  * Ide, N., Walker, D. (1993). Common methodologies in humanities computing and computational linguistics. //Computers and the Humanities,// Special issue on Common Methodologies in Humanities Computing and Computational Linguistics, N. Ide and D. Walker, Eds., 327-330.
  * Ide, N., Véronis, J., Warwick-Armstrong, S., Calzolari, N. (1992). [[http://www.cs.vassar.edu/~ide/papers/Euralex92.pdf|Principles for encoding machine readable dictionaries]]. //EURALEX'92 Proceedings,// H. Tommola, K. Varantola, T. Salmi-Tolonen, Y. Schopp, Eds., in //Studia Translatologica,// Ser. a, 2, Tampere, Finland, 239-246.
  * Véronis, J., Ide, N. (1992). [[http://www.cs.vassar.edu/~ide/papers/COLING'92.LDB.pdf|A feature-based model for lexical databases.]] //Proceedings of the 14th International Conference on Computational Linguistics, COLING'92,// Nantes, France, 588-594.
  * Ide, N.M., Véronis, J. (1991). [[http://www.cs.vassar.edu/~ide/papers/EACL90.pdf|An Assessment of Semantic Information Automatically Extracted from Machine Readable Dictionaries]]. //Fifth Conference of the European Chapter of the Association for Computational Linguistics,// Berlin, 227-233.
  * Ide, N.M., Véronis, J. (1990). [[http://www.cs.vassar.edu/~ide/papers/NewOED'90.pdf|Mapping Dictionaries: A Spreading Activation Approach.]] //Proceedings of the 6th Annual Conference of the Centre for the New Oxford English Dictionary,// Waterloo, Ontario, 52-64.
  * Ide, N.M., Véronis, J. (1990). [[http://www.cs.vassar.edu/~ide/papers/ECAI90.pdf|Very large neural networks for word sense disambiguation.]] //Proceedings of the 9th European Conference on Artificial Intelligence, ECAI'90,// Stockholm, 366-68.
  * Ide, N.M., Véronis, J. (1990). [[http://www.cs.vassar.edu/~ide/papers/coling90.pdf|Word Sense Disambiguation with Very Large Neural Networks Extracted from Machine Readable Dictionaries.]] //Proceedings of the 13th International Conference on Computational Linguistics, COLING'90,// Helsinki, vol. 2, 389-94.