Nédellec C, Sauvion C, Bossy R, Borovikova M, Deléger L. TaeC: A manually annotated text dataset for trait and phenotype extraction and entity linking in wheat breeding literature. Plos one. 2024 Jun 13;19(6):e0305475.
Rumeau M, Fenaille F, Girard A, Loux V, Ba M, Nédellec C, Deléger L, Bossy R, Aubin S, Knudsen C, Combes S. MilkOligoThesaurus, a dataset of mammalian milk oligosaccharide synonyms. Data in Brief. 2024 Jun 1;54:110404.
Dérozier S, Bossy R, Deléger L, Ba M, Chaix E, Harlé O, Loux V, Falentin H, Nédellec C. Omnicrobe, an open-access database of microbial habitats and phenotypes using a comprehensive text mining and data fusion approach. PloS one. 2023 Jan 20;18(1):e0272473.
Tang A, Bossy R, Deléger L, Nédellec C, Zweigenbaum P. Exploitation de plongements de graphes pour l'extraction de relations biomédicales. In 18e Conférence en Recherche d'Information et Applications--16e Rencontres Jeunes Chercheurs en RI--30e Conférence sur le Traitement Automatique des Langues Naturelles--25e Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues 2023 Jun 5 (pp. 298-310). ATALA.
Tang A, Deléger L, Bossy R, Zweigenbaum P, Nédellec C. Do syntactic trees enhance Bidirectional Encoder Representations from Transformers (BERT) models for chemical–drug relation extraction?. Database. 2022 Jan 1;2022:baac070.
A. Ferré, L. Deléger, R. Bossy, P. Zweigenbaum, C. Nédellec. C-Norm: a neural approach to few-shot entity normalization. BMC Bioinformatics, BioMed Central, 2020, 21 (S23), doi:10.1186/s12859-020-03886-8
A. Ferré, R. Bossy, M. Ba, L. Deléger, T. Lavergne, P. Zweigenbaum, C. Nédellec. Handling Entity Normalization with no Annotated Corpus: Weakly Supervised Methods Based on Distributional Representation and Ontological Information. In Proceedings of the 12th Conference on Language Resources and Evaluation, Marseille, France, 2020.
R. Bossy, L. Deléger, E. Chaix, M. Ba, C. Nédellec. Bacteria biotope at BioNLP Open Shared Tasks 2019. In Proceedings of the 5th workshop on BioNLP open shared tasks (pp. 121-131). 2019.
E. Chaix, L. Deléger, R. Bossy, C. Nédellec. Text mining tools for extracting information about microbial biodiversity in food. Food Microbiology, 81, 63-75, doi:10.1016/j.fm.2018.04.011
L. Campillos, L. Deléger, C. Grouin, T. Hamon, AL. Ligozat, A. Névéol. A French clinical corpus with comprehensive semantic annotations: development of the Medical Entity and Relation LIMSI annOtated Text corpus (MERLOT). Language Resources and Evaluation, 52(2), 571-601. (2018). doi:10.1007/s10579-017-9382-y
L. Deléger, L. Campillos, A. L. Ligozat, A. Névéol. Design of an extensive information representation scheme for clinical narratives. Journal of biomedical semantics, 8(1), 37. (2017).
L. Deléger, R. Bossy, E. Chaix, M. Ba, A. Ferré, P. Bessieres, C. Nédellec. Overview of the bacteria biotope task at BioNLP shared task. In BioNLP Shared Task, p. 101. The Association for Computational Linguistics, 2016.
E. Chaix, B. Dubreucq, A. Fatihi, D. Valsamou, R. Bossy, M. Ba, L. Deléger, P. Zweigenbaum, P. Bessieres, L. Lepiniec, C. Nédellec. Overview of the regulatory network of plant seed development (seedev) task at the bionlp shared task 2016. In Proceedings of the 4th BioNLP shared task workshop. Berlin: Association for Computational Linguistic, pp. 1-11. 2016
Y. Ni, J. Wright, J. Perentesis, T. Lingren, L. Deléger, M. Kaiser, I. Kohane, I. Solti. Increasing the efficiency of trial-patient matching: automated clinical trial eligibility pre-screening for pediatric oncology patients. BMC medical informatics and decision making 15, no. 1 (2015): 28.
A. Rosier, P. Mabo, L. Temal, P. Van Hille, O. Dameron, L. Deléger, C. Grouin, P. Zweigenbaum, J. Jacques, E. Chazard, L. Laporte. Personalized and automated remote monitoring of atrial fibrillation. Europace (2015): euv234.
L. Deléger, C. Grouin, R. Bossy. Hybrid approaches for the DNER task at BioCreative V. In Proceedings of the fifth BioCreative challenge evaluation workshop, 2015
L. Deléger, T. Lingren, Y. Ni, M. Kaiser, L. Stoutenborough, K. Marsolo, M. Kouril, K. Molnar, I. Solti. Preparing an Annotated Gold Standard Corpus to Share with Extramural Investigators for De-identification Research. Journal of Biomedical Informatics (JBI), 2014 doi: 10.1016/j.jbi.2014.01.014. Impact Factor 2.061 [PDF]
L. Deléger, AL. Ligozat, C. Grouin, P. Zweigenbaum, A. Névéol. Annotation of specialized corpora using a comprehensive entity and relation scheme. Language Resources and Evaluation Conference (LREC) 2014.
L. Deléger, A. Névéol. Identification automatique de zones dans des documents pour la constitution d'un corpus médical en français. Traitement Automatique du Langage Naturel (TALN). 2014:568-573
L. Deléger, C. Grouin, A. Névéol. Automatic Content Extraction for Designing a French Clinical Corpus. American Medical Informatics Association Annual Symposium Proceedings (AMIA). 2014
C. Grouin, L. Deléger, JB. Escudié, G. Groisy, AS. Jannot, B. Rance, X. Tannier, A. Névéol. How to de-identify a large clinical corpus in 10 days. American Medical Informatics Association Annual Symposium Proceedings (AMIA). 2014
L. Deléger, H. Brodzinski, H. Zhai, Q. Li, T. Lingren, E. S. Kirkendall, E. Alessandrini, I. Solti. Developing and Evaluating an Automated Appendicitis Risk Stratification Algorithm for Pediatric Patients in the Emergency Department. Journal of the American Medical Informatics Association (JAMIA), Published Online First October 15, 2013 doi: 10.1136/amiajnl-2013-001962. Impact Factor 3.571 [PDF]
T. Lingren, L. Deléger, K. Molnar, H. Zhai, J. Meinzen-Derr, M. Kaiser, L. Stoutenborough, Q. Li, I. Solti. Evaluating the impact of pre-annotation on annotation speed and potential bias: NLP gold standard development for clinical named entity recognition in clinical trial announcements. Journal of the American Medical Informatics Association (JAMIA), [Epub ahead of print], September 3, 2013. doi:10.1136/amiajnl-2013-001837. Impact Factor 3.571 [PDF]
L. Deléger, B. Cartoni, P. Zweigenbaum. Paraphrase Detection in Monolingual Specialized/Lay Comparable Corpora. BUCC: Building and Using Comparable Corpora. Springer-Verlag Berlin And Heidelberg Gmbh & Co. K, (Editors: Sharoff, S.; Rapp, R.; Zweigenbaum, P.; Fung, P.). 2013; 223-241.
H. Zhai, T. Lingren, L. Deléger, Q. Li, M. Kaiser, L. Stoutenborough, I. Solti. Web 2.0 based crowdsourcing for high quality gold standard development for clinical Natural Language Processing. Journal of Medical Internet Research (JMIR), 2013 Apr 2;15(4):e73. doi: 10.2196/jmir.2426. Impact Factor 4.7 [PDF]
Q. Li, L. Deléger, T. Lingren, H. Zhai, M. Kaiser, L. Stoutenborough, A.G. Jegga, K.B. Cohen, I. Solti. Mining FDA drug labels for medical conditions. BMC Medical Informatics and Decision Making 2013, 13:53 doi:10.1186/1472-6947-13-53. Impact Factor 1.6 [PDF]
J. Pestian, L. Deléger, G. Savova, J. Dexheimer, I. Solti. Natural Language Processing - The Basics. Pediatric Biomedical Informatics: Computer Applications in Pediatric Research (Translational Bioinformatics). Springer Verlag, (Editor: J. Hutton), ISBN-10: 9400751486 | ISBN-13: 978- 9400751484 | Edition: 2013
G. Savova, L. Deléger, I. Solti, J. Pestian, J. Dexheimer. Natural Language Processing - Applications in pediatric research. Pediatric Biomedical Informatics: Computer Applications in Pediatric Research (Translational Bioinformatics). Springer Verlag, (Editor: J. Hutton), ISBN-10: 9400751486 | ISBN-13: 978-9400751484 | Edition: 2013
W. Chapman, D. Hilert, S. Velupillai, M. Kvist, M. Skeppstedt, B. Chapman, M. Conway, M. Tharp, D. Mowery, L. Deléger. Extending the NegEx Lexicon for Multiple Languages. Proceedings of the 14th World Congress on Medical and Health Informatics (MedInfo 2013). [PDF]
Q. Li, H. Zhai, L. Deléger, T. Lingren, M. Kaiser, L. Stoutenborough, I. Solti. A Sequence Labeling Approach to Link Medications and Their Attributes in Clinical Notes and Clinical Trial Announcements for Information Extraction. Journal of the American Medical Informatics Association (JAMIA). December 25, 2012. [Epub ahead of print] doi:10.1136/amiajnl-2012-001487. Impact Factor 3.571 [PDF]
L. Deléger, K. Molnar, G. Savova, F. Xia, T. Lingren, Q. Li, K. Marsolo, A. Jegga, M. Kaiser, L. Stoutenborough, I. Solti. Large Scale Evaluation of Automated Clinical Note De-identification and its Impact on Information Extraction. Journal of the American Medical Informatics Association (JAMIA). August 2, 2012. [Epub ahead of print] doi:10.1136/amiajnl-2012-001012. Impact Factor 3.571 [PDF]
L. Deléger, Q. Li, T. Lingren, M. Kaiser, K. Molnar, L. Stoutenborough, M. Kouril, K. Marsolo, I. Solti. Building Gold Standard Corpora for Medical Natural Language Processing Tasks. American Medical Informatics Association Annual Symposium Proceedings. 2012;144-153. [PDF]
L. Deléger & C. Grouin. Detecting Negation of Medical Problems in French Clinical Notes. Proceedings of the 2nd ACM International Health Informatics Symposium, 2012;697-702.
AL. Minard, AL. Ligozat, A. Ben Abacha, D. Bernhard, B. Cartoni, L. Deléger, B. Grau, S. Rosset, P. Zweigenbaum & C. Grouin. Hybrid Methods for Improving Information Access in Clinical Documents: Concept, Assertion, and Relation Identification. Journal of the American Medical Informatics Association (JAMIA). 2011. http://jamia.bmj.com/content/18/5/588. Impact Factor 3.571 [PDF]
Burgun, A. Rosier, L. Temal, J. Jacques, R. Messai, L. Duchemin, L. Deléger, C. Grouin, P. Van Hille, P. Zweigenbaum, R. Beuscart, D. Delerue, O. Dameron, P. Mabo & C. Henry. Aide à la décision en télécardiologie par une approche basée ontologie et centrée patient. In: Ingénierie et Recherche Biomédicale, 32(2). 2011. Numéro spécial ANR TecSan. http://dx.doi.org/10.1016/j.irbm.2011.01.035
C. Wu, F. Xia, L. Deléger, I. Solti. Statistical Machine Translation for Biomedical Text: Are We There Yet? Proceedings of American Medical Informatics Association Annual Fall Symposium (AMIA) 2011;1290-1299. [PDF]
C. Grouin, L. Deléger, A. Rosier, L. Temal, O. Dameron, P Van Hille, A Burgun, P Zweigenbaum. Automatic computation of CH2DS2-VASc score: Information extraction from clinical texts for thromboembolism risk assessment. Proceedings of the American Medical Informatics Association Annual Fall Symposium (AMIA) 2011;501-510. [PDF]
O. Dameron, P Van Hille, L Temal, A Rosier, L Deléger, C Grouin, P Zweigenbaum, A Burgun. Comparison of OWL and SWRL-Based Ontology Modeling Strategies for the Determination of Pacemaker Alerts Severity. Proceedings of the American Medical Informatics Association Annual Fall Symposium (AMIA) 2011.
C. Grouin, L. Deléger, B. Cartoni, S. Rosset, P. Zweigenbaum. Accès au contenu sémantique en langue de spécialité : extraction des prescriptions et concepts médicaux. Proceedings of TALN 2011(Traitement automatique des langues naturelles), Montpellier, 24-28 juin 2011. [PDF]
B. Cartoni & L. Deléger. Découverte de patrons paraphrastiques en corpus comparable: une approche basée sur les n-grammes. Proceedings of TALN 2011 (Traitement automatique des langues naturelles), Montpellier, 24-28 juin 2011. [PDF]
L. Deléger, C. Grouin & P. Zweigenbaum. Extracting medical information from narrative patient records: the case of medication-related information. Journal of the American Medical Informatics Association (JAMIA), 17:555-558, 2010. http://dx.doi.org/10.1136/jamia.2010.003962. Impact Factor 3.571 [PDF]
L. Deléger, T. Merabti, T. Lecrocq, M. Joubert, P. Zweigenbaum & S. Darmoni. A twofold strategy for translating a medical terminology into French. Proceedings of American Medical Informatics Association Annual Fall Symposium (AMIA) 2010;152–156. [PDF]
L. Deléger, C. Grouin & P. Zweigenbaum. Extracting medication information from French clinical texts. Proceedings of MEDINFO 2010, Studies in Health Technology and Informatics. 2010;160(Pt 2):949-53. [PDF]
L. Deléger & P. Zweigenbaum. Identifying paraphrases between technical and lay corpora. Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10), Valletta, Malta, May 2010. [PDF]
O. Galibert, L. Quintard, S. Rosset, P. Zweigenbaum, C. Nédellec, S. Aubin, L. Gillard, J-P. Raysz, D. Pois, X. Tannier, L. Deléger & D. Laurent. Named and specific entity detection in varied data: the Quæro named entity baseline evaluation. Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10), Valletta, Malta, May 2010. [PDF]
L. Deléger & B. Cartoni. Adjectifs relationnels et langue de spécialité : vérification d’une hypothèse linguistique en corpus comparable médical. Proceedings of TALN 2010 (Traitement automatique des langues naturelles), Montréal, July 2010. [PDF]
C. Grouin, A. Ben Abacha, D. Bernhard, B. Cartoni, L. Deléger, B. Grau, A-L. Ligozat, A-L. Minard, S. Rosset & P. Zweigenbaum. CARAMBA: Concept, assertion and relation annotation using machine-learning based approaches. i2b2/VA Challenge Workshop on Natural Language Processing for Clinical Data, 2010.
L. Deléger, M. Merkel, & P. Zweigenbaum. Translating medical terminologies through word alignment in parallel text corpora. Journal of Biomedical Informatics (JBI), 42(4):692-701, 2009. Epub 2009 Mar 9, http://dx.doi.org/10.1016/j.jbi.2009.03.002. Impact Factor 2.131 [PDF]
L. Deléger, F. Namer, & P. Zweigenbaum. Morphosemantic parsing of medical compound words: transferring a French analyzer to English. International Journal of Medical Informatics (IJMI), 78(Supplement 1):S48-S55, 2009. Epub 2008 Sep 17, http://dx.doi.org/10.1016/j.ijmedinf.2008.07.016. Impact Factor 2.061 [PDF]
L. Deléger & P. Zweigenbaum. Extracting lay paraphrases of specialized expressions from monolingual comparable medical corpora. Proceedings of the 2nd Workshop on Building and Using Comparable Corpora: from Parallel to Non-Parallel Corpora, pages 2-10, Association for Computational Linguistics, Singapore, August 2009. [PDF]
C. Grouin, L. Deléger, & P. Zweigenbaum. A simple rule-based medication extraction system. In Özlem Uzuner and Imre Solti, editors, i2b2 Medical Extraction Challenge Workshop Proceedings, pages 2-10. i2b2, 2009.
L. Deléger. Exploitation de corpus parallèles et comparables pour la détection de correspondances lexicales : application au domaine médical. PhD thesis, Pierre et Marie Curie University (UPMC-Paris 6), October 2009. [PDF]
L. Deléger & P. Zweigenbaum. Paraphrase acquisition from comparable medical corpora of specialized and lay texts. Proceedings of the American Medical Informatics Association Annual Fall Symposium (AMIA) 2008, pages 146-150, Washington, DC, November 2008. [PDF]
L. Deléger & P. Zweigenbaum. Aligning lay and specialized passages in comparable medical corpora. Proceedings of Medical Informatics in Europe (MIE2008), volume 136 of Studies in Health Technology and Informatics, pages 89-94, Amsterdam, 2008. IOS Press. [PDF]
L. Deléger, F. Namer, & P. Zweigenbaum. Defining medical words: transposing morphosemantic analysis from French to English. Proceedings of MEDINFO 2007, volume 129 of Studies in Health Technology and Informatics, pages 152-156, Amsterdam, 2007. IOS Press. [PDF]
D. Dannells & L. Deléger. Multilingual Generation of Medical Information. Proceedings of the 9th Bar-Ilan Symposium on the Foundations of Artificial Intelligence (BISFAI), Israel, 2007.
L. Deléger, F. Namer, & P. Zweigenbaum. Analyse morphosémantique des composés savants : transposition du français à l'anglais. In Nabil Hathout and Philippe Muller, editors, Proceedings of TALN 2007 (Traitement automatique des langues naturelles), pages 79-88, Toulouse, June 2007. [PDF]
L. Deléger, M. Merkel, & P. Zweigenbaum. Contribution to terminology internationalization by word alignment in parallel corpora. Proceedings of the American Medical Informatics Association Annual Fall Symposium (AMIA) 2006, pages 185-189, Washington, DC, November 2006. [PDF]
L. Deléger, M. Merkel, & P. Zweigenbaum. Enriching medical terminologies: an approach based on aligned corpora. Proceedings of Medical Informatics in Europe (MIE2006), Studies in Health Technology and Informatics, pages 747-752, 2006. [PDF]
L. Deléger, M. Merkel, & P. Zweigenbaum. Using word alignment to extend multilingual medical terminologies. Proceedings of the LREC Workshop on acquiring and representing multilingual, specialized lexicons: the case of biomedicine, pages 9-14, Genoa, Italy, 2006. ELDA. [PDF]
L. Deléger. Alignement de mots dans un corpus parallèle pour l’enrichissement de la terminologie médicale. Master's thesis, INALCO, 2005.