Selected publications
AKIMUSHKIN, C.; AMANCIO, D.R.; OLIVEIRA JR, O.N. Text Authorship Identified Using the Dynamics of Word Co-Occurrence Networks. Plos One, v. 12, p. 1-15, 2017. link to the paper
AKIMUSHKIN, C.; AMANCIO, D.R.; OLIVEIRA JR, O.N. On the role of words in the network structure of texts: application to authorship attribution. Physica A: Statistical Mechanics and its Applications, vol. 495, p. 49-58, 2018. link to the paper
ALMEIDA, G.M.B.; FERREIRA, J.P.; CORREIA, M.; OLIVEIRA, G.M. Vocabulário Ortográfico Comum (VOC): constituição de uma base lexical para a língua portuguesa. Estudos Linguísticos, v. 42, n.1, p. 204-215, 2013. link to the paper
ALMEIDA, G.M.B.; OLIVEIRA, L.H.M. Terminology and computational linguistics: new praxes in terminography. Cahiers de Lexicologie, v. 101, p. 139-153, 2012. link to the paper
ALMEIDA, G.M.B. A Teoria Comunicativa da Terminologia e a sua prática. Alfa, v. 50, n. 2, p. 85-101, 2006. link to the paper
ALUÍSIO, S.M.; GASPERIN, C. Fostering Digital Inclusion and Accessibility: The PorSimples project for Simplification of Portuguese Texts. Proceedings of the NAACL HLT 2010 Young Investigators Workshop on Computational Approaches to Languages of the Americas, p. 46-53, 2010. link to the paper
AMANCIO, D.R.; ALTMANN, E.G.; RYBSKI, D.; OLIVEIRA JR., O.N.; COSTA, L.D. Probing the Statistical Properties of Unknown Texts: Application to the Voynich Manuscript. PLOS One, v. 8, n. 7, 2013. link to the paper
AMANCIO, D.R.; ANTIQUEIRA, L.; PARDO, T.A.S.; COSTA, L.F.; OLIVEIRA JR., O.N.; NUNES, M.G.V. Complex networks analysis of manual and machine translations. International Journal of Modern Physics C, v. 19, n. 4, p. 583-598, 2008. link to the paper
AMANCIO, D.R.; OLIVEIRA JR., O.N.; COSTA, L.D. Structure–semantics interplay in complex networks and its effects on the predictability of similarity in texts. Physica A: Statistical Mechanics and its Applications, vol. 391, n. 15, p. 4406-4419, 2012. link to the paper
ANCHIÊTA, R.T.; PARDO, T.A.S. Analise Semântica com base em AMR para o Português. LinguaMÁTICA, v. 14, p. 33-48, 2022. link to the paper
ANCHIÊTA, R.T.; PARDO, T.A.S. Semantically Inspired AMR Alignment for the Portuguese language. Proceedings of the Conference on Empirical Methods in Natural Language Processing, p. 1595-1600, 2020. link to the paper
ANTIQUEIRA, L.; NUNES, M.G.V.; OLIVEIRA JR, O.N.; COSTA, L.F. Strong correlations between text quality and complex networks features. Physica A: Statistical Mechanics and its Applications, v. 373, p. 811-820, 2007. link to the paper
ANTIQUEIRA, L.; OLIVEIRA JR, O.N; COSTA, L.F.; NUNES, M.G.V. A complex network approach to text summarization. Information Sciences, v. 179, p. 584-599, 2009. link to the paper
CARDOSO, P.C.F.; PARDO, T.A.S.; Taboada, M. Subtopic annotation and automatic segmentation for news texts in Brazilian Portuguese. Corpora, v. 12, p. 23-54, 2017. link to the paper
CARDOSO, P.C.F.; MAZIERO, E.G.; CASTRO JORGE, M.L.R.; SENO, E.R.M.; DI FELIPPO, A.; RINO, L.H.M.; NUNES, M.G.V.; PARDO, T.A.S. CSTNews - A Discourse-Annotated Corpus for Single and Multi-Document Summarization of News Texts in Brazilian Portuguese. Proceedings of the 3rd RST Brazilian Meeting, p. 88-105, 2011. link to the paper
CASANOVA, E.; CANDIDO JR, A.; SHULBY, C.; OLIVEIRA, F.S.; TEIXEIRA, J.P.; PONTI, M.A.; ALUÍSIO, S.M. TTS-Portuguese Corpus: a corpus for speech synthesis in Brazilian Portuguese. Language Resources and Evaluation, v. 56, p. 1043-1055, 2022. link to the paper
COMIN, C.H.; PERON, T.; SILVA, F.N.; AMANCIO, D.R.; RODRIGUES, F.A.; COSTA, L.F. Complex systems: Features, similarity and connectivity. Physics Reports, v. 861, p. 1-41, 2020. link to the paper
CORRÊA JR, E.A.; LOPES, A.A.; AMANCIO, D.R. Word sense disambiguation: a complex network approach. Information Sciences, v. 442, p. 103-113, 2018. link to the paper
DIAS, M.S.; DI FELIPPO, A.; RASSI, A.P.; CARDOSO, P.C.F.; NOBREGA, F.A.A.; PARDO, T.A.S. An investigation of linguistic problems in automatic multi-document summaries. Revista de Estudos da Linguagem, v. 29, p. 859-907, 2021. link to the paper
DIAS-DA-SILVA, B.C. Brazilian Portuguese WordNet: A Computational Linguistic Exercise of Encoding Bilingual Relational Lexicons. International Journal of Computational Linguistics and Applications, v. 1, p. 137-150, 2010. link to the paper
DIAS-DA-SILVA, B.C. O estudo lingüístico-computacional da linguagem. Letras de Hoje, v. 41, n. 2, p. 103-138, 2006. link to the paper
DURAN, M.S.; NUNES, M.G.V.; LOPES, L.; PARDO, T.A.S. Manual de anotação como recurso de Processamento de Linguagem Natural: o modelo Universal Dependencies em língua portuguesa. Domínios de Lingu@gem, v. 16, n. 4, p. 1608-1643, 2022. link to the paper
FELTRIM, V.D.; TEUFEL, S.; NUNES, M.G.V.; ALUÍSIO, S.M. Argumentative Zoning Applied to Critiquing Novices’ Scientific Abstracts. Computing Attitude and Affect in Text: Theory and Applications, v. 20, p. 233-246, 2006. link to the paper
GEWERS, F.L.; FERREIRA, G.R.; ARRUDA, H.F.; SILVA, F.N.; COMIN, C.H.; AMANCIO, D.R.; COSTA, L.F. Principal Component Analysis. ACM Computing Surveys, v. 54, n. 4, p. 1-34, 2021. link to the paper
HARTMANN, N.; DURAN, M.; ALUÍSIO, S.M. Automatic Semantic Role Labeling on Non-revised Syntactic Trees of Journalistic Texts. Proceedings of the 12th International Conference on the Computational Processing of Portuguese, p. 202-212, 2016. link to the paper
INACIO, M.L.; SOBREVILLA CABEZUDO, M.A.; DI FELIPPO, A.; RAMISCH, R.; PARDO, T.A.S. The AMR-PT corpus and the semantic annotation of challenging sentences from journalistic and opinion texts. DELTA: Documentação de Estudos em Linguística Teórica e Aplicada, 2022. link to the paper
INACIO, M.L.; PARDO, T.A.S. Semantic-Based Opinion Summarization. Proceedings of Recent Advances in Natural Language Processing, p. 624-633, 2021. link to the paper
LEITE, D.S.; RINO, L.H.M.; PARDO, T.A.S.; NUNES, M.G.V. Extractive Automatic Summarization: Does more linguistic knowledge make a difference? Proceedings of the Second Workshop on TextGraphs: Graph-Based Algorithms for Natural Language Processing, p. 17-24, 2007. link to the paper
LOPES, L.; DURAN, M.S.; FERNANDES, P.H.L.; PARDO, T.A.S. PortiLexicon-UD: a Portuguese Lexical Resource according to Universal Dependencies Model. Proceedings of the 13th Edition of the Language Resources and Evaluation Conference, p. 6635‑6643, 2022. link to the paper
LÓPEZ CONDORI, R.E.; PARDO, T.A.S. Opinion Summarization Methods: Comparing and Extending Extractive and Abstractive Approaches. Expert Systems with Applications, v. 78, p. 124-134, 2017. link to the paper
NUNES, M.G.V.; CASELI, H.M.; FORCADA, M. Automatic induction of bilingual resources from aligned parallel corpora: application to shallow-transfer machine translation. Machine Translation, v. 20, p. 227-245, 2008. link to the paper
MACHADO, M.T.; PARDO, T.A.S.; RUIZ, E.E.S.; DI FELIPPO, A.; VARGAS, F. Implicit opinion aspect clues in Portuguese texts: analysis and categorization. Proceedings of the International Conference on Computational Processing of Portuguese, p. 68-78, 2022. link to the paper
MARTINS, R.T.; HASEGAWA, R.; NUNES, M.G.V.; MONTILHA, G.; OLIVEIRA JR, O.N. Linguistic Issues in the Development of ReGra: a Grammar Checker for Brazilian Portuguese. Natural Language Engineering, v. 4, n. 4, p. 287-307, 1998. link to the paper
MAZIERO, E.G.; CASTRO JORGE, M.L.R.; PARDO, T.A.S. Revisiting Cross-document Structure Theory for multi-document discourse parsing. Information Processing and Management, v. 50, n. 2, p. 297-314, 2014. link to the paper
MONTEIRO, R.A.; SANTOS, R.L.S; PARDO, T.A.S; ALMEIDA, T.A.; RUIZ, E.E.S; VALE, O.A. Contributions to the Study of Fake News in Portuguese: New Corpus and Automatic Detection Results. Proceedings of the International Conference on Computational Processing of the Portuguese Language, p. 324-334, 2018. link to the paper
MENDES, A.R.; PASSADOR, R.V.P.; CASELI, H.M. Identificando sintomas de depressão em postagens do Twitter em português do Brasil. Proceedings of the XIII Symposium in Information and Human Language Technology, p. 162-171, 2021. link to the paper
PARDO, T.A.S.; DURAN, M.S.; DI FELIPPO, A.; ROMAN, N.; NUNES, M.G.V. Porttinari - a large multi-genre treebank for Brazilian Portuguese. Proceedings of the XIII Symposium in Information and Human Language Technology, p. 1-10, 2021. link to the paper
PICOLI, L.; LAPORTE, E.; VALE, O. Aspecto verbal nas construções com Verbo-Suporte. Revista do GEL, v. 18, n. 1, p. 204-229, 2021. link to the paper
RASSI, A.P.; BAPTISTA, J.; VALE, O.A.; MAMEDE, N. Integração de predicados nominais em parser: uma experiência com as construções com o verbo-suporte dar em português brasileiro. Alfa – Revista de Linguística, v. 62, n. 3, p.543-571, 2018. link to the paper
RASSI, A.P.; SANTOS-TURATI, C.; BAPTISTA, J.; MAMEDE, N.; VALE, O. The fuzzy boundaries of operator verb and support verb constructions with dar “give” and ter “have” in Brazilian Portuguese. Proceedings of Workshop on Lexical and Grammatical Resources for Language Processing, p. 92-101, 2014. link to the paper
RODRIGUEZ, M.Z.; COMIN, C.H.; CASANOVA, D.; BRUNO, O.M.; AMANCIO, D.R.; COSTA, L.F.; RODRIGUES, F.A. Clustering algorithms: A comparative approach. PLoS One, v. 14, p. 1-34, 2019. link to the paper
SANTOS, L.B.; CORRÊA JR., E.A. ; OLIVEIRA JR., O.N. ; AMANCIO, D.R. ; MANSUR, L.L.; ALUÍSIO, S.M. Enriching Complex Networks with Word Embeddings for Detecting Mild Cognitive Impairment from Speech Transcripts. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), p. 1284-1296, 2017. link to the paper
SATO, J.; CASELI, H.M.; SPECIA, L. Multilingual and Multimodal Learning for Brazilian Portuguese. Proceedings of the 13th Conference on Language Resources and Evaluation, p. 919-927, 2022. link to the paper
SENO, E.R.M.; CASELI, H.M.; INÁCIO, M.L.; ANCHIÊTA, R.T.; RAMISCH, R. XPTA: um parser AMR para o Português baseado em uma abordagem entre línguas. LinguaMÁTICA, v. 14, p. 49-68, 2022. link to the paper
SILVA, C.F.; RASSI, A.P.; SOUZA, J.W.C.; RAMISCH, R.; ANTUNES, R.A.M.R.; CASELI, H.M. Quality of argumentation in political tweets: what is and how to measure it. Revista de Estudos da Linguagem, v. 29, p. 2537-2586, 2021. link to the paper
SILVA, E.H.; PARDO, T.A.S.; ROMAN, N.T.; DI FELIPPO, A. Universal Dependencies for Tweets in Brazilian Portuguese: Tokenization and Part of Speech Tagging. Proceedings of the 18th National Meeting on Artificial and Computational Intelligence, p. 434-445, 2021. link to the paper
SILVA, F.N.; AMANCIO, D.R.; BARDOSOVA, M.; COSTA, L.F.; OLIVEIRA JR., O.N. Using network science and text analytics to produce surveys in a scientific topic. Journal of Informetrics, v. 10, n. 2, p. 487-502, 2016. link to the paper
SILVA, J.R.; CASELI, H.M. Sense representations for Portuguese: experiments with sense embeddings and deep neural language models. Language Resources and Evaluation, v. 25, p. 901-924, 2021. link to the paper
SILVA, R.M.; SANTOS, R.L.S.; ALMEIDA, T.A.; PARDO, T.A.S. Towards automatically filtering fake news in Portuguese. Expert Systems with Applications, v. 146, p. 1-14, 2020. link to the paper
SOBREVILLA CABEZUDO, M.A.; PARDO, T.A.S. Low-resource AMR-to-Text Generation: A Study on Brazilian Portuguese. Procesamiento del Lenguaje Natural, v. 68, p. 85-97, 2022. link to the paper
SOUSA, R.F.; PARDO, T.A.S. The Challenges of Modeling and Predicting Online Review Helpfulness. Proceedings of the 18th National Meeting on Artificial and Computational Intelligence, p. 727-738, 2021. link to the paper
SOUZA, J.W.C.; DI FELIPPO, A. Characterization of temporal complementarity: fundamentals for Multi-Document Summarization. ALFA: Revista de Linguística, v. 62, p. 125-150, 2018. link to the paper
SOUZA, J.W.C.; DI FELIPPO, A. Evaluating a typology of signals for automatic detection of complementarity. Domínios de Lingu@gem, v. 16, p. 1517-1543, 2022. link to the paper
SPECIA, L.; SRINIVASAN, A.; JOSHI, S.; RAMAKRISHNAN, G.; NUNES, M.G.V. An investigation into feature construction to assist word sense disambiguation. Machine Learning, v. 76, p. 109-136, 2009. link to the paper
UZÊDA, V.R.; PARDO, T.A.S.; NUNES, M.G.V. A comprehensive comparative evaluation of RST-based summarization methods. ACM Transactions on Speech and Language Processing, v. 6, n. 4, p. 1-20, 2010. link to the paper
VARGAS, F.; CARVALHO, I.; GÓES, F.; PARDO, T.A.S.; BENEVENUTO, F. HateBR: A Large Expert Annotated Corpus of Brazilian Instagram Comments for Offensive Language and Hate Speech Detection. Proceedings of the 13th Edition of the Language Resources and Evaluation Conference, p. 7174-7183, 2022. link to the paper