[See papers and citations on Scholar]
Book
Cyril Goutte, Nicola Cancedda, Marc Dymetman and George Foster (2009) Learning Machine Translation, MIT Press.
This is the follow-up book to the Machine Learning for Multilingual Information Access workshop (at NIPS'06), published by MIT Press in January 2009. [order from amazon .ca .uk .fr, or others]
No doubt due to its amazing success :), this bestseller was reprinted in 2010 in a cheaper edition by PHI India.
Other publications, by date
These are the main publications I have been associated with. I am trying to make as many of these available as possible, either through the publisher's website or as preprints. If you are interested in a publication that is not in this list, or cannot access the publisher's website and need a preprint, please send me an email at NRC (Cyril dot Goutte at nrc dot ca).
2012
- Marco Turchi, Cyril Goutte, and Nello Cristianini (2012) Learning Machine Translation for In-domain and Out-of-domain Data, 16th Annual Conference of the European Association for Machine Translation, to appear.
- Marco Turchi, Tijl De Bie, Cyril Goutte, and Nello Cristianini (2012) Learning to Translate: a statistical and computational analysis, Advances in Artificial Intelligence, vol. 2012, Article ID 484580.
2011
2010
- Young-Min Kim, Massih-Reza Amini, Cyril Goutte and Patrick Gallinari (2010) Multi-view clustering of multilingual documents, ACM Conference on Research and Development in Information Retrieval (SIGIR 2010), ACM, pp. 821-822.
- Anastasia Krithara, Massih-Reza Amini, Cyril Goutte and Jean-Michel Renders (2010) An extension of the Aspect PLSA Model to Active and Semi-supervised Learning for Text Classification, 6th Hellenic Conference on Artificial Intelligence (SETN).
- Roland Kuhn, Pierre Isabelle, Cyril Goutte, Jean Senellart, Michel Simard and Nicola Ueffing (2010) Automatic Post-Editing, MultiLingual 21(2):43-46, March 2010.
- Massih-Reza Amini and Cyril Goutte (2010) A Co-classification Approach to Learning from Multilingual Corpora, Machine Learning, 79(1-2):105-121, May 2010.
2009
- Massih-Reza Amini, Nicolas Usunier and Cyril Goutte (2009) Learning from Multiple Partially Observed Views -- an Application to Multilingual Text Categorization, Advances in Neural Information Processing Systems (NIPS'09).
- Lasse L. Mølgaard, Jan Larsen and Cyril Goutte (2009) Temporal analysis of text data using latent variable models, IEEE International Workshop on Machine Learning for Signal Processing (MLSP).
- David Kurokawa, Cyril Goutte and Pierre Isabelle (2009) Automatic Detection of Translated Text and its Impact on Machine Translation, Proceedings of MT-Summit XII.
- Cyril Goutte, David Kurokawa and Pierre Isabelle (2009) Improving SMT by learning the translation direction, EAMT 2009 workshop "Statistical Multilingual Analysis for Retrieval and Translation", Barcelona.
- Nicola Cancedda, Marc Dymetman, George Foster and Cyril Goutte (2009) A Statistical Machine Translation Primer, Chapter 1 in Learning Machine Translation, MIT Press.
2008
- Anastasia Krithara, Massih-Reza Amini, Jean-Michel Renders and Cyril Goutte (2008) Semi-supervised Document Classification with a Mislabeling Error Model, in C. Macdonald, I. Ounis, V. Plachouras, I. Ruthven, and R.W. White (Eds.) Advances in Information Retrieval -- 30th European Conference on IR Research (ECIR'08), Lecture Notes in Computer Science 4956, Springer, pp. 370-381.
2007
- Michel Simard, Cyril Goutte and Pierre Isabelle (2007) Statistical Phrase-based Post-editing, Human Language Technologies: The Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT 2007).
2006
- Anastasia Krithara, Cyril Goutte, Massih-Reza Amini and Jean-Michel Renders (2006) Reducing the Annotation Burden in Text Classification, First International Conference on Multidisciplinary Information Sciences and Technologies (InSciT2006), Merida, Spain, 25-28 October.
- Anastasia Krithara, Cyril Goutte, Massih-Reza Amini and Jean-Michel Renders (2006) Active, Semi-Supervised Learning for Textual Information Access, International Workshop on Intelligent Information Access (IIIA-2006), Helsinki, Finland, 6-8 July.
- Stéphane Clinchant, Cyril Goutte and Eric Gaussier (2006) Lexical Entailment for Information Retrieval, Advances in Information Retrieval - 28th European Conference on IR Research (ECIR'06), Lecture Notes in Computer Science 3936, Springer, pp. 217-228.
2005
M. Simard, N. Cancedda, B. Cavestro, M. Dymetman, E. Gaussier, C. Goutte, Philippe Langlais, Arne Mauser and K. Yamada (2005) Translating with non-contiguous phrases, Proceedings of HLT-EMNLP 2005. [Citations]
P. Ahrendt, J. Larsen and C. Goutte (2005) Co-occurrence models in music genre classification, IEEE International Workshop on Machine Learning for Signal Processing, pp. 247-252.
E. Gaussier and C. Goutte (2005) Relation between PLSA and NMF and Implications, ACM Conference on Research and Development in Information Retrieval, SIGIR 2005, pp. 601-602. [Citations]
E. Gaussier and C. Goutte (2005) Learning from partially labelled data -- with confidence, ICML 2005 Workshop on Learning with Partially Classified Training Data.
M. Simard, N. Cancedda, B. Cavestro, M. Dymetman, E. Gaussier, C. Goutte, P. Langlais, A. Mauser, K. Yamada (2005) Traduction automatique statistique avec des segments discontinus, Traitement Automatique des Langues Naturelles (TALN 2005), Volume 1, pp. 233-242.
C. Goutte and E. Gaussier (2005) A Probabilistic Interpretation of Precision, Recall and F-score, with Implication for Evaluation, in D.E. Losada and J.M. Fernandez-Luna (eds), Advances in Information Retrieval - 27th European Conference on IR Research (ECIR'05), Lecture Notes in Computer Science 3408, Springer, pp. 345-359. [Citations]
2004
J. Blatz, E. Fitzgerald, G. Foster, S. Gandrabur, C. Goutte, A. Kulesza, A. Sanchis, N. Ueffing (2004) Confidence Estimation for Machine Translation, Proceedings of the 20th International Conference on Computational Linguistics (COLING 2004), pp. 315-321. [Citations]
C. Goutte, K. Yamada, E. Gaussier (2004) Aligning words with matrix factorisation, Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics (ACL 2004), pp. 503-510. [Citations]
E. Gaussier, J.-M. Renders, I. Matveeva, C. Goutte, H. Déjean (2004) A Geometric view on bilingual lexicon extraction from comparable corpora, Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics (ACL 2004), pp. 527-534. [Citations]
C. Goutte, P.B. Dobrokhotov, E. Gaussier, A.-L. Veuthey (2004) Corpus-Based vs. Model-Based Selection of Relevant Features, Proc. COnférence en Recherche d'Information et Applications (CORIA'2004), pp. 75-88.
C. Goutte, E. Gaussier, N. Cancedda, H. Déjean (2004) Generative vs Discriminative Approaches to Entity Recognition from Label-Deficient Data, in G. Purnelle, C. Fairon and A. Dister (eds), Le poids des mots - Actes des 7èmes Journées internationales d'Analyse statistique des Données Textuelles (JADT04), pp. 515-523. Presses Universitaires de Louvain.
P.B. Dobrokhotov, C. Goutte, A.-L. Veuthey, E. Gaussier (2004) Assisting medical annotation in Swiss-Prot using statistical classifiers, International Journal of Medical Informatics, 74(2-4):317-324.
2003
H. Déjean, E. Gaussier, C. Goutte and K. Yamada (2003) Reducing parameter space for word alignment, NAACL/HLT Workshop: Building and Using Parallel Texts. [Citations]
P.B. Dobrokhotov, C. Goutte, A.-L. Veuthey and E. Gaussier (2003) Combining NLP and Probabilistic Categorisation for Document and Term Selection for Swiss-Prot Medical Annotation, Proceedings of the 11th International Conference on Intelligent Systems for Molecular Biology (ISMB 2003), Bioinformatics 19(Suppl 1):I91-I94. [Citations]
P.B. Dobrokhotov, C. Goutte, A.-L. Veuthey and E. Gaussier (2003) A Probabilistic Information Retrieval Approach to Medical Annotation in Swiss-Prot, Proceedings of Medical Informatics Europe (MIE2003), in R. Baud, M. Fieschi, P. Le Beux and P. Ruch (eds) The New Navigators: from Professionals to Patients, Studies in Health Technology and Informatics, 95:421-426.
N. Cancedda, E. Gaussier, C. Goutte and J.M. Renders (2003) Word-Sequence Kernels, Journal of Machine Learning Research 3:1059-1082. [Citations]
C. Goutte, P. Dobrokhotov, E. Gaussier and A.-L. Veuthey (2003) Catégorisation de documents PubMed pour l'annotation médicale dans Swiss-Prot, Actes de l'atelier "Fouille de données et recherche d'information dans les bases de données multimedia semi-structurées", conférence EGC 2003.
2002
Nicola Cancedda, Cyril Goutte, Jean-Michel Renders, Nicolò Cesa-Bianchi, Alex Conconi, Y. Li, John Shawe-Taylor, Alexei Vinokourov, Thore Graepel, Claudio Gentile (2002) Kernel Methods for Document Filtering, NIST Special Publication 500-XXX: The Eleventh Text Retrieval Conference (TREC 2002). [Citations]
Cyril Goutte, Hervé Déjean, Eric Gaussier, Nicola Cancedda and Jean-Michel Renders (2002) Combining labelled and unlabelled data: a case study on Fisher kernels and transductive inference for biological entity recognition, Proceedings of the Sixth Conference on Natural Language Learning (CoNLL-02). http://ilk.kub.nl/~signll/conll02/cfp.html
Eric Gaussier, Cyril Goutte, Kris Popat and Francine Chen (2002) A hierarchical model for clustering and categorising documents, in F. Crestani, M. Girolami and C.J. van Rijsbergen (eds), Advances in Information Retrieval -- Proceedings of the 24th BCS-IRSG European Colloquium on IR Research (ECIR'02), Lecture Notes in Computer Science 2291, Springer, pp. 229-247. [Citations]
Eric Gaussier and Cyril Goutte (2002) Probabilistic Models for Hierarchical Clustering and Categorisation: Applications in the Information Society, Proceedings of the International Conference on Advances in Infrastructures for e-Business, e-Education, e-Science and e-Medicine on the Internet (SSGRR2002w).
2001
2000
1999
Jan Larsen and Cyril Goutte (1999) On optimal data split for generalization estimation and model selection, in S.-Y. Kung, J. Larsen, E. Wilson and S. Douglas (eds), Neural Networks for Signal Processing IX - Proceedings of the 1999 IEEE Workshop, pp. 225-234, IEEE (Piscataway, NJ). [Citations]
Cyril Goutte, Peter Toft, Egil Rostrup, Finn Aarup Nielsen and Lars Kai Hansen (1999) On clustering fMRI time series, NeuroImage, 9(3):298-310. [Citations]
1998
C. Goutte and J. Larsen (1998) Optimal Cross-Validation Split Ratio: Experimental Investigation, in L. Niklasson, M. Bodén and T. Ziemke (eds), Proceedings of the 8th International Conference on Artificial Neural Networks (Skövde), pp. 681-686, Perspectives in Neural Computing, Springer Verlag (Berlin).
C. Goutte and J. Larsen (1998) Adaptive metric kernel regression, in T. Constantinides, S.-Y. Kung, M. Niranjan and E. Wilson (eds), Neural Networks for Signal Processing VIII - Proceedings of the 1998 IEEE Workshop (Cambridge), pp. 184-193, IEEE (Piscataway, NJ).
C. Goutte and J. Larsen (1998) Adaptive Regularization of Neural Networks Using Conjugate Gradient, in Proceedings of the 1998 Intl. Conference on Acoustics, Speech and Signal Processing - ICASSP'98 (Seattle), vol. 2, pp. 1205-1208, IEEE (Piscataway, NJ).
C. Goutte (1998) Behaviour in 0 of the neural networks training cost, Neural Processing Letters, 8(2):107-116.
1997
Cyril Goutte and Lars Kai Hansen (1997) Regularization with a pruning prior, Neural Networks, 10(6):1053-1059. [Citations]
Cyril Goutte (1997) Note on free lunches and cross-validation, Neural Computation, 9(6):1211-1215. [Citations]
Cyril Goutte (1997) Extracting the relevant delays in time series modelling, in J. Principe, L. Giles, N. Morgan and E. Wilson (eds.) Neural Networks for Signal Processing VII - Proceedings of the 1997 IEEE Workshop (Florida), pp. 92-101, IEEE (Piscataway, NJ). [Citations]
Cyril Goutte (1997) Lag space estimation in time series modelling, in Proceedings of the 1997 Intl. Conference on Acoustics, Speech, and Signal Processing - ICASSP 97 (Munich), vol. 4, pp. 3313-3316, IEEE (Piscataway, NJ). [Citations]
1996