Scientific publications
2023
E. Gugliotta, Marco Dinarelli
An Empirical Analysis of Task Relations in the Multi-Task Annotation of an Arabizi Corpus
Language, Data and Knowledge Conference (LDK), Vienna, Austria, 2023.
Accepted for publication
An Empirical Analysis of Task Relations in the Multi-Task Annotation of an Arabizi Corpus
Language, Data and Knowledge Conference (LDK), Vienna, Austria, 2023.
Accepted for publication
L. Lupo, Marco Dinarelli, L. Besacier
Encoding Sentence Position in Context-Aware Neural Machine Translation with Concatenation
Workshop on Insights from Negative Results in NLP, Dubrovnik, Croatia, 2023.
Accepted for publication
Encoding Sentence Position in Context-Aware Neural Machine Translation with Concatenation
Workshop on Insights from Negative Results in NLP, Dubrovnik, Croatia, 2023.
Accepted for publication
2022
L. Lupo, Marco Dinarelli, L. Besacier
Focused Concatenation for Context-Aware Neural Machine Translation
Seventh Conference on Machine Translation, Abu Dhabi, 2022.
Focused Concatenation for Context-Aware Neural Machine Translation
Seventh Conference on Machine Translation, Abu Dhabi, 2022.


Marco Dinarelli, M. Naguib, F. Portet
Toward Low-Cost End-to-End Spoken Language Understanding
Interspeech, Incheon, Korea, 2022.
Toward Low-Cost End-to-End Spoken Language Understanding
Interspeech, Incheon, Korea, 2022.


M. Naguib, F. Portet, Marco Dinarelli
Vers la compréhension automatique de la parole bout-en-bout à moindre effort
Traitement Automatique des Langues Naturelles, Avignon, France, 2022.
Vers la compréhension automatique de la parole bout-en-bout à moindre effort
Traitement Automatique des Langues Naturelles, Avignon, France, 2022.


E. Gugliotta, Marco Dinarelli
TArC: Tunisian Arabish Corpus, First complete release
Language Resources and Evaluation Conference (LREC), Marseille, France, 2022.
TArC: Tunisian Arabish Corpus, First complete release
Language Resources and Evaluation Conference (LREC), Marseille, France, 2022.


S. Evain, H. Nguyen, H. Le, M. Zanon Boito, S. Mdhaffar, S. Alisamir, Z. Tong, N. Tomashenko, Marco Dinarelli, T. Parcollet, A. Allauzen, Y. Esteve, B. Lecouteux, F. Portet, S. Rossato, F. Ringeval, D. Schwab, L. Besacier
LeBenchmark, un référentiel d'évaluation pour le français oral
Journée d'Étude sur la Parole, Île de Noirmoutier, France, 2022.
LeBenchmark, un référentiel d'évaluation pour le français oral
Journée d'Étude sur la Parole, Île de Noirmoutier, France, 2022.


S. Evain, H. Nguyen, H. Le, M. Zanon Boito, S. Mdhaffar, S. Alisamir, Z. Tong, N. Tomashenko, Marco Dinarelli, T. Parcollet, A. Allauzen, Y. Esteve, B. Lecouteux, F. Portet, S. Rossato, F. Ringeval, D. Schwab, L. Besacier
Modèles neuronaux pré-appris par auto-supervision sur des enregistrements de parole en français
Journée d'Étude sur la Parole, Île de Noirmoutier, France, 2022.
Modèles neuronaux pré-appris par auto-supervision sur des enregistrements de parole en français
Journée d'Étude sur la Parole, Île de Noirmoutier, France, 2022.


L. Lupo, Marco Dinarelli, L. Besacier
Divide and Rule: Effective Pre-Training for Context-Aware Multi-Encoder Translation Models
Association for Computational Linguistics, 2022.
Pre-print of an earlier version available on arXiv
Divide and Rule: Effective Pre-Training for Context-Aware Multi-Encoder Translation Models
Association for Computational Linguistics, 2022.
Pre-print of an earlier version available on arXiv


2021
S. Evain, H. Nguyen, H. Le, M. Zanon Boito, S. Mdhaffar, S. Alisamir, Z. Tong, N. Tomashenko, Marco Dinarelli, T. Parcollet, A. Allauzen, Y. Esteve, B. Lecouteux, F. Portet, S. Rossato, F. Ringeval, D. Schwab, L. Besacier
Task Agnostic and Task Specific Self-Supervised Learning from Speech with LeBenchmark
In proceedings of NeurIPS, Datasets and Benchmarks Track, 2021.
Task Agnostic and Task Specific Self-Supervised Learning from Speech with LeBenchmark
In proceedings of NeurIPS, Datasets and Benchmarks Track, 2021.


S. Evain, H. Nguyen, H. Le, M. Zanon Boito, S. Mdhaffar, S. Alisamir, Z. Tong, N. Tomashenko, Marco Dinarelli, T. Parcollet, A. Allauzen, Y. Esteve, B. Lecouteux, F. Portet, S. Rossato, F. Ringeval, D. Schwab, L. Besacier
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
In proceedings of Interspeech, Brno, Czech, 2021.
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
In proceedings of Interspeech, Brno, Czech, 2021.


2020
E. Gugliotta, Marco Dinarelli, O. Kraif
Multi-Task Sequence Prediction For Tunisian Arabizi Multi-Level Annotation
The Fifth Arabic Natural Language Processing Workshop (WANLP), Barcelona, Spain, 2020.
Multi-Task Sequence Prediction For Tunisian Arabizi Multi-Level Annotation
The Fifth Arabic Natural Language Processing Workshop (WANLP), Barcelona, Spain, 2020.


E. Gugliotta, Marco Dinarelli
TArC Un corpus d'Arabish tunisien
Traitement Automatique des Langues Naturelles, Nancy, France, 2020.
TArC Un corpus d'Arabish tunisien
Traitement Automatique des Langues Naturelles, Nancy, France, 2020.


2019
Marco Dinarelli, L. Grobol
Hybrid Neural Models For Sequence Modelling: The Best Of Three Worlds
arXiv Technical Report, 2019.
English translation of the 2019 TALN paper
Hybrid Neural Models For Sequence Modelling: The Best Of Three Worlds
arXiv Technical Report, 2019.
English translation of the 2019 TALN paper


2018
2017
Marco Dinarelli, Y. Dupont, I. Tellier
Effective Spoken Language Labeling with Deep Recurrent Neural Networks
arXiv Technical Report, 2017.
Effective Spoken Language Labeling with Deep Recurrent Neural Networks
arXiv Technical Report, 2017.


Marco Dinarelli, V. Vukotic, C. Raymond
Label-dependency coding in Simple Recurrent Networks for Spoken Language Understanding
In proceedings of Interspeech, Stockholm, Sweden, 2017.
Label-dependency coding in Simple Recurrent Networks for Spoken Language Understanding
In proceedings of Interspeech, Stockholm, Sweden, 2017.


T. Tian, Marco Dinarelli, P. Cardoso, I. Tellier
Détection des mots non-standards dans les tweets avec des réseaux de neurones
Article court à Traitement Automatique des Langues Naturelles (TALN), Orléans, France, 2017.
Détection des mots non-standards dans les tweets avec des réseaux de neurones
Article court à Traitement Automatique des Langues Naturelles (TALN), Orléans, France, 2017.


L. Grobol, I. Tellier, E. de La Clergerie, Marco Dinarelli, F. Landragin
Apports des analyses syntaxiques pour la détection automatique de mentions dans un corpus de français oral
Article court à Traitement Automatique des Langues Naturelles (TALN), Orléans, France, 2017.
Apports des analyses syntaxiques pour la détection automatique de mentions dans un corpus de français oral
Article court à Traitement Automatique des Langues Naturelles (TALN), Orléans, France, 2017.


Y. Dupont, Marco Dinarelli, I. Tellier
Réseaux neuronaux profonds pour l’étiquetage de séquences
Article court à Traitement Automatique des Langues Naturelles (TALN), Orléans, France, 2017.
Réseaux neuronaux profonds pour l’étiquetage de séquences
Article court à Traitement Automatique des Langues Naturelles (TALN), Orléans, France, 2017.


Marco Dinarelli, Y. Dupont
Modélisation de dépendances entre étiquettes dans les réseaux neuronaux récurrents
Revue TAL (Traitement Automatique des Langues) Volume 58 Numéro 1, France, 2017.
Modélisation de dépendances entre étiquettes dans les réseaux neuronaux récurrents
Revue TAL (Traitement Automatique des Langues) Volume 58 Numéro 1, France, 2017.


Y. Dupont, Marco Dinarelli, I. Tellier, C. Lautier
Structured Named Entity Recognition by Cascading CRFs
International Conference on Intelligent Text Processing and Computational Linguistics (CICling), Budapest, Hungary, 2017.
Published in Lecture Notes in Computer Sciences (LNCS), Springer
Structured Named Entity Recognition by Cascading CRFs
International Conference on Intelligent Text Processing and Computational Linguistics (CICling), Budapest, Hungary, 2017.
Published in Lecture Notes in Computer Sciences (LNCS), Springer


Y. Dupont, Marco Dinarelli, I. Tellier
Label-Dependencies Aware Recurrent Neural Networks
International Conference on Intelligent Text Processing and Computational Linguistics (CICling), Budapest, Hungary, 2017.
Published in Lecture Notes in Computer Sciences (LNCS), Springer
Best Verifiability, Reproducibility, and Working Description award
Label-Dependencies Aware Recurrent Neural Networks
International Conference on Intelligent Text Processing and Computational Linguistics (CICling), Budapest, Hungary, 2017.
Published in Lecture Notes in Computer Sciences (LNCS), Springer
Best Verifiability, Reproducibility, and Working Description award


2016
Marco Dinarelli, I. Tellier
Improving Recurrent Neural Networks for Sequence Labelling
arXiv Technical Report, 2016.
Improving Recurrent Neural Networks for Sequence Labelling
arXiv Technical Report, 2016.


Marco Dinarelli, I. Tellier
Étude des réseaux de neurones récurrents pour étiquetage de séquences
23ème Conférence sur le Traitement Automatique des Langues Naturelles (TALN), Paris, France, 2016.
Étude des réseaux de neurones récurrents pour étiquetage de séquences
23ème Conférence sur le Traitement Automatique des Langues Naturelles (TALN), Paris, France, 2016.


A. Désoyer, F. Landragin, I. Tellier, A. Lefevre, J.-Y. Antoine, Marco Dinarelli
Coreference Resolution for French Oral Data: Machine Learning Experiments with ANCOR
International Conference on Intelligent Text Processing and Computational Linguistics (CICling), Konya, Turkey, 2016.
Published in Lecture Notes in Computer Sciences (LNCS), Springer
Coreference Resolution for French Oral Data: Machine Learning Experiments with ANCOR
International Conference on Intelligent Text Processing and Computational Linguistics (CICling), Konya, Turkey, 2016.
Published in Lecture Notes in Computer Sciences (LNCS), Springer


T. Tian, I. Tellier, Marco Dinarelli, P. Dias Cardoso
Understanding Social Media Texts with Minimum Human Effort on #Twitter
Language and the new (instant) media (PLIN), Louvain-la-Neuve, Belgium, 2016.
Accepted for publication
Understanding Social Media Texts with Minimum Human Effort on #Twitter
Language and the new (instant) media (PLIN), Louvain-la-Neuve, Belgium, 2016.
Accepted for publication
Marco Dinarelli, I. Tellier
New Recurrent Neural Network Variants for Sequence Labeling
International Conference on Intelligent Text Processing and Computational Linguistics (CICling), Konya, Turkey, 2016.
Published in Lecture Notes in Computer Sciences (LNCS), Springer
New Recurrent Neural Network Variants for Sequence Labeling
International Conference on Intelligent Text Processing and Computational Linguistics (CICling), Konya, Turkey, 2016.
Published in Lecture Notes in Computer Sciences (LNCS), Springer


T. Tian, Marco Dinarelli, I. Tellier, P. Dias Cardoso
Domain Adaptation for Named Entity Recognition Using CRFs
Language Resources Evaluation Conferences (LREC), Portoroz, Slovenia, 2016.
Domain Adaptation for Named Entity Recognition Using CRFs
Language Resources Evaluation Conferences (LREC), Portoroz, Slovenia, 2016.


Y. Dupont, I. Tellier, C. Lautier, Marco Dinarelli
Extraction automatique d'affixes pour la reconnaissance d'entités nommées chimiques
Extraction et Gestion des Connaissances, Reims, France, 2016.
Accepted for publication
Extraction automatique d'affixes pour la reconnaissance d'entités nommées chimiques
Extraction et Gestion des Connaissances, Reims, France, 2016.
Accepted for publication
2015
2012
A. Garcia-Fernandez, A.L. Ligozat, Marco Dinarelli, D. Bernhard
Méthodes pour l'archéologie linguistique: datation par combinaison d'indices temporels
Expérimentations et évaluations en fouille de textes, un panorama des campagnes DEFT, sous la direction de Cyril Grouin et Dominic Forest, Hermes Lavoisier, 2012
Méthodes pour l'archéologie linguistique: datation par combinaison d'indices temporels
Expérimentations et évaluations en fouille de textes, un panorama des campagnes DEFT, sous la direction de Cyril Grouin et Dominic Forest, Hermes Lavoisier, 2012
Marco Dinarelli, S. Rosset
Tree-Structured Named Entity Recognition on OCR Data: Analysis, Processing and Results
In Proceedings of the Language Resources and Evaluation Conference (LREC), Istanbul, Turkey, 2012.
Tree-Structured Named Entity Recognition on OCR Data: Analysis, Processing and Results
In Proceedings of the Language Resources and Evaluation Conference (LREC), Istanbul, Turkey, 2012.


Marco Dinarelli, S. Rosset
Tree Representations in Probabilistic Models for Extended Named Entity Detection
In Proceedings of the European chapter of the Association for Computational Linguistics (EACL), Avignon, France, 2012.
Tree Representations in Probabilistic Models for Extended Named Entity Detection
In Proceedings of the European chapter of the Association for Computational Linguistics (EACL), Avignon, France, 2012.


2011
C. Grouin, Marco Dinarelli, S. Rosset, G. Wisniewski, P. Zweigenbaum
Coreference Resolution in Clinical Reports. The LIMSI Participation in the i2b2/VA 2011 Challenge
In Proceedings of i2b2/VA 2011 Coreference Resolution Workshop, 2011.
Coreference Resolution in Clinical Reports. The LIMSI Participation in the i2b2/VA 2011 Challenge
In Proceedings of i2b2/VA 2011 Coreference Resolution Workshop, 2011.


Marco Dinarelli, S. Rosset
Models Cascade for Tree-Structured Named Entity Detection
In Proceedings of International Joint Conference on Natural Language Processing (IJCNLP), Chiang Mai, Thailand, 2011.
Models Cascade for Tree-Structured Named Entity Detection
In Proceedings of International Joint Conference on Natural Language Processing (IJCNLP), Chiang Mai, Thailand, 2011.


A. Garcia-Fernandez, A.L. Ligozat, Marco Dinarelli, D. Bernhard
When was it written ? Automatically Determining Publication Dates
In proceedings of String Processing and Information Retrieval (SPIRE), Pisa, Italy, 2011.
Draft
When was it written ? Automatically Determining Publication Dates
In proceedings of String Processing and Information Retrieval (SPIRE), Pisa, Italy, 2011.


2010
S. Hahn, Marco Dinarelli, C. Raymond, F. Lefèvre, P. Lehnen, R. De Mori, A. Moschitti, H. Ney, G. Riccardi
Comparing Stochastic Approaches to Spoken Language Understanding in Multiple Languages
IEEE Journal of Transactions on Audio, Speech and Language Processing (TASLP), volume 19, issue 6, pages 1569 - 1583, 2010.
Comparing Stochastic Approaches to Spoken Language Understanding in Multiple Languages
IEEE Journal of Transactions on Audio, Speech and Language Processing (TASLP), volume 19, issue 6, pages 1569 - 1583, 2010.


Marco Dinarelli, A. Moschitti, G. Riccardi
Hypotheses Selection For Re-ranking Semantic Annotations
IEEE Workshop on Spoken Language Technology (SLT), Berkeley, U.S.A., 2010.
Hypotheses Selection For Re-ranking Semantic Annotations
IEEE Workshop on Spoken Language Technology (SLT), Berkeley, U.S.A., 2010.


2009
S. Quarteroni, Marco Dinarelli, G. Riccardi
Ontology-Based Grounding Of Spoken Language Understanding
IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Merano, Italy, 2009.
Ontology-Based Grounding Of Spoken Language Understanding
IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Merano, Italy, 2009.


Marco Dinarelli, A. Moschitti, G. Riccardi
Reranking Models Based On Small Training Data For Spoken Language Understanding
In Proceedings of Empirical Methods for Natural Language Processing (EMNLP), Singapore, 2009.
Reranking Models Based On Small Training Data For Spoken Language Understanding
In Proceedings of Empirical Methods for Natural Language Processing (EMNLP), Singapore, 2009.


S. Quarteroni, G. Riccardi, Marco Dinarelli
What's In An Ontology For Spoken Language Understanding
In Proceedings of Interspeech, Brighton, U.K., 2009.
What's In An Ontology For Spoken Language Understanding
In Proceedings of Interspeech, Brighton, U.K., 2009.


Marco Dinarelli, A. Moschitti, G. Riccardi
Concept Segmentation And Labeling For Conversational Speech
In Proceedings of Interspeech, Brighton, U.K., 2009.
Concept Segmentation And Labeling For Conversational Speech
In Proceedings of Interspeech, Brighton, U.K., 2009.

