Publications
2024
-
Samia Touileb, Jeanett Murstad, Petter Mæhlum, Lubos Steskal, Lilja Charlotte Storset, Huiling You, and Lilja Øvrelid. EDEN: A Dataset for Event Detection in Norwegian News. Proceedings of The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation. Torino, Italy, May 20-25, 2024.
-
A chapter in the book Current Issues in English Teaching, published by Fagbokforlaget. The chapter is tilted ``Large Language Models and their Usage in Education’’.
2023
-
Helene Bøsei Olsen, Samia Touileb, Erik Velldal. Arabic dialect identification: An in-depth error analysis on the MADAR parallel corpus. Proceedings of The First Arabic Natural Language Processing Conference (ArabicNLP 2023). Singapore, Desember 7, 2023.
-
Sophie Blum, Raoul Koudijs, Ana Ozaki, Samia Touileb. Learning Horn Envelopes via Queries to Neural Networks: The BERT Case. International Journal of Approximate Reasoning.
-
Huiling You, Lilja Øvrelid, Samia Touileb. JSEEGraph: Joint Structured Event Extraction as Graph Parsing. Proceedings of The 12th Joint Conference on Lexical and Computational Semantics. Toronto, Canada, July 13-14, 2023.
-
Ghazaal Sheikhi, Andreas Opdhal, Samia Touileb, Vinay Setty. Making Sense of Nonsense: Integrated Gradient-based Input Reduction for Check-worthy Claim Detection. Proceedings of The 2023 symposium of the Norwegian AI Society. Bergen, Norway, June 14-15, 2023.
-
Rustam Galimullin, Samia Touileb. Proceedings of the 5th Symposium of the Norwegian AI Society. Proceedings of the 5th Symposium of the Norwegian AI Society; Volum 3431.100. CEUR Workshop Proceedings.
-
Jeremy Barnes, Samia Touileb, Petter Mæhlum, Pierre Lison. Identifying Token-Level Dialectal Features in Social Media. Proceedings of the The 24rd Nordic Conference on Computational Linguistics (NoDaLiDa2023). Tórshavn, Faroe Islands, May 22-24, 2023.
-
David Samuel, Andrey Kutuzov, Samia Touileb, Erik Velldal, Lilja Øvrelid, Egil Rønningstad, Elina Sigdel, Anna Sergeevna Palatkina. NorBench – A Benchmark for Norwegian Language Models. Proceedings of the The 24rd Nordic Conference on Computational Linguistics (NoDaLiDa2023). Tórshavn, Faroe Islands, May 22-24, 2023.
-
Ghazaal Sheikhi, Samia Touileb, Sohail Ahmed Khan. Automated Claim Detection for Fact-checking: A Case Study using Norwegian Pre-trained Language Models. Proceedings of the The 24rd Nordic Conference on Computational Linguistics (NoDaLiDa2023). Tórshavn, Faroe Islands, May 22-24, 2023.
-
Samia Touileb, Lilja Øvrelid, Erik Velldal. Measuring Normative and Descriptive Biases in Language Models Using Census Data. In Proceedings of the The 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL2023). EACL 2023.
2022
-
Samia Touileb, Deborra Nozza. Measuring Harmful Representations in Scandinavian Language Models. In Proceedings of the 5th Workshop on NLP and CSS workshop. Workshop at The Conference on Empirical Methods in Natural Language Processing (EMNLP2022). NLP+CSS, at EMNLP 2022.
-
Huiling You, David Samuel, Samia Touileb, Lilja Øvrelid. Event identification and extraction as a graph parsing problem. In Proceedings of the 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE202). Workshop at The Conference on Empirical Methods in Natural Language Processing (EMNLP2022).
-
Huiling You, David Samuel, Samia Touileb, Lilja Øvrelid. A General Graph-based Approach to Protest Event Extraction. In Proceedings of the 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE202). Workshop at The Conference on Empirical Methods in Natural Language Processing (EMNLP2022).
-
Samia Touileb. Exploring the Effects of Negation and Grammatical Tense on Bias Probes. In Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (AACL-IJCNLP2022).
-
Samia Touileb. NERDz: A Corpus of Named Entities for Algerian. In Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (AACL-IJCNLP2022).
-
Petter Mæhlum, Andre Kåsen, Samia Touileb, Jeremy Barnes. Annotating Norwegian language varieties on Twitter for Part-of-speech. In Proceedings of the Ninth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial2022). Workshop at the 29th International Conference on Computational Linguistics (COLING2022).
-
Samia Touileb, Lilja Øvrelid, Erik Velldal. Occupational Biases in Norwegian and Multilingual Language Models. In Proceedings of the 3rd Workshop on Gender Bias in Natural Language Processing. Workshop at the Northern Chapter of the Association for Computational Linguistics (NAAACL2022).
-
Andrey Kutuzov, Samia Touileb, Petter Mæhlum, Tita Ranveig Enstad, Alexandra Wittemann. NorDiaChange: Diachronic Semantic Change Dataset for Norwegian. Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC 2022). Marseille, France; June 20-25, 2018.
2021
-
Samia Touileb, Lilja Øvrelid, Erik Velldal. Using Gender- and Polarity-Informed Models to Investigate Bias. In Proceedings of the 2nd Workshop on Gender Bias in Natural Language Processing. Workshop at the Association for Computational Linguistics (ACL2021). Bangkok, Thailand, August 1-6, 2021.
-
Samia Touileb, Jeremy Barnes. The interplay between language similarity and script on a novel multi-layer Algerian dialect corpus. Proceedings of the 59th annual meeting of the Association for Computational Linguistics: Findings of ACL2021. Bangkok, Thailand, August 1-6, 2021.
-
Jeremy Barnes, Petter Mæhlum, Samia Touileb. NorDial: A Preliminary Corpus of Written Norwegian Dialect Use. Proceedings of the 23rd Nordic Conference on Computational Linguistics, NoDaLiDa2019. Reykjavik, Iceland; May 31-June 2, 2021. All three authors have contributed equally, and are therefore ordered alphabetically.
-
Nizar Habash, Houda Bouamor, Hazem Hajj, Walid Magdy, Wajdi Zaghouani,Fethi Bougares, Nadi Tomeh, Ibrahim Abu Farha, Samia Touileb. Proceedings of the Sixth Arabic Natural Language Processing Workshop. In Proceedings of the Sixth Arabic Natural Language Processing Workshop, EACL 2021. Online, April 19th, 2021
2020
-
Samia Touileb, Lilja Øvrelid, Erik Velldal. 2020. Gender and sentiment, critics and authors: a dataset of Norwegian book reviews. In Proceedings of the 2nd Workshop on Gender Bias in Natural Language Processing. Workshop at The 28th International Conference on Computational Linguistics (COLING). Barcelona, Spain; December 8-13. 2020.
-
Samia Touileb. 2020. LTG-ST at NADI Shared Task 1: Arabic Dialect Identification using a Stacking Classifier. In Proceedings of the Fifth Arabic Natural Language Processing Workshop (WANLP 2020). Workshop at The 28th International Conference on Computational Linguistics (COLING). Barcelona, Spain; December 8-13. 2020.
-
Pierre Lison, Aliaksandr Hubin, Jeremy Barnes, and Samia Touileb. 2020. Named Entity Recognition without Labeled Data: A Weak Supervision Approach. In Proceedings of The 58th annual meeting of the Association for Computational Linguistics (ACL2020). Seattle, USA; July 5-10, 2020.
-
Wafia Adouane, Samia Touileb, and Jean-Philippe Bernardy. 2020. Identifying Sentiments in Algerian Code-switched User-generated Comments. Proceedings of The 12th Language Resources and Evaluation Conference LREC2020. Marseilles, France.
2019
-
Jeremy Barnes, Samia Touileb, Lilja Øvrelid, and Erik Velldal. 2019. Lexicon information in neural sentiment analysis: a multi-task learning approach. Proceedings of the 22nd Nordic Conference on Computational Linguistics, NoDaLiDa2019. Turku, Finland; October 1-2, 2019.
-
Julia Rodina, Daria Bakshandaeva, Vadim Fomin, Andrei Kutuzov, Samia Touileb, and Erik Velldal. 2019. Measuring Diachronic Evolution of Evaluative Adjectives with Word Embeddings: the Case for English, Norwegian, and Russian. Proceedings of the 1st International Workshop on Computational Approaches to Historical Language Change. Association for Computational Linguistics. Workshop at ACL2019. Florence, Italy; July 28 - August 2, 2019.
2018
-
Samia Touileb, Truls Pedersen, and Helle Sjøvaag. 2018. Automatic identification of unknown names with specific roles. Proceedings of the Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, COLING 2018. Santa Fe, USA; August 20-25, 2018.
-
Erik Velldal, Lilja Øvrelid, Eivind Aleksander Bergem, Cathrine Stadsnes, Samia Touileb, and Fredrik Jørgensen. 2018. NoReC: The Norwegian review corpus. Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018). Miyazaki, Japan; May 7-12, 2018.
2017
- Samia Touileb. 2017. Automatically Inducing Information Structures – A Text Mining Approach Based on the Distributional Hypothesis (PhD thesis). University of Bergen 2017.
2016
-
Samia Touileb and Lubos Steskal. 2016. ADIOS LDA: When Grammar Induction Meets Topic Modeling. NIK: Norsk Informatikkonferanse 2016. Bergen, Norway; November 28-30, 2016.
- Samia Touileb and Katherine Duarte. 2016. Getting to know large newsflows: Automatically induced information structures as keyphrases for news content analysis. Proceedings of Workshop on Natural Language Processing meets Journalism, at the International Joint Conference on Artificial Intelligence (IJCAI-16). New York, USA; July 09-15, 2016.
2014
-
Samia Touileb and Andrew Salway. 2014. Construction: a new unit of analysis for corpus-based discourse analysis. Proceedings of the 28th Pacific Asia Conference on Language, Information and Computation (PACLIC 28). Phuket, Thailand; December 12-14, 2014.
-
Andrew Salway, Samia Touileb and Endre Tvinnereim. 2014. Inducing Information Structures for Data-driven Text Analysis. In Proceedings of the ACL 2014 Workshop on Language Technologies and Computational Social Science. Baltimore, USA; June 22-27, 2014.
-
Andrew Salway and Samia Touileb. 2014. Applying Grammar Induction to Text Mining. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (ACL2014). Baltimore, USA; June 22-27, 2014.
2013
- Andrew Salway, Knut Hofland and Samia Touileb. 2013. Applying Corpus Techniques to Climate Change Blogs. Corpus Linguistics Conference CL2013. Lancaster University, UK; July 22-26, 2013.