Have you ever wished you could share a book you like with a friend? If only you could remember that great quote! This project exports quotes and highlights directly from your ebook reader to your personal website via GitLab Pages.
What do you do when you have thousands of .txt files in Cyrillic and need to transliterate them into the Latin alphabet? I wrote a program that converts all the files in your folders and subfolders. Everything is explained for beginners opening their terminal for the first time.
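To give an idea of what such a batch conversion involves, here is a minimal sketch in Python. It is not the program itself: the mapping table is a simplified Russian romanisation chosen for illustration, and the names `transliterate`, `convert_tree` and the `.latin.txt` suffix are hypothetical. It assumes the files are UTF-8 encoded and writes a converted copy next to each original rather than overwriting it.

```python
from pathlib import Path

# Simplified Russian Cyrillic-to-Latin table (an assumption for this sketch,
# not necessarily the scheme used by the actual program).
LOWER = {
    "а": "a", "б": "b", "в": "v", "г": "g", "д": "d", "е": "e", "ё": "yo",
    "ж": "zh", "з": "z", "и": "i", "й": "y", "к": "k", "л": "l", "м": "m",
    "н": "n", "о": "o", "п": "p", "р": "r", "с": "s", "т": "t", "у": "u",
    "ф": "f", "х": "kh", "ц": "ts", "ч": "ch", "ш": "sh", "щ": "shch",
    "ъ": "", "ы": "y", "ь": "", "э": "e", "ю": "yu", "я": "ya",
}
# Add capital letters: "Ж" becomes "Zh".
TABLE = dict(LOWER)
TABLE.update({k.upper(): v.capitalize() for k, v in LOWER.items()})


def transliterate(text: str) -> str:
    """Replace each Cyrillic letter found in TABLE; leave everything else as is."""
    return "".join(TABLE.get(ch, ch) for ch in text)


def convert_tree(root: str, suffix: str = ".latin.txt") -> list[Path]:
    """Transliterate every .txt file under root, subfolders included.

    Each result is written next to its source with the given suffix,
    so the original files are left untouched.
    """
    written = []
    # sorted() lists the files first, so files created below are not revisited.
    for path in sorted(Path(root).rglob("*.txt")):
        if path.name.endswith(suffix):
            continue  # skip output from a previous run
        target = path.with_name(path.stem + suffix)
        target.write_text(transliterate(path.read_text(encoding="utf-8")),
                          encoding="utf-8")
        written.append(target)
    return written
```

Calling `convert_tree("my_folder")` would return the list of files it wrote. Keeping the originals makes it easy to rerun the conversion after adjusting the table.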
I wrote an ebook analyser. It preprocesses *.epub files: conversion to .txt, removal of editorial metadata, lemmatization, stop-word removal... and all of that for Swedish. This project is a professional partnership with a researcher in Swedish literature analysis.
RSS feeds used to be on every website before Facebook and other closed platforms decided to ban this universal protocol. As a former trainee in information intelligence, I launched my own instance of feed-me-up-scotty to keep watch on the strategic sources of information in my life.
I noticed that researchers at my workplace ended up with black margins and other printing surprises due to poor printer settings, causing a lot of stress and waste. These scripts help you keep control over your print jobs at UU. From your terminal (Linux, Mac) you can print in color, grayscale, double-sided...
Mycroft is the open source platform for voice-controlled home assistance. It is like Alexa / Cortana, except that it is made by the open source community and respects privacy. I created my own Mycroft skill to turn a projector on and off via ssh and a Raspberry Pi. While installing and developing it, I also contributed bug fixes and corrections to the public documentation.
This personal project uses computational linguistics, web scraping, and image recognition to present each day's lunch menus in Uppsala city. 100% open source. Do you want to know what's for lunch today but are too lazy to search all the restaurants? Go to https://lunch.uppsala.ai .
In this professional project for the Association for Computational Linguistics, I helped archive old conference papers. This project represents: 200+ inconsistent webpages from the 2000s converted into CSV files, 3000+ PDFs inserted into the new scientific database, and 10 000+ names of scientists, article titles and PDF links scraped and normalised.
Keywords: distant reading, digital humanities and philology, discourse analysis, rhetoric
I build tools for literature analysis. My specialty is detecting figures of speech. I work especially on figures of repetition (the yellow ones in the periodic table above).
I also attach great importance to the application of my research. You will find demos, code, and samples that I provide. They will give you a concrete experience of what I do.
Dubremetz, Marie and Nivre, Joakim (2018) Rhetorical Figure Detection: Chiasmus, Epanaphora, Epiphora. Frontiers in Digital Humanities. 5:10. doi: 10.3389/fdigh.2018.00010
Dubremetz, Marie and Nivre, Joakim (2016). Syntax Matters for Rhetorical Figure Detection: the Case of Chiasmus. In Computational Linguistics for Literature (CLFL 2016). San Diego, United States.
Litteræ et Linguæ. Rhetoric Workshop (February 2016). Uppsala, Sweden.
Dubremetz, Marie and Nivre, Joakim (2015). Rhetorical Figure Detection: the Case of Chiasmus. In Computational Linguistics for Literature (CLFL 2015). Denver, United States.
Dubremetz, Marie and Nivre, Joakim (2014). Extraction of nominal multiword expressions in French. In Proceedings of the 10th Workshop on Multiword Expressions (MWE). Gothenburg, Sweden.
A beginner workshop for learning the terminal, suitable for both Windows and Unix users.
I am one of the main instructors, together with Karl Berglund and Ekta Vats, for the course "Cultural analytics". I teach the command line and web scraping.
Basic text analysis. Lectures given in English. I taught computational linguistics techniques such as lemmatisation, tokenization, finite state transducers, HMMs and more. I was the main person responsible for this course, with duties such as managing the schedule, designing and correcting assignments, giving individual support to students, and preparing labs. The audience consisted of first-year bachelor students with heterogeneous profiles. I developed pedagogical tools such as team-based MCQs and flash cards.