Current Topics in Computational Linguistics

Universität Potsdam

Guidelines

Detailed guidelines for this seminar.

Each one of you will be the blogger once during the semester, working on one paper of your choice. Every participant is always a debater in the audience throughout the semester, except during the week when she/he is a blogger.

During the first week of the course, I will publish the schedule with papers and assigned participants, based on those who have enroled at Moodle (so please do it) and as far as possible on your paper preferences.

In the first (probably two) weeks there will be no bloggers: I have posted the introductory articles we will read together and I will propose the discussion topics. The role of debaters comes into force for every participant as of April 20.

Blogger

Duties

Instructions

A few examples of blog posts about papers: here, here and here. This one and this one have a good overview of RL, that I recommend you to read before the course starts.

Getting used to reading papers after years of nice textbooks takes a while. These slides may be a place to start, and you can easily find more material on the internet (search for how to read a scientific paper). If you need inspiration on how to critically evaluate a paper, pretend you are an ACL reviewer following their guidelines.

Feel free to contact me if you have any trouble. I do not expect you to be experts, neither on scientific literature nor on RL.

Debaters

Duties

Be sincere on the feedback and provide constructive suggestions. This will be helpful for everyone. Do not write general comments that do not mean much (like ‘the post was very useful’). Instead, talk about the structure of the text, how information was given, what caused you confusion etc.

Final project

You can choose between writing a literature review article or implementing (Python code+report) a model and running experiments.

Review article: summarize the current state of understanding in applications of Reinforcement Learning to Natural Language Processing into an article. Follow the points in the Wikipedia’s link above: mention the main people working in a field, recent major advances and discoveries, significant gaps in the research, current debates and ideas of where research might go next. Notice the idea is not describing paper by paper, by building a structured general analysis by finding patterns and similarities among all (or as many as possible) the papers listed on the bibliography (and any other you discover during the research). You can find great instructions here.

Some examples of good review articles: the second paper we covered, this one about incremental processing and this one on corpora for dialog systems (this is a long one, though).

Experiments: implement a Reinforcement Learning approach applied to a Natural Language Processing task and run experiments. I do not expect originality, but it cannot be an exact copy of a work that already exists. Some idea, even if incremental, must be your own. Or your can use ideas from the papers that have not been tested yet. It is not a problem if your approach does not lead to good results, as long as you explain your motivation and describe what can be the causes for such results. Please implement it in Pyhton.

The final report should follow the structure of a paper: introduction, related word, description of your model, results of your experiments and conclusion. If your choose this option, send me a short proposal (half a page) describing what you intend to do by July 30.

Format

In both cases, for you to practice how a real paper submission works, write your article following the formatting guidelines of long papers either of COLING2020 or EMNLP2020. The only difference is that your report must have exactly 9 pages plus blibliography and (if you need) appendix.

There will be a link in Moodle for you to submit a pdf. Those who opt for the experiments must also submit a compressed file with your code or point to a repository.

Submission deadline for the final project: September 30, 23:59.