Academics
  /  
Courses
  /  
Descriptions
COMP_SCI 397: Seminar in Statistical Language Modeling


VIEW ALL COURSE TIMES AND SESSIONS

Prerequisites

CS-349 (or equivalent) and intermediate Python

Description

Statistical language models assign probabilities to sequences of words, and are used in systems that perform text summarization, machine translation, question answering, information extraction, text generation (dialogue system) and many other tasks. In recent years, language models based on deep neural networks have dramatically improved the state of the art. This course will cover the fundamental technologies comprising statistical language models, such as word embedding methods, recurrent neural networks and transformers, and selected natural language processing tasks by reading canonical research papers that helped define the field. Students will be required to read and present research papers and to complete a substantial research project. This is a research-focused class, with several groups submitting papers to major conferences in the past.

TEXTBOOK: None (the class consists of reading/presenting research papers)

COMPUTER USAGE: Python, PyTorch, GPUs and Huggingface codebase

GRADING: A/B/C

COURSE INSTRUCTOR: David Demeter

COURSE COORDINATOR: David Demeter