Course Description

Because one of the primary goals of cognitive science has been to reverse-engineer the human mind, there have been periods of intense interaction between the field and engineering developments in related fields like computational linguistics. The recent development of (large) language models that are capable of producing large quantities of fluent text has (i) re-raised some fundamental questions about what can be learned from data alone, (ii) invited comparisons between language model processing and human language processing, and (iii) enabled new forms of controlled experimentation that can illuminate debates on acquisition. This seminar will be a focused reading group, where we will closely read a body of recent work at the intersection of computational linguistics and cognitive science, centered around questions like: Must certain linguistic biases be innate or can they be learned from data? When and why are language models good predictors of human linguistic behavior? A recurring focus will be using language models as model learners through the systematic manipulation of the training data along various axes.

Days	Time	Location
Mondays	3:30 - 5:50 PM	SAV 130 https://washington.zoom.us/j/99209774552

Note: while the course will occur live at the above time and location, it will also be recorded and posted to the course Canvas page.

Teaching Staff

Role	Name	Office	Office Hours
Instructor	Shane Steinert-Threlkeld	GUG 415K and https://washington.zoom.us/my/shanest	Monday 3-5PM

Policies

As a project-oriented, student-driven, seminar-style class, active participation---in the classroom, or in Zoom, as well as on Canvas---is expected. Primary course-time will be conducted as a "role-playing reading seminar", where each person reads a paper with a particular role / perspective and uses that to kick off discussion (see Course Structure for more information).

A final project will be carried out in small groups. Groups are free to divide up work as they see fit, but will be required to explain the division of labor with their final project. Except under rare circumstances, every member of a group will receive the same grades.

We understand that you may face hard times as we navigate an ever-changing world due to the COVID-19 pandemic and many other world events. If you find yourself struggling with a difficult concept; stressed over politics or health; slowed by monopolistic internet providers; or annoyed at a classmate, please remember that they might feel similar. Maybe not in your very moment, but certainly recently or soon. Some of you may find the return to hybrid teaching conducive to your style of learning and personality. Others may find it stressful or difficult. These are all normal reactions. Please have compassion and empathy, and assume that everyone is doing their best.

If you find yourself having trouble learning in class, please do not hesitate to let me know. My goal is to make this class a bright spot in these unprecedented times, and to do whatever we can to promote a healthy learning environment for all.

A note on time zones

All deadlines and meeting times for this class are in "Pacific Time". Now that we are in Daylight Savings Time, this is UTC-7.

Grading

60% weekly reading assignments
40% final project. More information will be released on Canvas, but there will be three families of final project: literature review, replication study, or new experimental design.

Communication

Any questions concerning course content and logistics should be posted on the Canvas discussion board. If a more personal issue arises, you can email me personally; include "LING575" in the subject line. You can expect responses from teaching staff within 24 hours, but only during normal business hours, and excluding weekends.

Religious Accommodation

Washington state law requires that UW develop a policy for accommodation of student absences or significant hardship due to reasons of faith or conscience, or for organized religious activities. The UW's policy, including more information about how to request an accommodation, is available at Religious Accommodations Policy (https://registrar.washington.edu/staffandfaculty/religious-accommodations-policy/). Accommodations must be requested within the first two weeks of this course using the Religious Accommodations Request form (https://registrar.washington.edu/students/religious-accommodations-request/).

Access and Accommodations

Your experience in this class is important to me. If you have already established accommodations with Disability Resources for Students (DRS), please communicate your approved accommodations to me at your earliest convenience so we can discuss your needs in this course.

If you have not yet established services through DRS, but have a temporary health condition or permanent disability that requires accommodations (conditions include but not limited to; mental health, attention-related, learning, vision, hearing, physical or health impacts), you are welcome to contact DRS at 206-543-8924 or uwdrs@uw.edu or disability.uw.edu. DRS offers resources and coordinates reasonable accommodations for students with disabilities and/or temporary health conditions. Reasonable accommodations are established through an interactive process between you, your instructor(s) and DRS. It is the policy and practice of the University of Washington to create inclusive and accessible learning environments consistent with federal and state law.

Safety

Call SafeCampus at 206-685-7233 anytime---no matter where you work or study---to anonymously discuss safety and well-being concerns for yourself or others. SafeCampus's team of caring professionals will provide individualized support, while discussing short- and long-term solutions and connecting you with additional resources when requested.

Course Structure

This course will function as a role-playing seminar: each week, we will read and discuss two papers as a group. Students will sign up for a role for one of the papers, and will be expected to contribute to the discussion from that perspective. Given the size of the seminar, you will be expected to sign up for a role-playing presentation 4 out of 8 weeks. The remaining 4 weeks, you will do a written response to one of the papers to help develop critical reading and writing skills. (More information is on Canvas.) The roles:

🕵 Archaeologist: Determine where this paper sits in the wider landscape of previous and subsequent work. More concretely: find and report on one prior paper that substantially influenced the current paper and one newer paper that cites this current paper. (You can use Google Scholar or SemanticScholar to find papers citing the current paper.)
🔬 Reproducer: You are attempting to reproduce the main results of the paper, either to confirm its conclusions, or to carry out similar experiments. If you are unable to reproduce the results, try to understand why. If you are able to, explain what helped the most. (Note: you are not expected to actually reproduce papers for this class. Rather, assume that you are trying to, and report on what you find on whether you would be able to.)
🧑‍🔬 Researcher: Based on the current paper, propose an experiment or analysis that would extend the work in a new direction. This could be a new dataset, a new model, a new evaluation metric, etc. In particular, try to think of something that is only possible thanks to the current paper, not just a simple extension. (Note: these can serve as inspiration for your final project.)
⚖ Reviewer: Write a referee report for the current paper, following the form for ACL Rolling Review (up through the "Overall Assessment"). Aim to approach papers charitably, but critically, thinking of how the paper can be made into the best version of itself. Their reviewer tutorial may also be helpful.

Each week, a shared Google Slides deck will be created, and each role-player will be expected to contribute a couple of slides, summarizing their view of the paper from their role. The deck will be shared with the class, and will be used to guide the discussion of the paper.

For the weeks that you are not a role-player, you will write a short (~250 word) response to one of the papers. These responses should critically engage with the paper by (i) offering a brief summary, (ii) identifying and explaining one strength of the paper, and (iii) identifying and explaning one weakness of the paper. These responses will be written in a private shared Google Doc and should be completed by the night before the seminar in order to further help stimulate discussion.

Non-presenter assignment: for the other paper that week, for which you have not signed up for one of the roles or written a response, you will be asked to (i) contribute an alternative title for the paper and (ii) at least one question about the paper (e.g. something that was confusing, or someothing that you want to hear more about).

Because of the length of the seminar meetings, we will take a 10-minute break in the middle of each class, in between the discussion of each paper.

One final note: in the interest of this being a student-led seminar, the reading list below can change based on the interests of the participants and everyone's projects. If you have a paper that you would like to discuss, please let me know.

Date	Topic	Primary Readings	Supplementary Readings	Additional info
Apr 2	Introduction
Apr 9	Filtered corpus training, I	Filtered Corpus Training (FiCT) Shows that Language Models Can Generalize from Indirect Evidence Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNs	What Artificial Neural Networks Can Tell Us About Human Language Acquisition Language Models Use Monotonicity to Assess NPI Licensing
Apr 16	Filtered corpus training, II	Generating novel experimental hypotheses from language models: A case study on cross-dative generalization Testing learning hypotheses using neural networks by manipulating learning data	Networks and Theories: The Place of Connectionism in Cognitive Science Black Big Boxes: Do Language Models Hide a Theory of Adjective Order? Both Direct and Indirect Evidence Contribute to Dative Alternation Preferences in Language Models	Proposal guidelines out
Apr 23	Quantity of data, I	How poor is the stimulus? Evaluating hierarchical generalization in neural networks trained on child-directed speech Is Child-Directed Speech Effective Training Data for Language Models?	Bridging the data gap between children and large language models Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora Findings of the Second BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
Apr 30	Quantity of data, II	Transformer-Based Language Model Surprisal Predicts Human Reading Times Best with About Two Billion Training Tokens Artificial Neural Network Language Models Predict Human Brain Responses to Language Even After a Developmentally Realistic Amount of Training	Frequency Explains the Inverse Correlation of Large Language Models' Size, Training Data Amount, and Surprisal's Fit to Reading Times Scaling laws for language encoding models in fMRI	Proposal due
May 7	Acquisition trajectories	Word Acquisition in Neural Language Models Language acquisition: do children and language models follow similar learning stages?	The Roles of Neural Networks in Language Acquisition WordBank and the corresponding book: Variability and Consistency in Early Language Learning: The Wordbank Project Predicting Age of Acquisition for Children's Early Vocabulary in Five Languages Using Language Model Surprisal Investigating Critical Period Effects in Language Acquisition through Neural Language Models Connectionist Models of Second Language Acquisition and Processing
May 14	Multimodal data	Grounded language acquisition through the eyes and ears of a single child Visual Grounding Helps Learn Word Meanings in Low-Data Regimes	Experience Grounds Language Language Models, World Models, and Human Model-Building
May 21	No class (Shane traveling for conference)
May 28	Formal language pretraining	Injecting structural hints: Using language models to study inductive biases in language learning Between Circuits and Chomsky: Pre-pretraining on Formal Languages Imparts Linguistic Biases	Examining the Inductive Bias of Neural Language Models with Artificial Languages Evaluating Transformer's Ability to Learn Mildly Context-Sensitive Languages	Final paper guidelines out
Jun 4	Impossible and implausible languages	Mission: Impossible Language Models Can Language Models Learn Typologically Implausible Languages?	Kallini et al. (2024) Do Not Compare Impossible Languages with Constituency-based Ones What Languages are Easy to Language-Model? A Perspective from Learning Probabilistic Regular Languages