Final Project

Default research project: MinBERT Base implementation

Use base given code to implement MinBERT
Utilize pre-trained model weights + embeddings to perform sentiment analysis on two datasets
Train for sentiment classification
Extend: how to build robust embeddings that perform well across a wide range of tasks, not just one
Adjust BERT embeddings to perform the following 3 tasks
1. Sentiment analysis
2. Paraphrase detection
3. Semantic textual similarity
Find relevant research paper for each improvement (some suggestions given)

Notes on finding research projects (how to build an economic model in your spare time)

Getting ideas
1. Journals not great
2. Look in media, news, etc. that aren’t about your topic area
3. Conversations with people in business
Think through your own idea independently, somewhat thoroughly
Go find somebody else that did your idea but 10x better- ask yourself why you missed what they did
Give seminar
Planning

Winning default papers

Walk Less + Only Down Smooth Valleys
1. Pretrained embeddings from BERT for 3 fine-grained tasks
2. First- test ability to be tuned towards sentence sentiment classification only
3. Then implement SMART, which aims to tackle overfitting
4. Apply multitask learning approaches that learn on all 3 aforementioned tasks
Basically
1. Investigation of pre-trained + fine-tuned BERT model on 3 downstream prediction tasks when including
  1. Regularization (SMART)
  2. Multitask Learning with task-specific datasets
  3. Rich relational layers that exploit similarity between tasks
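
For reference, the smoothness-inducing regularizer that SMART adds to the fine-tuning loss can be sketched as below. The notation is an assumption taken from the SMART paper's formulation, not from these notes: f is the model, θ its parameters, L(θ) the usual task loss, ℓ_s a divergence between outputs (symmetric KL for classification), ε the perturbation radius, and λ_s the regularization weight.

```latex
\min_{\theta} \; \mathcal{F}(\theta) = \mathcal{L}(\theta) + \lambda_s \mathcal{R}_s(\theta),
\qquad
\mathcal{R}_s(\theta) = \frac{1}{n} \sum_{i=1}^{n}
  \max_{\|\tilde{x}_i - x_i\|_p \le \epsilon}
  \ell_s\big(f(\tilde{x}_i; \theta),\, f(x_i; \theta)\big)
```

The second term penalizes outputs that change a lot under small perturbations of the input embeddings, which is how it is meant to limit overfitting during fine-tuning.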
Approach
1. Starting point: BERT
2. Focusing on the 3 specific fine-tuning tasks (set up basic baselines)
3. Extending
  1. Regularization of loss + optimizer step (SMART)
    1. Coded up themselves
  2. Round-robin multitask fine-tuning
    1. Baseline BERT assumes fine-tuning only on sentiment classification generalizes well to paraphrase detection and similarity prediction tasks- not true
    2. Instead, implement batch-level round-robin MTL routine (SST, paraphrase, and STS data)
    3. Test 2 versions
  3. Rich relational layer combining similar tasks
    1. Adapt model to handle relations across tasks
4. Experiments 1.
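
The batch-level round-robin routine from the Approach list can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function name `round_robin_batches`, the toy batch names, and the choice to let a task drop out once its data is exhausted are all assumptions made for the example.

```python
def round_robin_batches(task_loaders):
    """Yield (task_name, batch) pairs, taking one batch per task in turn.

    task_loaders: dict mapping a task name to any iterable of batches.
    A task drops out once its batches run out (an assumption; one could
    instead restart short loaders to keep the tasks balanced).
    """
    iterators = {name: iter(loader) for name, loader in task_loaders.items()}
    active = list(iterators)
    while active:
        # Iterate over a copy so exhausted tasks can be removed safely.
        for name in list(active):
            try:
                batch = next(iterators[name])
            except StopIteration:
                active.remove(name)
                continue
            yield name, batch


# Toy stand-ins for the SST, paraphrase, and STS data loaders.
loaders = {
    "sst": ["sst_b0", "sst_b1", "sst_b2"],
    "para": ["para_b0", "para_b1"],
    "sts": ["sts_b0"],
}
schedule = list(round_robin_batches(loaders))
```

In a training loop, each yielded batch would go through the shared BERT encoder and the head for that task, followed by one optimizer step, so no single task dominates a stretch of updates.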

Finding research topics

Generally 2 ways in science

Most projects

Find application / task + explore how to approach / solve it effectively, often with existing model
Implement complex neural arch. + demonstrate performance
- Ideally find some way to tweak it- make it better (kind of #3)
Come up with new / variant neural network model + explore empirical success
Analysis project- analyze behavior of model- how it represents linguistic knowledge or what kinds of phenomena it can handle / errors it makes
Rare theoretical project

Examples

Using LSTM to generate lyrics (adding components for metric structure + rhyme)
Complex neural model: implement Differentiable Neural Computers + get to work (I believe building an implementation of existing closed source paper)
Got published- showed improvements to RNNLMs (Title: Improving Learning through Augmenting the Loss)
Quantization of word vectors
- Counted for class because evaluated on natural language tasks

Finding a place to start

If you want to beat the state of the art, look at leaderboards

Modern Deep Learning NLP

Most works- download big pre-trained model + work from there
Recommended for practical projects-
- Transformer from Huggingface
- Load a big pre-trained language model
- Fine tune it for our task
- Test it

Exciting areas now

Evaluating / improving models for something other than accuracy
Empirical work on what PLMs have learned
Transfer learning with little data
Low resource stuff
Addressing bias
Scaling models down (pruning, quantization, etc.)
More advanced functionality (compositionality, generalization, fast learning (e.g. meta learning) on smaller problems)

Datasets

Example of doing research (e.g. for applying NN to summarization)

Define task
1. Summarization
Define dataset
1. Search for academic datasets (already have baselines, helpful)
  1. e.g. Newsroom summarization dataset
2. Or- define your own dataset
  1. Fresh problem
  2. Be creative
Dataset hygiene
1. Separate test and dev test data splits
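
A minimal sketch of carving off the dev and test splits before any modeling starts. The function name `split_dataset`, the 80/10/10 proportions, and the fixed seed are illustrative assumptions, not something prescribed in these notes.

```python
import random


def split_dataset(examples, dev_frac=0.1, test_frac=0.1, seed=0):
    """Shuffle once with a fixed seed, then carve off test and dev splits."""
    items = list(examples)
    random.Random(seed).shuffle(items)  # fixed seed keeps the split reproducible
    n_test = int(len(items) * test_frac)
    n_dev = int(len(items) * dev_frac)
    test = items[:n_test]
    dev = items[n_test:n_test + n_dev]
    train = items[n_test + n_dev:]
    return train, dev, test


train, dev, test = split_dataset(range(100))
```

Saving the three splits to disk right away, and not looking at the test file again until the end, keeps later experiments from leaking test data.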
Define metric
1. Search for well-established metrics
2. Summarization: ROUGE or human eval
Establish baseline
1. Implement simple model first
2. Summarization: LEAD-3 baseline
3. Compute metrics on train AND dev, NOT test
4. Often will have errors- analyze
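
The LEAD-3 baseline and a ROUGE-style score can be sketched as below. Both pieces are simplified assumptions for illustration: the sentence splitter is a naive regex, and `rouge1_f1` is a plain unigram-overlap F1 without stemming, not the official ROUGE toolkit. The function names and the toy document are hypothetical.

```python
import re
from collections import Counter


def lead3(document):
    """LEAD-3 baseline: return the first three sentences as the summary."""
    # Naive sentence splitter (assumption): break after ., ! or ? plus whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", document.strip())
    return " ".join(sentences[:3])


def rouge1_f1(candidate, reference):
    """Unigram-overlap F1, a simplified stand-in for ROUGE-1."""
    cand = Counter(re.findall(r"\w+", candidate.lower()))
    ref = Counter(re.findall(r"\w+", reference.lower()))
    overlap = sum((cand & ref).values())  # clipped unigram matches
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


doc = "The cat sat. It purred. Then it slept. Later it ate. Finally it left."
summary = lead3(doc)
score = rouge1_f1(summary, "The cat sat and then slept.")
```

Running this kind of scorer over the train and dev splits gives the number that any neural model has to beat, and reading the lowest-scoring summaries is a quick way into the error analysis.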
Implement existing neural net model
1. Compute metric on train + dev
2. Analyze output + error
Always be close to the data (except final test set)
1. Visualize dataset
2. Collect statistics
3. Look at errors
4. Analyze hyperparameters
Try out different models + variants (set up quick iteration)
1. Fixed-window NN
2. RNN
3. Recursive NN
4. CNN
5. Attention based Model
6. Etc.
Ideally only use test set once.

Getting NNs to train

Pablo's Reference Notes