Posts by Tag

Machine Learning

1 Year of a Challenging Big-Bench Task

16 minute read

In 2021 I contributed to the Big-Bench suite of NLP tasks, aiming to probe the abilities of large language models. Inspired by sports, I developed a task aim...

SPANBERT

less than 1 minute read

What is the name of the SpanBERT paper? SpanBERT: Improving Pre-training by Representing and Predicting Spans Mandar Joshi, ...

TransformerXL

2 minute read

Introduction Transformer models typically have a fixed context window that is hard to scale due to the $O(n^2)$ cost of the attention mechanism. Extending th...

Back to Top ↑

Paper

Back to Top ↑

Reading

Back to Top ↑

NLP

TransformerXL

2 minute read

Introduction Transformer models typically have a fixed context window that is hard to scale due to the $O(n^2)$ cost of the attention mechanism. Extending th...

Back to Top ↑

Transformer

SPANBERT

less than 1 minute read

What is the name of the SpanBERT paper? SpanBERT: Improving Pre-training by Representing and Predicting Spans Mandar Joshi, ...

TransformerXL

2 minute read

Introduction Transformer models typically have a fixed context window that is hard to scale due to the $O(n^2)$ cost of the attention mechanism. Extending th...

Back to Top ↑

Parameter Efficiency

Back to Top ↑

xgboost

XGboost Part 1: Gradient Boosting

4 minute read

Introduction Xgboost is a powerful yet simple algorithm that has achieved state of the art results on tabular datasets. The Xgboost algorithm uses an ensembl...

Back to Top ↑

computer_vision

Back to Top ↑

parallelization

Back to Top ↑

audio

Back to Top ↑

self_supervised_learning

Back to Top ↑

parameter_efficiency

Back to Top ↑

adapters

Back to Top ↑

Transformers

Back to Top ↑

finetuning

Back to Top ↑

SentenceEmbeddings

Back to Top ↑

Prompting

Back to Top ↑

Few Shot Learning

Back to Top ↑

Soft Prompting

Back to Top ↑

Paper Summaries

Back to Top ↑