Transformer distilled, Part 1 of 2
July 1, 2022

- Scaled dot-product
- Softmax and multi-head attention
- Linear layers
- Learned embeddings