GPT-OSS is an open source, open weight model by OpenAI with Apache 2.0 license. It is an autoregressive mixture-of-experts (MoE) model that specialized in reasoning (chain of thought), tool use, and support structured output. Similar to Llama architecture, it uses pre-norm with RMS norm and each expert is a SwiGLU-based...
[more]
RF603N II for Nikon Z5ii
I have a few Yongnuo RF603N II remote flash units. They are old but works. Their primary use is for remote flash over its hotshoe, but they also allows remote shutter release via the same 2.4GHz radio. It comes with a 10-pin Nikon cable that works for older high-end DSLRs...
[more]
How to Read MTF Chart
MTF stands for Modulation Transfer Function. It illustrates the optical quality
of camera lens, a.k.a. “sharpness”. MTF chart is a curve of the transmittance
of light (0 to 100%) against the distance from the centre of the image towards
its edges (in millimetres).
[more]
Zhang et al (2024) TinyLlama
This paper is a report on training the TinyLlama model, following the Llama 2 architecture but with 1.1B parameters.
The objective is to experiment with the idea that a smaller model but trained with more data can match or overperform a larger model.
[more]
Gerber (2025) Attention Is Not All You Need
This is a paper to investigate the importance of the MLP sublayer in a
decoder-only transformer model. In particular, the common design of a two-layer
feed-forward network is challenged. Alternatives of 0 to 3 layers of
feed-forward networks are compared for the standard error loss.
[more]
