Tutorial
- GitHub Crawler: Beyond Basic Scraping (04 Jul 2021)
When scraping makes sense, when it doesn't, and how to do it responsibly. Building a GitHub user crawler with async programming.
- Introduction to Web Scraping with Python (25 Jun 2021)
An introduction to extracting data from the web. HTML structure, DOM traversal, and building a working crawler with Python and BeautifulSoup.
- Common Metrics Derived From the Confusion Matrix (15 Jun 2021)
The practical side of classification metrics. Precision, recall, F1, MCC, and the implementation details that textbooks skip over.
- The Confusion Matrix: Why Accuracy Is a Dangerous Illusion (07 Jun 2021)
A fraud detection system with 99.9% accuracy might be worthless. Why scalar metrics hide what matters, and how to think about errors that aren't symmetric.
- The Least Squares Method (01 Jun 2021)
Deriving the normal equations by working backwards from what we want to achieve. The foundation of linear regression, explained geometrically.