-
Finite-State Transducers for Cards and SpeechBuilding FSTs to recognize poker hands and composing WFSTs for isolated digit speech recognition.
5 min -
Word2Vec from ScratchBuilding skip-gram and CBOW word2vec models with negative sampling, training on the Stanford Sentiment Treebank.
6 min -
Logistic Regression Decision BoundariesBuilding logistic regression from scratch in NumPy with gradient checking, feature scaling, and decision boundary visualization on exam data.
6 min -
Twitter Streaming and Language DetectionCollecting 15K tweets from the Twitter Streaming API, comparing language detection methods, and analyzing geolocation patterns.
6 min -
Soft-Margin SVMs: Linear and KernelImplementing soft-margin linear SVM and polynomial kernel SVM from scratch with gradient descent, grid search over learning rates and regularization.
6 min -
COTA Buses on a MapVisualizing Columbus public transit routes using GTFS data, React, and Leaflet as part of the Smart Columbus initiative.
13 min -
Logistic Regression from Scratch on MNISTImplementing binary logistic regression with gradient descent on 784-dimensional image data. Numerically stable gradients and learning rate tuning.
4 min -
Bayes vs LDA and VC Generalization BoundsComparing the Bayes-optimal quadratic boundary to Fisher LDA on 2D Gaussians, then testing VC dimension bounds on a rectangle concept learner.
6 min
Back
Blog
Page 2 - Showing 8 of 39 posts
View all posts by years →