The dataset is a rich compilation of recipes, spanning a wide range of cuisines and styles. It offers a unique perspective on what makes a recipe more than just a list of ingredients and steps. With over 500k recipes, it’s a deep dive into the culinary world, providing data enthusiasts, chefs, and food bloggers an opportunity to analyze and understand cooking trends on a macro scale.
Tag: machine learning
Training a Neural Network for Word Separation
The Hacker News Posts dataset from Kaggle contains an entry for each post made on Hacker News around the year 2016. Hacker News is a social media site where, like on Reddit, users share URLs, write posts, give likes, and leave comments. In this EDA we examine the words used in post titles, identifying embeddings… Continue reading Training a Neural Network for Word Separation
RRQR Factorization and Spanning Boxes
The famous QR factorization algorithm can be BLAS-3 optimized. Using the Woodbury matrix identity, we implement a block-based Rank-Revealing QR and point out a connection between pivot selection and object detection.
Solution Space Techniques for Faster Convergence, an introduction
Many strategies in Machine Learning involve the iterative search of a solution space Ω. We begin with an initial solution ω₀ and update ωₙ to minimize an objective function f(x)
Solve a Substitution Cipher with a Markov chain
There are k! substitution ciphers for an alphabet with k letters—too many for an exhaustive search. With a frequency-based approach adapted to the graph of alphabetic ciphers, we redefine the act of deciphering as a sampling problem suitable for a Metropolis-Hastings random walk. A substitution cipher is thus solvable with a Markov chain. Let’s begin… Continue reading Solve a Substitution Cipher with a Markov chain