Unraveling Culinary Secrets with a Restricted Boltzmann Machine: A Deep Dive into Ingredient Patterns

At the heart of our project lies the Restricted Boltzmann Machine, a type of neural network particularly adept at uncovering hidden patterns in data. In the culinary world, ingredients form complex relationships, creating a web of flavors and textures. Our goal is to untangle this web and discover how ingredients co-occur in recipes. We feed… Continue reading Unraveling Culinary Secrets with a Restricted Boltzmann Machine: A Deep Dive into Ingredient Patterns

Streamlining Data Handling in PyTorch: Building an Efficient Data Pipeline

Introduction In the realm of machine learning, managing large datasets efficiently is often a critical task. PyTorch, known for its flexibility and ease of use, offers robust tools for this purpose. This article aims to guide you through constructing a data pipeline that not only manages memory efficiently by streaming data from the hard drive… Continue reading Streamlining Data Handling in PyTorch: Building an Efficient Data Pipeline

The Food.com 2023 Dataset

The dataset is a rich compilation of recipes, spanning a wide range of cuisines and styles. It offers a unique perspective on what makes a recipe more than just a list of ingredients and steps. With over 500k recipes, it’s a deep dive into the culinary world, providing data enthusiasts, chefs, and food bloggers an opportunity to analyze and understand cooking trends on a macro scale.

Bootstrapping Estimates for Comment Likelihood, Hacker News: EDA II

In my previous Hacker News EDA we looked at how words could be embedded in two dimensions. This time we implement a bootstrapping simulator for seeing the impact of posting time on number of comments received. Examining the dataset To get an idea of what keywords are popular at different times of the day, we… Continue reading Bootstrapping Estimates for Comment Likelihood, Hacker News: EDA II

Training a Neural Network for Word Separation

The Hacker News Posts dataset from Kaggle contains an entry for each post made on Hacker News around the year 2016. Hacker News is a social media site where, like on Reddit, users share URLs, write posts, give likes, and leave comments. In this EDA we examine the words used in post titles, identifying embeddings… Continue reading Training a Neural Network for Word Separation

Searching for a switch: the Circuit Breaker Panel problem

Photo by Marco Verch on Flickr

You want to identify the breaker switch for a particular outlet. This problem inspired me to write an article on Medium one day as I was thinking about the electric wiring at home. As you’ll see, there are commonalities between decidely challenging tasks in Machine Learning and the problems you might encounter day to day.… Continue reading Searching for a switch: the Circuit Breaker Panel problem

Published
Categorized as Algorithms

Solve a Substitution Cipher with a Markov chain

Photo by Mauro Sbicego on Unsplash

There are k! substitution ciphers for an alphabet with k letters—too many for an exhaustive search. With a frequency-based approach adapted to the graph of alphabetic ciphers, we redefine the act of deciphering as a sampling problem suitable for a Metropolis-Hastings random walk. A substitution cipher is thus solvable with a Markov chain. Let’s begin… Continue reading Solve a Substitution Cipher with a Markov chain