Toggle Menu
The Human In The Machine

States Title's Data Science Blog

Daniel Sammons

The human in the machine

Visualizing Hyperparameter Optimization with Hyperopt and Plotly

A machine learning (ML) model is rarely ready to be launched into production without tuning. That's why hyperparameter tuning – the science of choosing all the right settings for ML – is a core competency of the data science team at States Title.

Jay Ozer

The human in the machine

Runaway Query Termination via Looker and Slack

States Title Staff Data Analyst, Jay Ozer, shows you step-by-step how to find runaway queries that bog down your system, terminate them, and then post the details to Slack using the Looker SDK

Apoorv Sharma

The human in the machine

Neural Language Models as Domain-Specific Knowledge Bases

The fundamental challenge of natural language processing (NLP) is resolution of the ambiguity that is present in the meaning of and intent carried by natural language.

Ying Jiang

The human in the machine

Understanding BERT’s Semantic Interpretations

What’s the difference between “It’s good” and “It’s OK?” Find out how our algorithms learn subtle language distinctions to automate the title process and make mortgage closings seamless.

Matthew Phillips

The human in the machine

Building a Modern Data Stack at States Title

Machine learning can deliver an incredible amount of value to business, but when it comes to building a functional and sophisticated data stack, there are numerous choices and challenges to resolve along the way.

Erica Mason

Articles

The human in the machine

Do Your Civic Duty and Help Out a Data Scientist

April 1, 2020 is census day in the United States, when a constitutional mandate for every county in the country to count its population is fulfilled.

Brian Holligan

Articles

The human in the machine

Stochastic Growth: Becoming an Effective Data Scientist with Grit

In the course of three years, States Title has evolved into a top 10 provider of real estate settlement services, an outcome in no small part powered by patented and highly effective data science algorithms.

Andy Mahdavi

Articles

The human in the machine

The Galactic Adventures of Mark V. Shaney

One of the most talented and profound colleagues I’ve had in my life has not been a person, but an algorithm: the Markov Chain.

Ravi Ilango

Articles

The human in the machine

Using NLP (BERT) to improve OCR accuracy

Optical Character Recognition (OCR) is a popular technique used to extract data from scanned documents.

Allen Ko

Articles

The human in the machine

A Custom Imputer – Why and How

Earlier this year, our data science team of three people were gathered in a small meeting room, looking over plots describing feature behavior for a new model.

Daniel Sammons

Articles

The human in the machine

pytest for Data Scientists

Creating automated tests for software is second nature to great software engineers, and it’s a habit that data scientists should mirror.

Andy Mahdavi

Articles

The human in the machine

A RESTful Development

As disciplines mature, practitioners tend to specialize. During the Renaissance, physics was a branch of philosophy, and Isaac Newton, inventor of calculus, the prism, and aspiring alchemist, considered himself a philosopher. He called his masterpiece the Mathematical Principles of Natural Philosophy. In the 19th century, philosophy split off from physics, but a single physicist could still be reasonably expected to know the entire subject.