I write about design patterns, trends, and project managment strategies for data science projects.

Lessons from Rust 2: Stronger Typing

Tips for integrating Rust's strong typing system into your Python code.


Lessons from Rust 1: Enums for Errors and Missing Data

The Option and Result enum types from Rust can be useful design patterns for working with missing data and propagating errors.


Patterns for data collection types

Patterns for creating useful collections of data objects: filtering, grouping, aggregation, mutation, parallelization, and plotting.


Patterns and Antipatterns for Dataclasses

Tips for building clean data objects using the dataclasses module.


Are data frames too flexible?

Custom types VS data frames: choosing the right data structures for your project.


Using Slot Classes in DocTable Schemas

Allowing for defaulted parameters in slot classes using decorators so they can be used as doctable schemas.


Using SQLite to store and manage large Python objects

My solution in doctable and some basic benchmark results.


Reflections on creating text analysis workshops

Learning these methods is about developing your own design patterns and learning the most fundamental approaches.


New repo for cleaned NSS documents

I created some python code to download US National Security Strategy documents from github for your text analysis examples.


UCSB Instructional Development Grant

We received an instructional development grant to help sociology students learn computational methods.


Topic Modeling for Undergraduate Sociology Students

Thoughts from a pilot program to teach Sociology undergraduates how to use topic modeling.


Theory, Method, and Reproducibility in Text Analysis

I respond to critiques of computational methods for hermeneutic analysis.