I am a software developer at a stealth biotech startup in the San Francisco Bay Area. We are growing rapidly and actively improving humanity.

I used to be a software developer at Amazon on the browse relevance team. I worked on the left-hand navigation for search pages, trying to present relevant category results to users.

While at Amazon I internationalized our service, expanding from only the US market to Canada, the UK, France, Germany, Italy, Spain, Japan, and China; developed a system to determine if a user's search was specific or generic and tailor our results accordingly; participated heavily in hiring; and mentored one intern and two new hires. I left because I was bored and unchallenged.

I work on a few projects in my spare time, all of which can be found on GitHub.

I'm interested in free speech, data mining, machine learning, and open source.


email: steven.buss at gmail



Tigershark is a Python X12 EDI file parser. It's based on code by Steven Lott and is one of the few open-source X12 parsers available.

Everyone working in the insurance billing industry claims that billing can't be automated. I prove them a little more wrong every day.
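
For anyone who hasn't seen X12: a file is a run of segments, and each segment is a list of elements. The sketch below is only an illustration of that shape, not Tigershark's API. The helper name `split_segments` is hypothetical, the sample data is made up, and it assumes the common default delimiters (`*` between elements, `~` after each segment), whereas real files declare their delimiters in the ISA header.

```python
# Illustrative only: this is NOT Tigershark's API.
# Assumption: "*" separates elements and "~" terminates segments.
# Real X12 files declare their delimiters in the ISA header.

def split_segments(raw, element_sep="*", segment_term="~"):
    """Split raw X12 text into a list of segments, each a list of elements."""
    segments = []
    for chunk in raw.split(segment_term):
        chunk = chunk.strip()  # drop newlines that often follow the terminator
        if chunk:
            segments.append(chunk.split(element_sep))
    return segments


# Made-up sample: a transaction set header, one body segment, and a trailer.
sample = "ST*837*0001~\nBHT*0019*00*123*20240101*1200*CH~\nSE*3*0001~"

for segment in split_segments(sample):
    # The first element is the segment ID; the rest are its data elements.
    print(segment[0], segment[1:])
```

A real parser has much more to do than this (reading delimiters from the header, handling loops, validating against a transaction spec), which is where the actual work is.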


Revisionews is a project to bring more accountability to online news publishers. Sadly, it is on the back burner due to time constraints. You can find out more about it on GitHub.


In college I did some research on constructing a large-scale recommender system for social networks. This involved mining data from a large social news website and then applying PCA/LSI/etc. You can see some of my matrices in the UF Sparse Matrix Collection.

Some C code for Probabilistic Latent Semantic Indexing can be found in my cs_plsi GitHub repo.