I'm a data scientist and machine learning practitioner with expertise in AI ethics. I have worked with text, images, electronic health records, and DNA sequences. My main area of machine learning expertise is natural language processing (NLP) and language models. I've developed NLP models for diverse use cases, including conversational AI, extracting information from tweets and medical literature, and gathering genealogical information from historical newspapers.

Prior to becoming a data scientist, I was an academic molecular biologist focusing on chromosomes and genome-wide analysis of recombination. I trained at UCSF and later held a tenure-track faculty position at Indiana University.