Current Members

The Core is led by Shannan Ho Sui, a Senior Research Scientist in the Department of Biostatistics at the Harvard Chan School of Public Health, an Affiliated Faculty member of the HSCI, and the Director of the Core. Her expertise includes data integration, data standards, next-generation sequence analysis, and transcriptional gene regulation. She also has experience in pathogen bioinformatics through her postdoctoral work on anti-infective drug discovery.

She developed the oPOSSUM system for predicting transcriptional regulators from gene expression data, the HSCI Stem Cell Discovery Engine to facilitate molecular comparisons of stem cell experiments, and is the Director of the HSCI Stem Cell Commons - an initiative to promote open exchange of stem cell data and analysis tools through a central stem cell data repository and analytical system. You can find her on Twitter. Shannan has a PhD in Genetics from the University of British Columbia.

John Hutchinson is a Senior Research Scientist in the Department of Biostatistics at the Harvard Chan School of Public Health and helps lead the core as it’s Associate Director. He has a background in academia, and transitioned from the wet lab to computational approaches during his postdoctoral training. He has pursued research ranging from studying the role of the Akt serine threonine kinase in breast cancer biology, characterizing the nuclear enriched lncRNAs NEAT1 and MALAT1, helping show evidence of widespread allele specific expression of autosomal genes and developing novel bioinformatic methods to study the effect of DNA variation on methylation. He enjoys adapting standard technologies and public data to novel uses and designing data visualizations to better illustrate experimental results.

His varied background has familiarized him with many members of the research community in Boston and his deep background as both a molecular biologist and bioinformatician give him unique insight into translating biological questions into practical bioinformatic approaches. He is on Twitter. John has a Ph.D. in Biology from McMaster University.

Radhika Khetani is a Research Scientist in the Department of Biostatistics and the Training Director at the Core, with a focus on developing bioinformatics workshops and training modules of various lengths for the Harvard community through funding from HMS and (formerly from) the HNDC).

Her passion is to make genomics data and its analysis accessible to everyone. She has spent the last several years training wet lab biologists to independently perform bioinformatics analyses. She joined the core in Fall 2014 from the High-Performance Biological Computing (HPCBio) group at University of Illinois (UIUC) where she was involved in several research projects, and was in charge of the training program. Her background is in molecular biology, but she switched to computational work as a postdoctoral fellow when she started studying the impact of stress factors on the health of honey bees. Her sequence-based analysis repertoire includes metagenomics, metatranscriptomics, RNA-Seq, genome and transcriptome assembly, in species across the spectrum of life. Radhika has a PhD in Molecular and Cellular Biology from Dartmouth College.

Mary Piper is a Research Associate in the Department of Biostatistics and the Associate Training Director at the Core. She serves dual roles as research consultant and bioinformatics trainer, but her primary role is the development and instruction of bioinformatics workshops focused on the analysis of next-generation sequencing data. Her background in scientific curriculum design and instruction began at the University of Michigan, where she taught undergraduates for over five years while completing her PhD in cellular and molecular biology and her MPH in epidemiology. She combined her passions for scientific research and teaching at the Broad Institute of MIT and Harvard as the genomics instructor for MIT’s Minority Introduction to Engineering and Science program where her course focused on cutting-edge biomedical research and bioinformatics analysis. Her exposure to the research at the Broad Institute, MIT, and Harvard, led to her desire to pursue bioinformatics as a career and to share that knowledge with the community.

Rory Kirchner is a Research Scientist with background in neuroscience and engineering from the Harvard-MIT Health Science and Technology department. He works on creating software to implement best-practice approaches for common informatics analyses and methods development for tackling novel informatics problems. His code can be found on github and he blogs at spliced.

Meeta Mistry is a Research Scientist in the Department of Biostatistics at the Harvard Chan School of Public Health and a Senior Bioinformatics Trainer and Analyst at the Core. Her training began in Biochemistry, however postgraduate interests in computational approaches fostered the transition to Bioinformatics. During her graduate career she has pursued various areas of research including the identification of bacterial drug targets, implementing metrics for functional similarity, and cancer proteomics. Her expertise includes complex statistical analysis of large-scale datasets pertaining to postmortem human brain, meta-analysis and genomics of neuropsychiatry. She also has a passion for teaching bioinformatics with experience at the highschool, graduate and postgraduate level. Her current role involves both bioinformatics consulting and training efforts. Meeta has a PhD in Bioinformatics from the University of British Columbia. s

Victor Barrera is a Research Scientist in the department of Biostatistics at the Harvard Chan School of Public Health. He graduated with a BSc in Biochemistry and began his thesis in Cancer Epigenetics. During his first PhD year he shifted to Bioinformatics, obtaining a MSc in Bioinformatics and Biostatistics, and designed custom pipeline and analysis for NGS epigenetic data. He is interested in the study of big volumes of data such as next generation sequencing or Big Data in order to find hidden relationships. He is experienced in custom analysis development and automation using Python, R and Bash as his main worktools.

James Billingsley is a Research Scientist in the Department of Biostatistics at the Harvard Chan School of Public Health. He has a PhD in Immunology and developed his interests in high dimension computational methods during his postdoctoral training. Prior to joining the Core, James worked as a Bioinformatics Analyst at the Yerkes Non-human Primate Genomics Core at Emory University. His primary interests at the Chan Bioinformatics Core are focused on single-cell RNA-seq, and his additional research interests include characterizing correlates of protection in non-human primate models of HIV vaccines, and understanding mechanisms maintaining HIV latency.

Zhu Zhuo is a Research Associate in the Department of Biostatistics at the Harvard Chan School of Public Health. She has a PhD degree in Animal and Food Sciences from the University of Delaware. During her PhD, she worked on analyses of differential gene expression and allele specific gene expression in chickens. Her postdoctoral training focused on de novo genome assembly in plants and supporting undergraduate education in Genetics and Genomics. Zhu’s role at the Harvard Chan Bioinformatics Core is to provide Bioinformatics consulting.

Sergey Naumenko is a Research Associate in the Department of Biostatistics at the Harvard Chan School of Public Health. Sergey graduated with BSc/MSc in Computer Science from Kharkiv National University of Radio and Electronics (Kharkiv, Ukraine), defended PhD thesis in Mathematical Biology and Bioinformatics in the joint laboratory of Evolutionary Genomics and Molecular Evolution (Moscow State University/Institute for Information Transmission Problems, Moscow, Russia), and went through a postdoctoral training at the Hospital for Sick Children (Toronto, Canada). Sergey is performing RNA-seq analyses and is helping to maintain the bcbio ecosystem. Publications, GitHub, Twitter.

Joon Yoon is a Research Associate in the Department of Biostatistics at the Harvard Chan School of Public Health. Joon obtained his MSc and PhD degrees in Bioinformatics from the Seoul National University where he defended his thesis on custom workflows and visualizations for transcriptomic analyses (using R and python scripting). During his Ph.D. Joon joined the bioinformatics solution company C&K Genomics’ RNA-seq team as a research consultant where he helped set update their bulk RNA-seq analysis workflows by switching out outdated protocols and expanding on types of analyses. While there he also participated in the T2D international consortium and several other RNA-seq projects.

Jihe Liu is a Research Associate in the Department of Biostatistics at the Harvard Chan School of Public Health. Jihe obtained his BS degree in Bioinformatics from Huazhong University of Science and Technology, and PhD degree in Chemistry (with focus on chemical biology and synthetic biology) from University of Pittsburgh. He received postdoctoral training at The Jackson Laboratory, where he worked on genomic analysis of Glioblastoma using short-read and long-read sequencing technologies. Jihe’s role at the core is to instruct bioinformatics workshops, develop training materials, and support selected data analysis projects.

Preetida Bhetariya Dr. Preetida J Bhetariya is a Research Associate in the Department of Biostatistics. She received her PhD in Biotechnology from Institute of Genomics and Integrative Biology, India where she studied Aspergillus genomics for secondary metabolites. Along with molecular and functional genomics approaches, she developed a diagnostic test for fungal (Aspergillus) detection from clinical samples. She continued pursuing her computational genomics interests during her post-doctoral training at University of Utah where she used RNA-seq to study microbial interactions in Multiple Sclerosis and characterized human gene expression patterns in MS and other autoimmune diseases. Following her passion for diagnostic research, she moved to Dr. Gabor Marth’s lab where she studied cancer genomics; developing a bioinformatics pipeline for low-frequency variant calling for circulating tumor DNA from liquid biopsies. She is an expert in designing custom custom analysis workflows in Python and R. She supports the data analytics of projects involving single cell genomics and cancer genomics at the core. Github, Google Scholar, Twitter

Ilya Sytchev is a Senior Bioinformatics Software Developer in the Department of Biostatistics. He was one of the founders of OpenWetWare - an online community for biologists and is currently helping build and maintain the HSCI Stem Cell Commons. He has a BS in Computer Science from University of Massachusetts Boston and a MS in Bioinformatics from Northeastern University.

Maria Simoneau is the Project Manager for the Core and helps with onboarding consults, grant support, record keeping, reporting, planning and training and event coordination. She joins us with nine years of experience at Harvard Chan in the Onnela Lab in Biostatistics and in Social and Behavioral Sciences. She has an MBA from the F.W. Olin Graduate School of Business at Babson College in Wellesley, MA and a BS in Computer Science from Boston College in Chestnut Hill, MA.

Faculty Director

Peter Kraft is a Professor of Epidemiology and Biostatistics at the Harvard T.H. Chan School of Public Health. Dr. Kraft’s research focuses on statistical issues in the design and analysis of genetic association studies, with an emphasis on applications in cancer epidemiology. He has published over 300 articles and was recognized by Thompson Reuters as a highly cited researcher in 2014. Dr. Kraft has also developed and taught courses on introductory genetic epidemiology and “big data” for epidemiologists. Dr. Kraft is the Director of the Harvard Chan Program in Genetic Epidemiology and Statistical Genetics, and Faculty Director of the Harvard Chan Bioinformatics Core (HBC).

Before taking over as Faculty Director of HBC from founding Faculty Advisor Win Hide in August 2014, Dr. Kraft worked closely with the Core to develop pipelines to call and analyze high-depth targeted sequencing in a multi-ethnic sample of 4,600 breast cancer cases and controls.

Administrative support

We are fortunate to have access to dedicated administrative support. Rachel Boschetto is our grants manager and all-around financial guru.


Winston Hide created the Core in his role as an Associate Professor of Bioinformatics and Computational Biology in the Department of Biostatistics at Harvard T.H. Chan School of Public Health. Dr Hide is currently an Associate Professor of Pathology at the Beth Israel Deaconess Medical Center where he also co-directs the Non-coding RNA Core Facility. He founded and directed the South African National Bioinformatics Institute (SANBI) near Cape Town. With over 20 years of experience in computational biology, his expertise addresses integration of ‘omics data to deliver clinical translation. He uses standardized approaches to disease gene discovery in stem cells and cancer stem cells, host response to pathogens, and complex diseases.

Oliver Hofmann is the Director of Bioinformatics at the Wolfson Wohl Cancer Research Centre and a Reader in Bioinformatics at the University of Glasgow. Previously the Director of the Core, Oliver continues to consult for the Core as an Affiliated Faculty member of HSPH and the HSCI.

Oliver worked in a molecular biology wetlab environment before switching to natural language processing, database development and curation as part of his PhD thesis, eventually branching out to sequence clustering and pattern recognition as a visiting researcher at deCODE Genetics. His work as a postdoctoral fellow at the South African National Bioinformatics Institute focused on data integration, second-generation sequencing data management and network analysis. His recent research focuses on stem cell biology and integrating heterogeneous data sources to provide biological context to results obtained from high throughput experiments, as well as applications of next generation sequencing to cancer genomics. He can be found on Twitter and blogs about workshops and conferences. Oliver has a PhD in Biochemistry from the University of Cologne, Germany.

Brad Chapman was a Senior Research Scientist in the Department of Biostatistics. Currently working at Gingko Bioworks, he has spent more than 15 years answering biological questions with computational approaches after switching over from a background in wet-lab research. He combines automated high-throughput analysis pipelines with custom visualization and processing tools. By utilizing a wide variety of languages, he strives to maximize code re-use while maintaining the flexibility to answer highly-specific collaborative questions.

Brad is involved in the open source community as a member of the Open Bioinformatics Foundation, bcbio, Biopython and CloudBioLinux, as well as contributing regularly to freely available GitHub and Bitbucket repositories. He posts about his research on Blue Collar Bioinformatics and can be found on Twitter as chapmanb. He has a PhD in plant biology from the University of Georgia.

Lorena Pantano was a Research Scientist in the Department of Biostatistics. Currewntly at eGenesis, she graduated with a BSc in Biochemistry, and extended her career in Bioinformatics during her MSc and PhD thesis. Her interests are the application of statistical and mathematical models to large-scale data, mainly (small) RNA-seq data. She likes to think of new visualization concepts to improve the interpretation of complex data providing a dynamic user-data interaction. She is involved on the development/maintenance of SeqBuster, a framework for the analysis of small RNA seq data that integrates Java, Python and R modules. You can find her on Twitter.

Michael Steinbaugh was a Research Associate in the department of Biostatistics at the Harvard Chan School of Public Health. He graduated with a BSc in Microbiology, Immunology and Molecular Genetics from UCLA and did his Ph.D. at the University of Michigan Medical School where he studied stress resistance and xenobiotic metabolism in slow-aging mice. Mike did his postdoc at the Joslin where he worked in Keith Blackwell’s group on lipid homeostasis and lifespan extension in C. elegans. Mike has a keen interest in R and Bioconductor development. Mike left the core for a position at Constellation Pharmaceuticals, where he puts his expertise in single cell RNA sequencing to good use..

Kayleigh Rutherford was a Research Data Analyst in the Department of Biostatistics. She graduated with a BSc in Genetics and Computer Science from the University of Cape Town, and went on to complete a Medical Science Honours in Bioinformatics, with a thesis that focused on computational methods for drug repositioning. She came to Boston to pursue a Masters in Bioinformatics and since graduating, joined the core where she focuses on statistical analysis and visualisations of large-scale data from siRNA and compound screens. She has since moved on to a Computational Biology position at Memorial Sloan Kettering.

Yufei Lin was an Intern in the core and came to us from the Boston University where she was pursuing a Masters in Statistical Practice, which fostered her interest in the application of statistics to biology. Yufei has an extensive and varied background in data science and analysis and a Bachelor of Management in Electronic Commerce. Her working experience in a neuroscience lab and a biomedical genetics lab makes her determined to pursue bioinformatics as a career, specifically in genomics studies, where she is most interested in studying next generation sequencing data to find relationships between the genome and disease. She is experienced in statistical analysis and visualization of large-scale data using R.

Andreas Sjödin was a Research Associate in the Department of Biostatistics and is currently the Deputy Research Director at the FOI in Umea, Sweden. He has been in the field of genomics for more than ten years and closely followed the development of emerging sequencing technologies. The focus of his research has changed from plant transcriptomics through pathogen genome sequencing to human WGS and metagenomics. He is also an expert on forensics of select agents (BSATs) using sequencing technologies. Andreas has a PhD in Cell and Molecular Biology of Plants from Umeå University. Follow him on twitter.

John Morrissey was part of the FAS Research Computing team and provided research computing support for the Harvard Chan School of Public Health. Prior to joining Research Computing, John was a Senior Software Engineer at Frontier Communications where he was the lead on writing new code and modifying existing code, designing/implementing systems and network infrastructure, performance planning and tuning, and final escalation for production troubleshooting and failure analysis. He works in close collaboration with core staff and external partners to identify and resolve bottlenecks in our workflows. You can follow his latest work on Twitter. John has a BS in Information Technology from Rochester Institute of Technology.