Eleanor initially joined the lab in June as an intern from the University of Michigan’s Data Science Master’s Program, and has now decided to stay with us! She will be working as a research technician in our lab. We are happy to have you, Eleanor!
Kenia presented an hour-long seminar on her proposed thesis research. Her talk was about dissecting changes in the genetic architecture of fitness across environments and cross-species predictions of genetic interactions.
Elijah will spend the summer at Washington State University where he will be working with Dr. Arron H. Carter’s lab as part of the USDA/NIFA FACT: Research Experience for Undergraduates on Phenomics Big Data Management program.
Krishen is a third year undergraduate student studying Plant Biology, Biochemistry, Human Biology, and Genetics. We’re so happy to have you here with us!
Sci-Files is a science communication podcast produced by MSU’s SciComm club, where graduate students are interviewed about their research. Serena was interviewed about her work on knowledge graphs for the February 20th episode, which is published here as a podcast.
Fitness (number of seeds or fruits producted by an individual) is challenging to measure accurately due to the high fecundity and relatively small propagule sizes. Peipei and Fanrui, with significant contribution from other colleagues, established Faster R-CNN based object detection models which can automatively detect and count Arabidopsis seeds and siliques in high throughput. This work is now accepted in New Phytologist.
Uncovering genes encoding enzymes responsible for the biosynthesis of diverse plant metabolites is essential for metabolic engineering and production of plant metabolite-derived medicine. Peipei, Ally and Shin-Han wrote a review paper regarding recent progress in the computational prediction of plant metabolic pathway genes. This review is now published in Current Opinion in Plant Biology.
Plants respond to wounding stress by changing gene expression patterns and inducing the production of hormones including jasmonic acid. Bethany led a study modeling the temporal and hormonal regulation of plant transcriptional response to wounding in Arabidopsis. This work is now published in The Plant Cell.
Huan is officially a member of the Shiu lab! She is a third year graduate student in the Genetics and Genome Sciences and Molecular Plant Sciences Programs. We’re glad to have you Huan!
We are very excited to welcome Elijah to the team! Elijah is a third year undergraduate student majoring in Plant Biology.
Brianna rotated in the lab during the summer and surprised us all one day when she walked in and exclaimed “I’m joining the lab!”. She graduated from Stanford University with a Bachelor’s degree in Earth Systems and Human Biology. Brianna is also an NRT-IMPACTS fellow and is a first year graduate student in the Plant Biology program. Welcome Brianna!
In collaboration with two MSU faculty Jiliang Tang and Yuying Xie, Melissa and Shin-Han got an NSF Plant Genome Research Program grant titled Connecting sequences to functions within and between species through computational modeling and experimental studies. To have more complete knowledge of how plants work, we will connect DNA sequences with traits they control using an Artificial Intelligence-based approach, machine learning where computers are used to uncover hidden patterns from a wide range of biological data. In addition, we will apply transfer learning to translate knowledge from one plant species to another so we can later transfer what we know about model plants to other species. The outcome of the project will be computer programs that can predict the connections between DNA sequence and traits and transfer information across species. Using these programs, scientists can better understand how plants work and this knowledge can ultimately be used to create more productive and resilient plants.
Genetic redundancy refers to a situation where an individual with a loss-of-function mutation in one gene (single mutant) does not show an apparent phenotype until one or more paralogs are also knocked out (double/higher-order mutant). Siobhan led a study aiming at understanding the factors contributing to genetic redundancy with machine learning approaches. In addition to Siobhan, Peipei, Serena, Beth, Fanrui, and Melissa have contributed significantly to the study. This work is now published in Molecular Biology & Evolution.
Computational modeling approaches are underutilized in plant biology. Lead by Renee Dale from Danforth Plant Science Center, Shinhan and seven other colleagues published an opinion piece in Frontiers in Plant Science identifying challenges one may face in adopting modeling approach.
For the past school year 2020-2021 the lab has been working remotely. Today was the first time all members were able to experience the lab environment together in over a year! This was Kenia and Ally’s first oppertunity to settle into the lab space due to their first year being entirely remote.
Congratulations to Kenia for receiving an NRT-IMPACTS fellowship, a training program for doctoral students to apply computational approaches to address challenges in plant biology. Kenia also received the PBHS fellowship, which is an NIH funded T32 training program for predoctoral students to pursue research related to plant biotechnology and chemical engineering.
Genes in the same metabolic pathway are assumed to be co-expressed. This study, led by Peipei, aiming to use multiple strategies (including unsupervised and supervised machine learning) to evaluated the utility of gene expression data in the prediction of plant metabolic pathway membership. In addition to Peipei, Bethany, Sahra, Melissa, and Cornelius have contributed significantly to the study. This work is now published in New Phytologist.
Machine learning (ML) is an important tool for making and interpreting predictions in biology. Serena gave a talk introducing the basic concepts of ML and how it can be used to answer biological questions at GLBRC’s Annual Science Meeting. A recording of the presentation can be found here
Serena gave a one-hour seminar on her proposed thesis research as part of the second half of her comprehensive exam. Her talk was about automated information extraction and knowledge graphs, and how we can apply them to generate novel hypotheses in the plant sciences. Afterwards she met with her committee for the proposal defense, and passed!
The broad host range of Fusarium virguliforme represents a unique comparative system to identify and define differentially induced responses between an asymptomatic monocot host, maize (Zea mays), and a symptomatic eudicot host, soybean (Glycine max). Led by Amy Baetsen-Young from Brad Day’s lab, Shinhan collaborated on a project examining how asymptomatic and symptomatic hosts respond to F. virguliforme transcriptionally. This work is now published in Plant Cell
Tandemly duplicated genomic regions can be misassembled together due to high sequence similarity in short read-based plant assembly, resulting higher read coverages than other regions. Peipei, Fanrui, and Bethany used comparative genomics and machine learning approaches to investigate the factors contributing to the uneven distribution of read depth and the potential misassembly. This work is now published in BMC Genomics.
Abigail graduated from Lyman Briggs College with degrees in Statistics and Plant Biology in a virtual commencement ceremony. She plans to continue working with the lab in the spring ‘21.
Kenia is a first year graduate student in the Genetics and Genome Sciences Program. She rotated over the summer and has made the decision to join our lab today! She graduated from University of Texas-Austin with families in the San Antonio area. Welcome to our lab Kenia!
The paper that Serena worked on during the IMPACTS course Foundations in Computational Plant Science, Composite modeling of leaf shape along shoots discriminates Vitis species better than individual leaves, has been published! A collaborative effort amont more than 30 authors, the paper is in the December issue of the journal Applications in Plant Sciences
Today during the Foundations of Computational Plant Sciences course, Serena and the team of IMPACTS trainees from the Spring 2020 Forum in Computational Plant Sciences presented their lesson plan to students. This lesson plan will soon be submitted for publication in the journal CourseSource.
Arabidopsis AINTEGUMENTA (ANT), an AP2 transcription factor, is known to control plant growth and floral organogenesis. This study is led by Dr. Wen-Hsiung Li from Academia Sinica, Taiwan, which reveals biological roles of ANT1 in several developmental processes beyond its known roles in plant growth and floral organogenesis. It is published in the Sep. 1st issue of Proc. Natl. Acad. Sci., USA.
Plants respond to their environment by dynamically modulating gene expression. Our findings demonstrate how in silico approaches can improve our understanding of the complex codes regulating response to combined stress and help us identify prime targets for future characterization. It is published in Nuc. Acid. Res. Genomics and Bioinformatics.
Plant specialized metabolites mediate interactions between plants and the environment and have significant agronomical/pharmaceutical value. This study demonstrates that specialized and general metabolic genes can be better predicted by leveraging cross-species information. Additionally, our findings provide an example for transfer learning in genomics where knowledge can be transferred from an information-rich species to an information-poor one. The findings is now available form In Silico Plant.
The hairs on the surface of some species in the nightshade family (Solanaceae) produce acylsugars which deter herbivores and pests from damaging the plants. A tomato gene cluster involved in medium chain acylsugar accumulation was described, and the evolution of this gene cluster was reconstructed. This study is led by Dr. Pengxiang Fan from Prof. Robert L Last’s lab, and Peipei has contributed significantly to the evolutionary history reconstruction. It is published in Elife.
Ally has known she was joing the lab and has been occationally joing the lab for lunch meetings up to this point. Now it is the first day as an offical graduate student here at MSU for Ally. Welcoming her to the lab with virtual open arms!
Serena is the winner of the SciComm Blog Contest! Here is her winning post KNOWLEDGE GRAPHS: A New Way To Reason. Congrats Serena!
Fanrui, our deep learning expert, Peipei’s beloved husband, and a great cook, is finally back after getting stuck in China for >6 months due to the COVID19, repeated flight canceldation, and long stay in the secondary screen through the immigration. Welcome back!
In our lab lunch today, Ronan is biting her mom’s face!
In collaboration with Jiliang Tang from Comp. Sci. & Engr, MSU, Christina’s review on Opening the Black Box: Interpretable Machine Learning for Geneticists is now published in Trends in Genetics. This work highlights current progress in making machine learning model interpretable and explanable using applications in genetics as examples. And her graphics was picked as the cover as shown on the right!
With the social distancing in place, we continue to meet over Zoom. It seems that folks are doing ok!
Dr. Bao’s work on COST1 regulates autophagy to control plant drought tolerance is published where Peipei has contributed significantly using comparative genomics approaches.
Siobahn defended her thesis titled Modeling and prediction of genetic redundancy in Arabidopsis thaliana and Saccharomyces cerevisiae today virtually via Zoom with 89 people in the audience! Both us showed up in the lab (while keeping >6 ft distance) to get the virtual defense setup. Originally due to move to England next week, Siobhan will stay in East Lansing to finish her work. Good job Siobhan! The picture on the right shows Siobhan right after the defense.
With all of us working remotely, we decide to host a 15-min lab lunch to check in with everyone. The photo on the right is all of us experimenting with the background feature in Zoom - the two “ghosts” are Siobhan and Serena!
Michigan State has indicated all classes will be online only today. Given the nature of most of our work is computational and considering the risks of COVID-19, from today on all lab personnel will be working remotely, except for those who need to keep our plant alive.
Ally has decided to join the Plant Biology Program and, instead of doing any rotation, is coming to our lab! She is currently finishing up her Bachelor’s degree in Manchester University, a liberal art institution. She is awarded an MSU College of Natural Science Fellowship as well as an Early-Start Fellowship to begin her doctoral training on July 1st! Welcome to our lab Ally!
Serena is a new graduate student in the, hold your breath, Plant Biology Program, the Computational Math., Sci., Math Program, AND the Moleculr Plant Science Program. She is a MSU Plant Science Fellow and an NSF Fellow of the IMPACTS training program. She graduated from the Cornell University in 2019 and has been rotating in two other labs. Here is the funny Slack exchange when I found out she is joining us!
Can transcriptome data be used to predict traits? In the study titled Transcriptome-Based Prediction of Complex Traits in Maize, Christina, in collaboration with Jeremy, a rotating student at the time, and Bob VanBuren tested this and found out that not only we can predict traits with transcriptomes, but they may tell us more than genomic data can. The finding is featured in MSU Today!
Beth defended her PhD thesis today titled Specilized metabolism and stress response: studies in predicting gene function and regulation. The seminar was held in Plant Biology Lab 247 and was attended by both her colleagues, friends, and families. Beth is moving to University of Wisconsin-Madison to take a
toral position applying molecular evolutionary and computaional approaches to plant biochemistry. We wish her all the best! The picture on the right shows Beth right before her defense.
Siobhan gave birth today to her baby boy Ronan! Congrats Siobhan and Geoff!
Christina today defended her PhD thesis titled Interpretable machine laerning in plant genomes: studies in the complex relationship between genotype and phenotype. The seminar was held in Plant Biology Lab 247 and was full! Christina’s parents also attended her seminar. In the talk, Christina laid out the overarching Christina is now moving to Australia working in algorithm development in single cell genomics studies. All the best Christina! The picture on the right showing Chrisitna leaving the lab for good on 10/3/2020…
Our story titled Cis-regulatory code for predicting plant cell-type transcriptional response to high salinity is now published in Plant Physiology with Sahra and Christina as joint first authors. Using the root cell-type transcriptome data, Sahra and Christina identified cis-regulatory sequences likely specify cell-type response to high salinity stress. More importantly, machine learning models were built to ask how well the identified regulatory sequences can predict cell-type response. The findings not only advance our understanding of the regulatory mechanisms of the plant spatial transcriptional response, but also suggest broad applicability of the approach to any species, particularly those with little or no trans regulatory data.
Chrisitina’s paper on Benchmarking Parametric and Machine Learning Models for Genomic Prediction of Complex Traits is published today in G3. This is a collaborative between our colleague Gustavo de los Compos, Chrisitna’s internship mentors Andrew McCarren and Mark Roantree, and Emily Bolger a highly productive summer research student. Using data of 18 traits across six plant species with different marker densities and training population sizes, Christina spearheaded a comparison of the performance of six linear and six non-linear algorithms. The findings highlight that simpler, linear algorithm can beat out more sophiscated machine learning approaches, and the importance of algorithm selection for the prediction of trait values.
John’s paper on Evolutionary characteristics of intergenic transcribed regions indicate rare novel genes and widespread noisy transcription in the Poaceae is published in Scientific Report. Christina as well as Rosalie Sowers, a highly talented undergrad from Penn State also contribute to this work. Extensive transcriptional activity occurring in intergenic regions of genomes has raised the question whether intergenic transcription represents the activity of novel genes or noisy expression. To assess this, John et al. evaluated cross-species and post-duplication sequence and expression conservation of intergenic transcribed regions. This study provides a framework to identify novel genes using comparative transcriptomic data to improve genome annotation that is fundamental for connecting genotype to phenotype in crop and model systems.
Thilanka is a new graduate student in the Plant Biology Graduate Program. He graduated from the University of Peradeniya in Sri Lanka in 2016 and worked as a teaching assistant in the past two years in Peradeniya. He is energetic, communicative, and excited about learning new topics. He is also easy to get alone with, and has a black belt in Taekwondo!! Welcome to the lab!
Siobhan has been selected to receive a $7,500 College of Natural Science Dissertation Completion Fellowship during fall semester 2019. Congrats Siobhan!
Nick’s paper on Expression and regulatory asymmetry of retained Arabidopsis thaliana transcription factor genes derived from whole genome duplication is published in BMC Evoltionary Biology. Christina and Eamon Winship, an undergraduate reseracher at the time also contributed. Transcription factors (TFs) play a key role in gene regulation and tend to be retained after duplication. Nick’s work provide answers about about how TF duplicates have diverged in their expression and regulation that contribute to a better understanding of the elevated retention rate among TFs.
Led by Dr. Wen-Hsiung Li’s group in Taiwan, Shinhan has contributed to the study: Comparative transcriptomics method to infer gene coexpression networks and its applications to maize and rice leaf transcriptomes published in PNAS. This study involves a comprehensive time series data and a novel approach to build time-resolved co-expression network to hypothesize regulatory relationships between genes.
Beth’s work on Robust predictions of specialized metabolism genes through machine learning is published in PNAS with Peipei, Johnny, Melissa, and multiple colleagues in Rob Last’s group as co-authors. This work features a machine learning approach to predict general and specialized metabolic gene that are notoriously hard to distinquish. The work is featured in MSU Today and in NSF’s Awesome Discoveries You Probably Didn’t Hear About This Week.