NevadaToday

New tool allows computer scientists to support life scientists

Department of Computer Science and Engineering Assistant Professor Tin Nguyen and his lab have developed software to help life scientists efficiently analyze single-cell data using machine learning.

Research & Innovation | February 25, 2021

Four people in building with windows behind them

Nguyen (second from left) and his graduate students (left to right) Bang Tran, Hung Nguyen and Duc Tran in the new WIlliam N. Pennington Engineering Building. (Photo by Isaac Hoops)

New tool allows computer scientists to support life scientists

Department of Computer Science and Engineering Assistant Professor Tin Nguyen and his lab have developed software to help life scientists efficiently analyze single-cell data using machine learning.

Research & Innovation | February 25, 2021

Nguyen (second from left) and his graduate students (left to right) Bang Tran, Hung Nguyen and Duc Tran in the new WIlliam N. Pennington Engineering Building. (Photo by Isaac Hoops)

Tin Nguyen and his Ph.D. students Duc Tran, Hung Nguyen and Bang Tran have used the data processing power of machine learning to develop a novel tool to support the research of life scientists. Named scDHA (single-cell Decomposition using Hierarchical Autoencoder), the tool uses machine learning to address a key problem life scientists run into during their research: too much data to process. With the results of their efforts to solve this problem recently published in Nature Communications, “Fast and precise single-cell data analysis using a hierarchical autoencoder,” Nguyen’s team is now looking to serve fellow researchers by using the tool to support their analysis of large quantities of cell data.

Nguyen was kind enough to participate in a Q&A about the tool to illustrate how it works and describe some of its capabilities.

What problem does the tool help life scientists (biologists and medical doctors) overcome?

Biotechnologies has advanced to a degree where scientists can measure the gene expression of individual cells in our body. This technology is called single-cell sequencing. One experiment can generate the expression of millions of cells and tens of thousands of genes (can be represented as a matrix with millions of rows/cells and tens of thousands of columns/genes). It is hard to analyze such data. Adding to the challenge is the dropout events, in which many genes and cells cannot be measured due to the low amount of biological material available for the cells. It is extremely difficult to mine such data to gain biological knowledge.

How does the tool—scDHA (single-cell Decomposition using Hierarchical Autoencoder)—work?

To mine the data from the noisy and large data, we used multiple state-of-the-art techniques in machine learning. First, we developed a novel non-negative kernel autoencoder (neural networks) to eliminate genes that do not play important roles in differentiating the cells. Second, we developed a stacked Bayessian autoencoder (neural networks) to transform the data of tens of thousands of dimensions to a space with only 15 dimensions. Finally, we developed four different techniques to: (1) visualize the transcriptome landscape of millions of cells, (2) group them into different cell types, (3) infer the developmental stages of each cell, and (4) build a classifier to accurately classify the cell of new data.

The tool has four different applications. Can you explain what it can do?

Four people in computer lab in William N. Pennington Engineering Building — Professor Nguyen with his students in the William N. Pennington Engineering computer lab where the tool was developed. Pictured: Hung Nguyen, Bang Tran and Duc Tran with Professor Nguyen (right).

Visualization: The first step in the analysis pipeline for most life scientists. The data is transformed from high dimensional space into a 2D landscape, which is often called Transcriptome Landscape, so life scientists can observe the landscape of the cells, and the relative distance between them.

Cell segregation: The goal to separate the cells into groups that are likely to have the same bodily functions with similar biological features. This is particularly important in constructing cell atlas for tissues in different organisms.

Time-trajectory inference: The goal of this is to infer the developmental trajectory of the cells in the experiment. We will arrange cells in an order that presents the development process of the cells over time (time-trajectory). Biologists can use this trajectory to investigate the mechanism of how cells can develop to different cell types.

Cell classification: This is particularly important to reuse the data we already collected to study the new data. Given well-studied datasets with validated cell types and well-understood mechanisms, we build a classifier that can accurately classify the cells of new datasets.

What doors to new knowledge (or techniques or technologies) do you foresee the tool opening?

Visualization: Important in exploring the data, especially when analyzing tissues that have not been studied before.

Cell segregation: Important in studying new tissues. This allows scientists to group cells according to their functions.

Time-trajectory inference: This is important to understand how cells divide and develop over time.

Cell classification: Allows us to classify cells of new datasets.

Currently we only understand something about human tissues at the cell resolution. Single-cell technologies open up a whole new world for us to understand the composition of tissues, how they develop and interact. Without tools such as ours, it is impossible for life scientists to mine information from such large and high-dimensional data. The same can be said for thousands of model and non-model species.

What is next for the tool?

Now, we wish to connect to life scientists that want their single-cell data analyzed at UNR.

Single-cell technology is a relatively new and expensive. We developed and validated this tool using data that were made available by the Broad Institute, Wellcome Sanger Institute, and NIH Gene Expression Omnibus. Now, we wish to connect to life scientists that want their single-cell data analyzed at UNR.

Can the tool be applied in other arenas?

Even though we developed this platform for single-cell data analysis, we wish to apply this technology to other research areas too, including subtyping cancer patients (clustering), patient classification, functional analysis (pathway analysis), etc. Our research lab has published high-level journal articles in the above-mentioned research areas.

Calling all collaborators

The Department of Computer Science and Engineering has the expertise to support research across campus. Those hoping to work with Nguyen and his team are encouraged to email Professor Nguyen to discuss the details of their project.

Research & Innovation | February 25, 2021

Research & Innovation

Microplastic Mayhem: How three researchers are analyzing particles in Lake Tahoe

The University of Nevada, Reno research team uses latest tech for long-term project

A research boat floats on Lake Tahoe with blue water and skies.

7/22/2025

Research & Innovation

Chemistry professor named Fulbright Scholar plans to visit Brazil

Sergey Varganov is a theoretical and computational chemist

7/17/2025

Research & Innovation

Doctoral student explains metallic glass so it’s crystal clear

3-Minute Thesis winner Jerry Howard talks about research in Krista Carlson’s lab

Two people stand next to lab equipment used to research metallic glass.

7/17/2025

Research & Innovation

A library for the future of medicine

UNR Med’s Savitt Medical Library introduces new technology and learning spaces to support innovation and student success

A modern, open tech suite inside the Savitt Medical Library featuring 3D printers, VR headsets, workstations, and study pods designed for hands-on, interdisciplinary learning.

7/21/2025

Editor's Picks

Lauren holds a sign that reads "My white coat represents ... a pledge to my patients and community!"

Mother. Veteran. Future PA. Lauren Bell’s journey to medicine

Teachers standing around a planted 'ulu tree.

From Fallon to Hawai‘i: Online MPH alumnus Caden Salois plants seeds of sustainability

Ai Ana on a snowy hilltop in the mountains holding out her hand, smiling as a mountain chickadee bird has landed in it.

Ph.D. student Ai Ana Richmond named honorable mention from NSF Graduate Research Fellowship Program

Two Path to Independence summer camp participants, Lila Barber and Julia Layosa, smile outside the College of Education and Human Development.

Path to Independence hosts third annual summer camp at the University of Nevada, Reno

Latest From

Nevada Today

Education & Public Service

Refugee-focused nonprofit grows farm and opportunity with help from Extension

Regenerative farming classes help Lighthouse Charities blossom

A smiling man stands beside six wheelbarrows filled with orange fruits and vegetables.

7/21/2025

Health & Medicine

Medical training starts early

High school students get hands-on experience at Lake Tahoe

A group of students from the PA program and Upward Bound photographed together at the Lake Tahoe campus.

7/21/2025

Media & Society

Remembering David A. Schooley: Pioneering scientist, mentor and professor emeritus

David was a leader in the field of insect endocrinology and a guiding force in student success

A man in glasses sitting by a wired ab equipment.

7/18/2025

Impact & Student Success

Mother. Veteran. Future PA. Lauren Bell’s journey to medicine

From the military to medicine: How veteran Lauren Bell found new purpose in healing others

7/17/2025

Health & Medicine

From Fallon to Hawai‘i: Online MPH alumnus Caden Salois plants seeds of sustainability

Caden Salois ‘21, ‘25 MPH, MASUST, believes in the power of breadfruit so deeply he has a tattoo of one

7/16/2025

Research & Innovation

Anita Montero recognized with Honorable Mention by the NSF Graduate Research Fellowship Program

Ph.D. student’s research on animal behavior and hybrid zones receives national recognition

Anita smiles and walks in the sunshine through some bushes with a backpack that has as metal cage in it.

7/14/2025

Research & Innovation

Professor Emeritus Richard Tracy named an Ecological Society of America Fellow

ESA is the major professional society for ecologists in the US

A man, standing, and a woman, sitting, hold a toad and look over their shoulders to smile at the camera.

7/14/2025

Science & Technology

Global experts gather at Lake Tahoe to protect migratory fish and freshwater corridors

Workshops hosted at the University of Nevada, Reno at Lake Tahoe will help inform discussions and actions for future international conferences

Dr. Zeb Hogan and his team standing in the middle of a river holding a fish.

7/14/2025

News

Read, watch &
listen

About

New tool allows computer scientists to support life scientists

Department of Computer Science and Engineering Assistant Professor Tin Nguyen and his lab have developed software to help life scientists efficiently analyze single-cell data using machine learning.

New tool allows computer scientists to support life scientists

Department of Computer Science and Engineering Assistant Professor Tin Nguyen and his lab have developed software to help life scientists efficiently analyze single-cell data using machine learning.

What problem does the tool help life scientists (biologists and medical doctors) overcome?

How does the tool—scDHA (single-cell Decomposition using Hierarchical Autoencoder)—work?

The tool has four different applications. Can you explain what it can do?