New software processes huge amounts of single-cell data

Comprehensive analysis of large gene-expression datasets

13-Feb-2018 - Germany

Scientists from the Helmholtz Zentrum München have developed a program that is able to help manage enormous datasets. The software, named Scanpy, is a candidate for analyzing the Human Cell Atlas.

“It’s about analyzing gene-expression data of a large number of individual cells,” explains lead author Alex Wolf of the Institute of Computational Biology (ICB) at Helmholtz Zentrum München. He developed Scanpy together with his colleague Philipp Angerer in the Machine Learning Group of Prof. Dr. Dr. Fabian Theis. In addition to his position at Helmholtz Zentrum, Theis is also a professor of mathematical modelling of biological systems at the Technical University of Munich. “New technical advances generate several orders of magnitude more data with a correspondingly greater information content,” Theis says. “However, the historically evolved software infrastructure for gene-expression analysis simply wasn’t designed to cope with the new challenges. New analytic methods are therefore needed.”

The race for the Human Cell Atlas

According to Theis, a major international research project could also benefit from the software. A team of international scientists is compiling a reference database, called the Human Cell Atlas, which holds data on the gene activity of all human cell types. “For this project, and in a growing number of other projects in which databases are combined, it is important to have scalable software,” says Theis. It is therefore no surprise that Scanpy is currently a candidate for helping to analyze the Human Cell Atlas.

“The publication of Scanpy marks the first software that allows comprehensive analysis of large gene-expression datasets with a broad range of machine-learning and statistical methods,” explains Wolf, describing the achievement. “The software is already being used by a number of groups around the world, notably at the Broad Institute of Harvard University and the Massachusetts Institute of Technology, MIT.”

Technologically, the application is a trailblazing development: Whereas biostatistics programs are traditionally written in the programming language R, Scanpy is based on the Python language, the dominant language in the machine learning community. Another new feature is that graph-based algorithms lie at the heart of Scanpy. Unlike the usual approach of regarding cells as points in a coordinate system within gene-expression space, the algorithms use a graph-like coordinate system. Instead of characterizing a single cell by the expression value for thousands of genes, the system simply characterizes cells by identifying their closest neighbors – very much like the connections in social networks. In fact, to identify cell types, Scanpy uses the same algorithms as Facebook does for identifying communities.

Original publication

Wolf, A. et al.; "Scanpy: large-scale single-cell gene expression data analysis"; Genome Biology; 2018

https://www.analytica-world.com/en/news/1153440/new-software-processes-huge-amounts-of-single-cell-data.html

Original publication

Wolf, A. et al.; "Scanpy: large-scale single-cell gene expression data analysis"; Genome Biology; 2018

Topics

software gene expression biostatistics gene expression analysis data analysis data analysis software

Show all

Organizations

Helmholtz Zentrum München

Recognise, understand, heal: The World of Diagnostics

Diagnostics News

More from the department science Subscribe to newsletter

Get the analytics and lab tech industry in your inbox

New software processes huge amounts of single-cell data

Comprehensive analysis of large gene-expression datasets

The race for the Human Cell Atlas

Original publication

Analysis of Microscopic Images: New Open-Source Software Makes AI Models Lighter & Greener

Other news from the department science

Replacement for animal testing - now completely without animal suffering

Research under high pressure

Molecular dynamics in real time

Blood diagnostics modelled on leeches

Toxic chemicals can be detected with new AI method

Innovative bioreactor research processes and cryotechnologies improve active ingredient tests using human cell cultures

Tracking the dynamics of biomolecules with optofluidic antennas

Testing how well biomarkers work

The longer spilled oil lingers in freshwater, the more persistent compounds it produces

Barcodes expand range of high-resolution sensor

Diamond dust shines bright in Magnetic Resonance Imaging

Antimicrobial agents of the future

Scientists accelerate spectroscopic analysis

The Big Quantum Chill: NIST Scientists Modify Common Lab Refrigerator to Cool Faster With Less Energy

Electrified bacteria: Method developed for faster determination of antibiotic resistance

The enemy within: How pathogens spread unrecognized in the body

Real-time detection of infectious disease viruses by searching for molecular fingerprinting

Researchers Discover Novel Cell Type that Controls the Formation and Growth of New Blood Vessels

New Opportunity for Cancer Therapy: Miniature Lab Provides Insights into Metastases Development

Particle surface properties and their understanding - a trend also among young scientists

Get the analytics and lab tech industry in your inbox

Most read news

Bruker to acquire the NanoString business in an asset deal

Novel UV Broadband Spectrometer Revolutionises Air Pollutant Analysis

Artificial intelligence boosts super-resolution microscopy

analytica 2024 confirms its position as the world’s leading trade fair for the laboratory sector

Discovery of the first fractal molecule in nature

Scientists accelerate spectroscopic analysis

Electrified bacteria: Method developed for faster determination of antibiotic resistance

Flexible microspectrometer for mobile applications

New insights into the genetics of the common octopus: genome at the chromosome level decoded

Advanced imaging techniques on a semiconductor material reveal ‘surprising’ hidden activity

Non-destructive on-site analysis of organic substances using a small modular spectrometer platform

Researchers 3D print key components for a point-of-care mass spectrometer

More news from our other portals

Nadja Håkansson appointed Chief Executive Officer of thyssenkrupp Uhde

The mother's protein intake affects the newborn's face

Children, pregnant women and breastfeeding mothers should avoid Brazil nuts

Researchers create new chemical compound to solve 120-year-old problem

Unfavourable carbohydrates early in the morning - a potential problem for "owls"

Map of feelings of happiness

Research team develops sodium battery capable of rapid charging in just a few seconds

Innovative treatment helmet from a Basel spin-off promises advances in the treatment of Alzheimer’s

Take it from the rats: A junk food diet can cause long-term damage to adolescent brains

Tests confirm quality of purified graphite from used lithium-ion batteries

Microbe of the Year 2023: Bacillus subtilis – for health and technology

Krombacher Brewery launches three unusual beer mix drinks with party liqueur Berliner Luft

“Restore trust and get Bayer on track for better performance”

Breakthrough in research on brown fat

Asahi Brands relies on Warsteiner

Novel hydrogel removes microplastics from water

Memory self-test via smartphone can identify early signs of Alzheimer’s disease

Did you know that parents should be careful with rice cakes?

Longer-lasting and more sustainable green hydrogen production

How a “date” between immune cells makes rheumatism disappear

Newly sequenced genome reveals coffee’s prehistoric origin story — and its future under climate change

BASF has started prototype metal refinery for battery recycling

Common household chemicals pose new threat to brain health

Opening a window to the food industry’s future: the world’s first factory growing food out of thin air launches

Chemists pioneer work to reduce carbon emissions

AI designs new drugs based on protein structures

Size of salty snack influences eating behavior that determines amount consumed

BASF, SABIC, and Linde celebrate the start-up of the world's first large-scale electrically heated steam cracking furnace

Game changer in fighting cancer: BioCopy expands corporate portfolio with acquisition of start-up Perspix Biotech

E-tongue can detect white wine spoilage before humans can

Recognise, understand, heal: The World of Diagnostics