Uncharted territory in the human genome

18-Jul-2022 - Germany

An international consortium brings together 7,200 segments of the human genome that are virtually unexplored and presents a roadmap for integrating them into genome databases in “Nature Biotechnology”. They could hold information about what sets humans apart from other animals.

When researchers working on the Human Genome Project completely mapped the genetic blueprint of humans in 2001, they were surprised to find only around 20,000 genes that produce proteins. Could it be that humans have only about twice as many genes as a common fly? Scientists had expected considerably more.

Now, researchers from 20 institutions worldwide bring together more than 7,200 unrecognized gene segments that potentially code for new proteins. For the first time, the study makes use of a new technology to find possible proteins in humans – looking in detail at the protein-producing machinery in cells. The new study suggests the gene discovery efforts of the Human Genome Project were just the beginning, and the research consortium aims to encourage the scientific community to integrate the data into the major human genome databases.

The study, recently published in “Nature Biotechnology”, was co-led by Dr. Jorge Ruiz-Orera from the Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC) in Germany, Dr. Sebastiaan van Heesch from the Princess Máxima Center for pediatric oncology in the Netherlands, Dr. Jonathan Mudge from the European Molecular Biology Laboratory – European Bioinformatics Institute (EMBL-EBI) in the United Kingdom, and Dr. John Prensner from the Broad Institute of MIT and Harvard in the United States.

New gene sequences remained out of reach

In the past few years, thousands of open reading frames (ORFs), many of them very small, have been discovered in the human genome. These are spans of DNA sequence that may contain instructions for building proteins. Several authors of the current study have previously found ORFs and described them in scientific journals: Van Heesch, together with MDC professors Norbert Hübner and Uwe Ohler, described new mini-proteins in the human heart in “Cell” in 2019; Prensner also published on ORFs in “Nature Biotechnology” in 2021. Yet none of these virtually unexplored segments were subsequently included in reference databases. Other sequences were reported in journals such as “Science” or “Nature Chemical Biology”, but remained largely out of reach for most members of the scientific community – despite evidence that they produce RNA molecules that subsequently bind to ribosomes, the cell’s protein factories.
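To make the idea of an ORF concrete, the following is a minimal sketch in Python of how a candidate ORF can be spotted in a DNA string: a start codon (ATG) followed in the same reading frame by a stop codon. It is an illustration only and not the consortium's method, which relies on experimental ribosome profiling evidence rather than sequence scanning alone. The function name `find_orfs` and the `min_codons` parameter are hypothetical, and the sketch scans only the forward strand.

```python
# Illustrative only: a naive forward-strand ORF scan, not the study's pipeline.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(dna, min_codons=2):
    """Return (start, end, sequence) tuples for forward-strand ORFs.

    Positions are 0-based and end-exclusive; the stop codon is included.
    min_codons counts the start codon plus following codons, excluding the stop.
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):  # three possible reading frames on one strand
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None:
                if codon == "ATG":  # first start codon opens a candidate ORF
                    start = i
            elif codon in STOP_CODONS:  # in-frame stop codon closes it
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, dna[start:i + 3]))
                start = None
    return orfs


if __name__ == "__main__":
    for start, end, seq in find_orfs("CCATGAAATGGTAACC"):
        print(start, end, seq)
```

A scan like this finds far more candidates than are ever translated, which is why experimental evidence that ribosomes actually engage a sequence matters for deciding which ORFs to catalogue.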

Traditionally, protein-coding regions in genes have been identified by comparing DNA sequences from multiple species: the most important coding regions have been preserved during animal evolution. But this method has a drawback: coding regions that are relatively young, i.e., that arose during the evolution of primates, fall through the cracks and are therefore missing from the databases.

So now the task is to integrate the largely ignored ORFs into the largest reference databases, because researchers have so far had to specifically search for them in the literature if they wanted to study them.

As a first step, the international research team collected information on sequences that had been discovered using ribosome profiling – a technique that determines which part of the messenger RNA (mRNA) the ribosome interacts with. They then assembled the data into a standardized catalogue. This was no small feat, as data obtained in a wide variety of ways from different laboratories cannot simply be combined.

Once this was accomplished, the international consortium labored over central questions that define our very notion of the human genome: What is a gene? What is a protein? Do we need flexible notions of whether ribosomes always produce a protein or rather some other cellular output?

The group now calls for the human genome databases used by scientists worldwide to be revised. Ensembl-GENCODE are configuring this ORF catalog as a component of their reference annotation database. The approach will be supported by many others like UniProt, HGNC, PeptideAtlas and HUPO.

ORFs likely play a role in common diseases

Dr. Sebastiaan van Heesch, group leader at the Princess Máxima Center for pediatric oncology, says: “Our research marks a huge step forward in understanding the genetic make-up and complete number of proteins in humans. It’s tremendously exciting to enable the research community with our new catalog. It’s too soon to say whether all of the unexplored sections of DNA truly represent proteins, but we can clearly see that something unexplored is happening across the human genome and that the world should be paying attention.”

“For too long, the scientific community has been mostly left in the dark about these ORFs,” says Jonathan Mudge of the EMBL-EBI. “We’re very proud that our work will be able to let researchers across the world start to study them. This is the point at which they enter the mainstream of genomic and medical science – an effort which we expect to have wide-ranging ripple effects.”

“It is especially remarkable that most of these 7,200 ORFs are exclusive to primates and might represent evolutionary innovations unique to our species,” reports Jorge Ruiz-Orera, an evolutionary biologist working in Hübner’s lab at the MDC. “This shows how these elements can provide important hints of what makes us humans.”

So, what’s next? John Prensner, Broad Institute of MIT and Harvard, says: “These ORFs almost certainly will be contributing factors to many human traits and diseases, both rare diseases and common ones such as cancer. The challenge is now to figure out which ones have which roles in which diseases.”

Original publication

Jorge Ruiz-Orera et al. (2022): “A community-driven roadmap to advance research on translated open reading frames”. Nature Biotechnology

https://www.analytica-world.com/en/news/1176902/unchartered-territory-in-the-human-genome.html
