Protein Identifier Cleaning and Annotation

Published

Jun 2026

Protein identifiers must be cleaned and mapped before enrichment or network analysis.

Common Identifiers

Proteomics tables may contain:

UniProt accession
UniProt protein name
gene symbol
Ensembl gene ID
RefSeq accession
protein description

Why Identifier Cleaning Matters

Incorrect or inconsistent identifiers can affect:

  • GO enrichment
  • pathway enrichment
  • STRING network mapping
  • candidate protein prioritization