
The challenges of curating publicly available single-cell data
....
Semi-structured, raw scRNA-seq data from public repositories are difficult to retrieve and integrate together for cell-type and cell-function annotation exercises. Each repository processes data differently and may lack the adequate metadata annotations which directly affects findability of these datasets.
There is a lack of standards for the deposition of cell-level metadata. Although guidelines have recently been proposed for single-cell data deposition, these guidelines have primarily focused on describing experimental aspects of the study. In most cases, even the cell types assigned by investigators for each cellular barcode are not mentioned.
Preliminary exploration or analysis of single cell data has extensive memory requirements. Also, researchers need to spend critical amounts of time downloading the data, packages, and libraries to a computational environment.
Analysis and insight generation from different pipelines written by different users is often counterproductive to reproducibility. More importantly, comparing and interpreting different datasets requires a standard processing pipeline.
These lead to major challenges in accessibility, harmonization, and standardization, hindering researchers and organizations from effectively utilizing the data.
Why work with us?
One-stop solution for your single-cell multimodal data curation problems.
104,075,430 cells
11,543 samples
413 tissues
1,099 datasets
5,172 disease samples
171 diseases

Our approach
Pythiomics is hosted in CDIAM Multi-omics Studio, you can interactively explore the data through an easy-to-use graphical UI as well as a rich package of state-of-the-art machine learning algorithms and analysis workflows.
Pythiomics
Single-cell RNA-seq data has been a key focus of Pythiomics. Explore some quick stats of our single-cell database below.
104,075,430 cells
11,543 samples
413 tissues
1,099 datasets
5,172 disease samples
171 diseases

Custom services for
single-cell multimodal data curation
Pythiomics brings all those data into one single place, with standard formats and GUI for exploration. It is currently incorporating data from different omics types, including Bulk RNA-seq, Single-cell RNA-seq, Proteomics, Spatial Transcriptomics, and from different public databases.
Bulk RNA
Visium HD & Xenium
Coming soon
Single-cell RNA
Metabolomics
Coming soon
Proteomics
ATAC-Seq & CITE-Seq
Coming soon
Visium Spatial
CHIP-Seq
Coming soon
CosMx
Mutation & GWAS
Coming soon
