wetSpring Validation Results

235+ bioinformatics checks across 8 workloads, dispatched through a live 13-primal NUCLEUS composition with full provenance chains.

Overview

10 workloads dispatched through ToadStool on a live 13-primal NUCLEUS composition. 235+ structured science checks passed across 6 bioinformatics domains. Real NCBI data (11.9M paired-end reads, PRJNA488170) processed through both Python and Rust pipelines. Every result provenance-verified.

Hardware: Intel i9-14900K, 96 GB DDR5, RTX 4070 / RTX 3090 Composition: Full NUCLEUS (13 primals) Provenance: DAG → Merkle root → loamSpine ledger → sweetGrass braid


Results by Domain

16S Microbiome Pipeline

WorkloadChecksDataDuration
16S Rust Validation37/37 PASSSynthetic pipeline vectors<1s
Algae 16S (real data)34/34 PASSSRR7760408 — 11.9M reads23s
Python 16S BaselineSUCCESSSRR7760408 — 50K reads1s

Full DADA2 denoising, chimera detection, taxonomy assignment, UniFrac distances. Python→Rust parity at tol=0.000000 for Shannon and Simpson diversity indices. Source: Nannochloropsis outdoor 16S (Wageningen, DOI: 10.1007/s00253-022-11815-3).

Diversity Metrics

WorkloadChecksDomains Covered
Diversity Indices27/27 PASSAlpha diversity, beta diversity, PCoA
R Industry Parity53/53 PASSvegan 2.7.3, DADA2 1.22.0, phyloseq 1.38.0

53 checks against R gold-standard packages at exact parity — Shannon, Simpson, Bray-Curtis, rarefaction, Chao1, Pielou, UniFrac (weighted + unweighted), cophenetic distances.

Pharmacometrics & Physics

WorkloadChecksDomains Covered
Gonzales CPU Parity43/43 PASSHill equation, PK decay, Anderson spectral
Python BenchmarkSUCCESSCross-domain timing baseline

IC50→barrier mapping, dose-response modeling, Anderson 2D/3D lattice computations with deterministic seed parity.

Immunology & Metagenomics

WorkloadChecksReference
Fajgenbaum Pathway8/8 PASSJCI 2019 — PI3K/AKT/mTOR, sirolimus ranking
Cold Seep Pipeline8/8 PASSRuff et al. — 50 synthetic communities
Real NCBI Pipeline25/25 PASSSovereign diversity + Anderson

Fajgenbaum correctly identifies PI3K/AKT/mTOR as highest-activation pathway (0.92) and ranks sirolimus #1 for drug repurposing.


Provenance Chain

Every workload result is tracked through a 9-phase provenance pipeline:

PhaseOperationPrimals Used
1Health check all primalsAll 13
2Create rhizoCrypt DAG sessionrhizoCrypt
3Create loamSpine spineloamSpine
4Register NCBI data artifacts (BLAKE3)NestGate, rhizoCrypt
5Execute workloads with DAG trackingToadStool, rhizoCrypt
6Dehydrate DAG → Merkle rootrhizoCrypt
7Commit to permanent ledgerloamSpine
8Create attribution braidsweetGrass
9Write manifest + braid JSONFilesystem

The braid is PROV-O compliant with DID attribution and ed25519 witness signature from BearDog’s Tower-tier key hierarchy.


BLAKE3 Data Artifacts

Every input is content-addressed before pipeline execution:

ArtifactBLAKE3 Hash (prefix)Size
SRR7760408 R16250f200f9ff45e0...2.1 GB
SRR7760408 R2cd89f43d74d09c64...2.4 GB
SRR5534045 R1096878541679cd06...444 MB
SRR5534045 R2bee510af71ac9149...451 MB

Source: NCBI SRA accessions SRR7760408 (PRJNA488170) and SRR5534045 (PRJNA382322). Anyone can download the same reads and verify the hashes.


Interactive Exploration

The Gonzales Interactive Explorer provides live charts for the IC50, PK decay, tissue geometry, hormesis, and cross-species data validated in the Gonzales CPU Parity workload above.


Reproduce These Results

See Reproduce It Yourself for step-by-step instructions to run the same workloads on your own hardware.