ChemEngine Datatools Sample

Sample Chemical Process Data for AI and Process Engineering Workflows

A downloadable, literature-anchored demo dataset with 64 representative operating points across 8 industrial reaction families. Use it to test schema fit, data ingestion, and notebook workflows before moving to the full ChemEngine Datatools catalog.

64

records

Representative benchmark operating points

8

reaction families

Ammonia, FT, methanol, reforming, WGS, EO, AN, polymerization

CC BY-NC 4.0

license

Safe to share internally for evaluation and non-commercial use

Preview Rows

reaction_idreaction_nametemp_kpressure_atmyield_pctsource_primary
CHEM-001Haber-Bosch ammonia synthesis673.0224.2314.9410.1039/D4SE00254G
CHEM-009Low-temperature Fischer-Tropsch synthesis493.022.4535.2710.1007/s11244-011-9700-3
CHEM-033Water-gas shift493.020.5676.4210.1021/acscatal.2c03871

Python Quick Start

import pandas as pd

df = pd.read_csv("chemical_process_sample.csv")
print(df.head())

summary = (
    df.groupby("family")[["temperature_k", "pressure_atm", "yield_pct"]]
      .agg(["min", "max", "mean"])
)
print(summary)

The full commercial ChemEngine Datatools catalog extends this pattern with deeper provenance, broader property coverage, and delivery formats tailored for ML pipelines and internal data platforms.

Important Note

This is a marketing-grade sample designed for evaluation, prototyping, and schema review. Operating windows and catalyst families are anchored to the cited literature, while the normalized rate constants are intentionally harmonized to a pseudo-first-order basis for cross-reaction comparability.

Do not use this sample as a direct substitute for plant design packages, relief studies, HAZOP decisions, or safety-critical kinetic validation.

Schema Highlights

reaction_idStable ChemEngine sample record ID
reaction_nameHuman-readable industrial reaction family
reaction_typeProcess / catalysis class
reaction_equationStoichiometric basis used for the record
temperature_k / pressure_atmRepresentative operating conditions
conversion_pct / selectivity_pct / yield_pctBenchmark performance metrics
activation_energy_kj_molLiterature-anchored apparent activation energy
rate_constant_s_invNormalized pseudo-first-order Arrhenius constant
enthalpy_reaction_kj_per_mol_rxnRepresentative reaction enthalpy on the written basis
source_primary / source_secondaryDOI or database references for traceability