Documentation

untangle.bio is an AI-native platform for downstream process design in biotechnology. Generate optimal purification routes, run real-time simulations, and perform techno-economic analysis — all in one workspace.

Quick Start

Get up and running in under 5 minutes. Open the app — the canvas greets you with an empty state and a single call to action: Start here. Click it to launch the guided wizard, which walks you through feed definition, target selection, and route generation in one flow.

🚀 Click "Start here"

The canvas empty state shows a single button. Click it to open the guided wizard — no setup required.

🔬 Define Feed & Targets

Enter your feed components from 300+ molecules, set flow rate, and select the products you want to recover.

📊 Review & Pick a Route

Browse ranked routes by yield, purity, and CAPEX. Apply one to the canvas, inspect it, then return to pick another.

Pro tip: Start with the Balanced optimization mode for your first project. It provides a good mix of yield and purity while keeping costs reasonable.

Scope & Limitations

untangle.bio is a conceptual process design tool, intended for early-stage route screening and feasibility assessment — not for detailed engineering or final process validation. Understanding what the simulator does and does not model will help you interpret results correctly.

What the simulator does

Steady-state mass balances across each unit operation based on separation efficiency, rejection coefficients, and component properties
Flow and concentration tracking through every stream in the flowsheet
pH-dependent solubility checks and precipitation warnings
Yield and purity estimates for each product at every step
Indicative capital and operating cost ranges based on literature-derived correlations

What the simulator does not do

No rigorous thermodynamics — phase equilibria, activity coefficients, and equation-of-state calculations are not performed. Real mixture non-idealities (salting-out effects, co-precipitation, ternary phase diagrams) are not captured.
No detailed transport modelling — concentration polarisation, fouling kinetics, and gel-layer effects in membrane operations are approximated by fixed rejection parameters rather than solved from first principles.
No chromatography band profiles — chromatographic separations are represented by an overall recovery and purity factor, not by breakthrough curves or plate-height models.
No reaction kinetics — enzymatic reactions, degradation, and aggregation during processing are not modelled.
Simplified crystallisation — crystal yield is estimated from a user-defined recovery fraction rather than nucleation and growth kinetics or supersaturation profiles.
No hydrodynamics — pressure drops, pump sizing, pipe velocities, and fluid dynamics are outside scope.

Engineering interpretation required. Results should be treated as indicative order-of-magnitude estimates. Promising routes identified by untangle.bio should be validated with detailed process modelling, pilot-scale experiments, and consultation with separation specialists before making engineering or investment decisions.

Start Here Wizard

The recommended entry point for new sessions. Click Start here on the empty canvas (or Start here in the toolbar) to open the guided wizard. It bundles feed definition, target selection, and route generation into a single step-by-step dialog so you can go from a blank canvas to a ranked list of routes in minutes.

Wizard steps

Feed stream — set flow rate, pH, temperature, and add components from the molecule database or manually.
Target products — select one or more components you want to recover. For multiple products, the generator finds branching routes that separate each product.
Generation settings — choose optimization goal (diversity, balanced, yield, purity, simplest, or high selectivity), minimum thresholds, and algorithm (evolutionary is the default and recommended).
Pick a route — results stream in live. Browse the list, inspect the 3D yield/purity/CAPEX scatter plot, and click Apply to canvas on the route you want to explore.

After applying a route: The canvas is populated with the full process flowsheet and you can inspect every stream and unit operation. The Back to Results button in the toolbar lets you return to the results list, remove the current route from the canvas, and pick a different one. See Back to Results for details.

Platform Workflow

untangle.bio follows a proven engineering workflow that mirrors how process engineers actually work — from initial feed characterization to final economic evaluation.

1. Feed Stream Definition

Define your input stream with volumetric flow rate and component specifications. The platform includes an extensive molecule database with physical properties for accurate modeling:

Flow rate: L/hr, with automatic unit conversion
Components: Concentration (g/L), molecular weight, charge state
Properties: pKa, isoelectric point, diffusion coefficient
pH and temperature: Critical for precipitation modeling

2. Target Product Selection

Select one or multiple target products from your feed components. untangle.bio optimizes routes for maximum recovery and purity of specified products, with support for complex multi-product separations.

3. Route Generation

The AI engine generates thousands of candidate routes using a diversity-preserving genetic algorithm. Set constraints and optimization goals:

Minimum yield: Typically 10-90% depending on application
Minimum purity: Product specification requirements
Optimization mode: Yield, purity, balanced, or high selectivity
Algorithm: Quick enumeration or evolutionary search

4. Simulation & Mass Balance

Run rigorous mass balances with stream-level tracking of concentrations, pH, and flow rates. The simulation engine handles:

Conservation of mass and volume at every node
pH propagation through mixing and chemical addition
Precipitation warnings based on solubility limits
Real-time feasibility checking

Results panel: When a unit operation node is selected, the Properties panel opens with the Results tab active by default — showing stream concentrations, yield, purity, and flow rate at a glance. Switch to Parameters to adjust operating conditions.

Back to Results

After applying a generated route to the canvas you may want to compare it visually with alternatives before committing. The Back to Results button in the toolbar makes this frictionless:

Apply a route to the canvas from the wizard or generator dialog.
Inspect the flowsheet — check stream labels, unit operation results, and the properties panel.
Click Back to Results in the toolbar to remove the current route from the canvas and reopen the results list with all previously generated routes still shown.
Pick a different route and apply it, or re-run generation with new settings.

The results list is saved in memory for the current session. It is cleared when you start a new project or close the browser tab.

Unsaved changes: Clicking Back to Results removes the currently applied route from the canvas. Any manual edits made after applying (added nodes, changed parameters) will be lost. Use Ctrl + Z after returning if you change your mind.

Building Your Own Process Flow

Alongside the automated route generators, you can build and edit process flowsheets entirely by hand — drag nodes onto the canvas, wire them together, and run the simulation yourself. This is useful when you want to test a specific sequence, reproduce a literature process, or make targeted modifications to a generated route.

Step 1 — Place a Feed Stream

Drag a Feed Stream node from the left palette onto the canvas. Double-click it to open the feed configuration dialog. Set the volumetric flow rate, temperature, and pH, then add your components — either from the built-in molecule database or as custom entries with manually entered properties.

Step 2 — Add Unit Operation Nodes

Drag one or more unit operation nodes from the palette. Available operations are grouped by category:

Upstream / Fermentation: Stirred tank bioreactor, fed-batch bioreactor, perfusion bioreactor, continuous bioreactor (chemostat), air-lift fermentor, continuous heat sterilizer
Reaction & Mixing: Mixing vessel
Clarification: Disc centrifuge, depth filtration, microfiltration, flocculation/coagulation, Nutsche filter, basket centrifuge
Purification: Ultrafiltration (10 kDa / 30 kDa MWCO), cation exchange, anion exchange, affinity, size exclusion, hydrophobic interaction (HIC), reverse osmosis, electrodialysis, distillation, liquid-liquid extraction, precipitation
Polishing: Nanofiltration, reverse phase, crystallization, activated carbon adsorption, viral inactivation
Drying: Spray drying, freeze drying, vacuum tray drying, thin-film evaporator, fluid bed dryer

A separate Sources & Sinks section at the top of the palette holds the feed, product and waste nodes plus reagent feeds — wash water (💧), NaOH solution (🔵), and HCl solution (🔴). These are covered in steps 1, 4 and 5.

Double-click any unit operation to configure its parameters (MWCO, pH target, wash volume, etc.).

Step 3 — Switch to Connect Mode

Press C to enter Connect Mode (the current mode is shown in the status bar at the bottom of the window). In this mode, hovering over a node reveals its connection handles. Click and drag from one handle to another to draw a stream edge.

Handle	Position	Meaning
`output`	Right side of feed node	Feed stream outlet — connect to the first unit operation's `input`
`input`	Left side of unit operation	Main process inlet
`light`	Right side of unit operation	Light-phase outlet — permeate, filtrate, mother liquor, volatiles
`heavy`	Bottom of unit operation	Heavy-phase outlet — retentate, concentrate, solid, crystals
`dilution`	Top of filtration nodes	Auxiliary water inlet for diafiltration — connect a Wash Water node here

Tip: Press V to return to Select Mode for moving nodes around. Use Ctrl + Z / Y for undo/redo.

Step 4 — Add Product and Waste Nodes

Every outlet of every unit operation must terminate at either a Product node or a Waste node — the simulator validates this before running. Drag these from the palette (Sources & Sinks section) and connect them to the appropriate outlets.

Connect the outlet carrying your target product to a Product node
Connect all other outlets to a Waste (Wastewater Treatment) node
Multiple unit operations can share the same Waste node

Step 5 — Add Reagent Feeds (optional)

To model diafiltration or pH adjustment, drag reagent feed nodes from the palette and connect them to the appropriate inlets:

💧 Wash Water → connect to the dilution handle on any filtration node
🔵 NaOH Solution → connect as an additional inlet for pH increase
🔴 HCl Solution → connect as an additional inlet for pH decrease

Step 6 — Run the Simulation

Press F5 or click the Recalculate button in the toolbar. The simulator performs a steady-state mass balance through every node in sequence, propagating concentrations, flow rates, and pH along every stream. Results appear as labels on stream edges and as summary panels on each unit operation node.

Validation errors: If the simulator reports dangling outlets or unconnected streams, check that every outlet handle on every unit operation is connected to either a downstream node, a Product node, or a Waste node. Unconnected outlets prevent the simulation from running.

Feed Definition

Accurate feed characterization is critical for reliable route optimization. untangle.bio provides comprehensive tools for defining complex biotechnology feeds.

Component Database

The platform includes 300+ pre-characterized molecules across key categories:

Proteins: Antibodies, enzymes, therapeutic proteins
Organic acids & amino acids: Citric, acetic, lactic, and others
Sugars & polysaccharides: Glucose, sucrose, complex carbohydrates
Salts & alcohols: Buffer components, ionic species, solvents
Cells: E. coli, CHO, yeast with size distributions
Small molecules: Vitamins, antibiotics, lipids, polyphenols, terpenes

Database integration: Clicking any molecule automatically populates all relevant properties for separation modeling, including molecular weight, charge, and transport properties.

Route Generation

untangle.bio uses advanced algorithms to explore the vast space of possible purification sequences and identify optimal routes based on your criteria. The evolutionary algorithm is the default and recommended mode — it returns only feasible routes (meeting both yield and purity thresholds) and streams results live to the UI as each simulation completes.

Genetic Algorithm Approach

The platform employs a diversity-preserving genetic algorithm optimized for breadth rather than convergence:

Population size: 600 genomes for maximum diversity
Generations: 15 for single-product runs, 40 for multi-product, each with fresh injection (~14% new genomes per generation)
Selection: Tournament selection with low elitism (2%)
Mutation: Multi-type operations (add, remove, replace steps)
Only feasible routes returned: Routes failing yield or purity thresholds are excluded. If zero feasible routes are found, the top 5 infeasible routes are shown with clear warnings.

Optimization Goals

Choose the optimization goal to guide the search:

Diversity (default) — explore the widest range of process options with maximum variety
Balanced — good mix of yield and purity (Yield × Purity)
Yield — maximize mass recovery of product
Purity — maximize product concentration relative to impurities
Simplest — favor shorter routes with fewer unit operations
High Selectivity — only allow routes where every step enriches the product over impurities (selectivity > 1 at every step)

Expert Rules Integration

All generated routes pass through 30+ expert rules that eliminate physically impossible or economically infeasible combinations:

Chromatography requires water ≥ 500 g/L and particles < 1 g/L; prior clarification is required when cells are present
Membrane operations require appropriate particle size reduction first
Crystallization requires supersaturation (concentration > solubility limit)
Size-based separation must follow large-to-small ordering
Reverse phase blocked for proteins (MW > 1500 Da) — causes denaturation
Organic acids must be at low pH (<6) for liquid-liquid extraction to work
Nutsche filter and basket centrifuge require solids > 2–5 g/L
Fluid bed dryer requires granular feed or prior solid-forming step

Simulation Engine

The simulation engine performs rigorous mass and energy balances with real-time validation of process feasibility and stream compatibility.

Mass Balance Methodology

untangle.bio uses a mass-flow-based approach for accurate modeling:

// Convert to mass flows
mass_flow = concentration × volumetric_flow

// Apply separation efficiency
retained_mass = mass_flow × rejection_coefficient
permeate_mass = mass_flow × (1 - rejection_coefficient)

// Enforce conservation
total_out = retained_mass + permeate_mass
assert(total_out == mass_flow_in)

pH Tracking

pH is tracked throughout the entire process with buffer capacity weighting:

Volume-weighted mixing of streams
Chemical addition effects (NaOH, HCl)
Precipitation warnings near isoelectric points
Henderson-Hasselbalch equation for acid solubility

Techno-Economic Analysis

Built-in cost estimation provides immediate economic feedback on route alternatives using industry-standard methodologies.

Capital Cost (CAPEX)

Equipment costs are scaled using the power law, with an exponent that varies by operation type — generally following the "six-tenths rule" but calibrated individually to each technology class:

CAPEX = Base_Cost × (Flow_Rate / Reference_Rate)^n
Total_CAPEX = Σ(Equipment_Cost × Lang_Factor)

The exponent n is not a fixed 0.6 for all equipment — it is calibrated per technology class. Chromatography columns and membrane systems (area-limited equipment) scale more favourably than thermal or cryogenic systems. As a rough guide: membrane and column operations sit in the lower range (~0.55–0.65), mechanical separators in the middle, and drying operations — especially freeze drying — at the higher end (~0.70–0.75).

Lang factors (1.5–3.0×) account for installation, instrumentation, and auxiliary equipment based on operation complexity. All base costs are referenced at 100 L/hr feed rate (2026 USD).

Operating Cost (OPEX)

Annual OPEX is built up from several itemized components:

Utilities: Electricity, steam, and cooling water — each with configurable unit prices
Consumables: Resins, membranes, filters, and process chemicals
Maintenance: 5% of FCI annually (industry standard)
Labor: Base operators plus 0.25 additional operators per unit operation beyond two
QC/QA: Facility-type-specific rate — pharma 50%, food & beverage 20%, chemical 15% of labor cost

Working Capital

Working capital is itemized rather than estimated as a flat fraction of FCI:

Accounts receivable: 45 days of annual revenue
Product inventory: 30 days of COGS
Cash reserve: 2% of annual OPEX

TEA Analysis Views

Open the Economic Analysis dialog from the toolbar. Inside it, the analysis is split across several tabs:

Summary / CAPEX / OPEX: Full detailed TEA — CAPEX and OPEX breakdowns with working capital and cost waterfall
Scale-up: Cost vs. annual throughput curves — shows economies of scale
Investor: NPV, IRR, and payback period under configurable revenue and discount rate assumptions
Sensitivity: Tornado charts showing which cost drivers (flow rate, yield, price) have the largest impact on project economics

Multi-Product Routes

untangle.bio supports complex separations where multiple valuable products are simultaneously recovered from a single feed stream through branching routes. Each product is tracked individually for yield and purity and exits at a dedicated product node.

Branching Logic

At every two-outlet unit operation, each product is assigned to whichever physical stream carries more of its mass — heavy (retentate/solid) or light (permeate/filtrate). Products that end up in different streams at the same step are considered separated at that step and branch into their own product nodes. Products that remain together continue downstream together.

Heavy outlet: retentate, concentrate, solid, precipitate
Light outlet: permeate, filtrate, mother liquor, volatiles
Each product is tracked through every step individually
A product node is created at the step and outlet where each product first separates

Metrics

Yield — total mass recovery: mass of all target products recovered / mass of all target products in feed
Purity — best individual product purity achieved, measured at each product's own exit step (excluding water)

Design constraint: For N selected products, the route must produce exactly N distinct product nodes — each product must exit through a unique (step, outlet) combination. Routes that fail to separate all products are automatically rejected.

Multi-Product Generation Algorithm

Multi-product routes are generated by the same diversity-preserving genetic algorithm used for single-product runs — the generator simply switches to a multi-product fitness function when you select two or more targets. There is no separate constructive search; the wizard and generator both call the evolutionary streaming engine for any number of products.

Multi-product fitness

When more than one target is selected, a route's reported yield and purity are the average across all target products. A route that recovers one product well but fails to separate a co-product therefore scores low rather than being hidden — it still appears in the results list and 3D plot, flagged as infeasible, so you can see why it fell short.

Each product is tracked individually through every step of the simulated route.
Branching is encoded in the genome's outlet-handle genes — at each two-outlet operation a product follows whichever stream (heavy or light) carries more of its mass, letting products split into their own product nodes.
The algorithm runs a larger 40-generation search for multi-product problems (versus 15 for single-product) to explore more branching combinations, with relaxed default thresholds (min purity 60%, min yield 40%) to account for the added complexity.

Simulation & validation

Every candidate genome is passed through the full mass-balance simulation engine and gated the same way as single-product routes:

Expert rules — evaluated against the actual stream composition at each step inlet (not just the feed). Violations are rejected immediately.
Products must separate — routes are scored on how well each target ends up at a distinct outlet; those that keep products together score poorly on the averaged purity/yield.
Purity & yield thresholds — routes are streamed to the UI as they finish, tagged as feasible or infeasible relative to your targets.

Streaming results: Routes are yielded to the UI as soon as each simulation completes — you see results appear live without waiting for the full population to finish.

Expert Rules System

The platform incorporates decades of downstream processing knowledge through 30+ expert rules that prevent infeasible designs. Rules are evaluated against the actual stream composition at each step — not just the feed — so violations caused by upstream operations are also caught. The rules below are representative examples, not the full set.

Core Prerequisites

Chromatography prerequisites: Requires water ≥ 500 g/L and particles < 1 g/L; prior clarification is required when cells are present
Membrane fouling prevention: UF/NF with cells requires prior clarification
Crystallization thermodynamics: Concentration must exceed compound-specific solubility (from molecule database)
Drying constraints: Must be the final operation in any route
Consecutive duplicate rejection: Same unit operation type cannot appear back-to-back
Concentrate before drying: Solids must be > 50 g/L before any drying step

Practical Rules (additional examples)

No reverse phase for proteins: Blocked for MW > 1500 Da — causes irreversible denaturation
Maximum chromatography steps: No more than 3 chromatographic operations (cost and cycle time)
Redundant clarification: Maximum 2 consecutive clarification operations
Size-based separation order: Must follow large → small (centrifuge → MF → UF → NF)
Concentrate before crystallization: Requires ≥ 50% of compound solubility limit
LLE requires low pH for organic acids: At pH > 6, organic acids are fully ionized (A⁻) and won't partition into organic phase
LLE not after drying: No aqueous phase remains after a drying step
Nutsche filter requires solids: Needs > 5 g/L solids in feed
Basket centrifuge requires solids: Needs > 2 g/L solids in feed
Fluid bed dryer requires granular feed: Needs prior solid-forming step or > 50 g/L solids

Selectivity

Selectivity (α) measures how well each unit operation enriches the target product relative to impurities. It is calculated at every step and shown in the route results panel.

α = (product concentration factor) / (impurity concentration factor)
concentration factor = C_out / C_in

α > 1 → step enriches product over impurities (good)
α = 1 → no selective separation
α < 1 → step enriches impurities more than product

Use High Selectivity optimization mode to restrict results to routes where every single step achieves α > 1.0. The route list and 3D scatter plot include filter tabs to show only routes with fully monotone selectivity profiles.

Unit Operations Reference (35 total)

All 35 operations have 3–5 configurable parameters (accessible by double-clicking the node) with validated scientific defaults used by the process generator.

Category	Operations	Outlets
Clarification	Disc centrifuge, depth filtration, microfiltration, flocculation, Nutsche filter, basket centrifuge	2 (light + heavy)
Purification	UF 10k, UF 30k, cation/anion exchange, affinity, size exclusion, HIC, reverse phase, precipitation, distillation, reverse osmosis, electrodialysis, liquid-liquid extraction	2 (light + heavy)
Polishing	Nanofiltration, crystallization, activated carbon adsorption, viral inactivation	2 for NF/crystallization; 1 for viral inactivation, activated carbon
Drying	Spray drying, freeze drying, vacuum tray drying, thin-film evaporator, fluid bed dryer	2 (solid/heavy + volatiles/light)
Upstream / reaction	Stirred tank, fed-batch, perfusion, continuous (chemostat), air-lift bioreactors; continuous heat sterilizer; mixing vessel	1 (single outlet)

Two-outlet operations produce a heavy stream (retentate, concentrate, solid, crystals) and a light stream (permeate, filtrate, mother liquor, volatiles). Both outlets must be connected to a downstream node, product node, or waste node before the simulation will run.

Molecule Database

The built-in molecule database currently covers a limited set of common biotech components — proteins, sugars, organic acids, amino acids, salts, alcohols, and cell types. It is actively being expanded over time based on user feedback and real-world process cases.

For testing purposes: If your molecule is not in the database yet, you can add it manually directly in the feed stream dialog. Enter the component name and as many physical properties as you know (MW, charge, solubility, pKa, log P, etc.). The simulator will use whatever properties you provide — missing values are handled gracefully, though accuracy improves with more complete data.

Note that molecules added this way are local to your simulation only — they are not automatically added to the central database. To request a molecule be added for all users, reach out via LinkedIn.

Property Categories

Basic: MW, charge, typical concentrations
Solubility: Water solubility, pH-dependent solubility
Transport: Diffusion coefficients, viscosity effects
Thermodynamic: Heat capacity, formation enthalpy
Chemical: pKa values, log P, isoelectric points

Suggest a Molecule

The database is continuously expanding. If you work with a molecule that is missing, reach out on LinkedIn — feedback from practitioners directly shapes what gets added next.

Corjan van den Berg — Revyve

AI Connector (MCP)

untangle.bio ships a Model Context Protocol (MCP) server, so you can drive the engine — generate purification routes, simulate mass balances, and run techno-economic analysis — directly from your own AI assistant. Inference runs on your model and plan; the connector only answers tool calls, and never holds an API key on your behalf.

Endpoint: https://mcp.untangle.bio/mcp (Streamable HTTP). The connection is authorized once via OAuth, after which the tools appear in your assistant's tool menu.

Using Claude

If you use Anthropic's Claude, the button below opens the Add custom connector dialog with the name and endpoint already filled in — just review and confirm, then complete the one-time OAuth sign-in. No copy-pasting the URL into settings.

Add Untangle to Claude →

On a Team or Enterprise plan? Individual members can't add custom connectors themselves — a workspace Owner must add it once for the organization first. The Owner uses this link: https://claude.ai/admin-settings/connectors?modal=add-custom-connector&connectorName=Untangle&connectorUrl=https%3A%2F%2Fmcp.untangle.bio%2Fmcp After that, each teammate opens Settings → Connectors, finds Untangle (labeled "Custom"), and clicks Connect to authorize with their own account.

Using ChatGPT

ChatGPT doesn't yet support a one-click install link, so you add the server manually (a quick, one-time step). In ChatGPT, enable Settings → Connectors → Advanced → Developer mode, then Connectors → Add, give it a name, paste the endpoint https://mcp.untangle.bio/mcp, choose OAuth, and create. Via the Responses API, pass the same URL in the request's tools array as an mcp tool ({"type": "mcp", "server_url": "https://mcp.untangle.bio/mcp"}).

Manual setup (advanced & other MCP clients)

Because MCP is a shared, vendor-neutral standard, the same endpoint works from every MCP client — only the place you paste it differs. In Claude (claude.ai or the Claude Desktop app) you can also add it by hand: open Settings → Connectors → Add custom connector, give it a name, and paste the URL; Claude walks you through the OAuth sign-in on first use. For any other MCP client — Cursor, Cline, Zed, custom agents built on the MCP SDKs, or the reference MCP Inspector — register it wherever that client lists MCP servers, using the HTTP/SSE (Streamable HTTP) transport and the same URL. In every case no per-vendor build or API key is required on your side.

Available tools

Tool	What it does
`list_unit_operations`	List available downstream unit operations.
`get_molecules`	List the built-in molecule database with physical properties.
`generate_processes`	Evolutionary search for single-product purification routes.
`generate_processes_multiproduct`	Branching flowsheets that recover 2+ products in parallel.
`simulate_separation`	Step-by-step mass and energy balance for one route.
`calculate_tea`	Detailed techno-economic analysis (CAPEX / OPEX / COGS / payback).
`tea_scale_analysis`	Sweep economics across a range of throughput scales.

A typical flow: get_molecules → generate_processes → simulate_separation on a promising route → calculate_tea on the simulated result.

Ready to start designing processes? Launch the workspace and begin optimizing your downstream operations.

Launch Workspace →