AoMP
« Author Information
Table of Contents
Foreword
Foreword
History
Introduction & A Brief History of Molecular Computing
Foundations
Foundations
PrimerBio
Biology for Molecular Programmers
Central dogma & cell response to external stimuli
Compartmentalization & reactions?
Common DNA/RNA binding macromolecules (Box)
PrimerThermo
Basic Thermodynamics
Concept of entropy, enthalpy, Gibbs energy, Chemical Equilibrium
Thermodynamic energy landscapes, partition function
PrimerODE_CRN
Dynamic Models ODEs and Chemical Reaction Networks
Stochastic Models
Deterministic Models
Box on rule-based systems / process algebras
MethodMolProg
Experimental Methods to Program Molecules
Synthesis of DNA
Phosphoramidite (chemical synthesis)
Ligation-based, Gibson assembly
Single-stranded DNA production, e.g. asymmetric PCR, bead pulldown, phagemid production, cell production, RCA, etc
In vivo assembly, e.g. with yeast
Upcoming methods: PER, TdT-based
Scaled production: bacteriophage/cell culture/IVT+RT
Modifications/redox/thiol/maleimide, NHS, succinimide, amine, enzymatic approaches, biotinylation, fluorization, dyes and quenchers etc.
BOX/TABLE: Discussion/comparison of yield, error rates, costs, challenges (e.g. repetitive sequences, modifications) for different ways of writing DNA
Purification (HPLC/agarose and polyacrylamide electrophoresis/centrifugation)
Agarose/PAGE purification
Chromatography (HPLC, LC, maybe capillary?, ion exchange, size exclusion for origami purification, affinity like via his-tags)
Centrifugation (filter, gradient, ultracentrifugation)
Precipitation/binding methods: magnetic beads, PEG precipitation
Experimental methods to produce and modify RNA
Experimental methods to produce and modify proteins
MethodRxnSetup
Experimental Methods: Reaction Setup
MethodMicrosc
Experimental Methods: Microscopy
Microscopy (AFM, TEM/cryoEM/SEM, fluorescence/superresolution - PAINT)
Contrast mechanisms and their drawbacks (stains being inconsistent, AFM always convolutes tip, etc)
Spectroscopy (fluorescence/CD/etc)
Table comparing different methodologies: labelled vs unlabelled, sensitivities, throughput?Background reporter signal?
Quenchers
Photobleaching (things that kill fluorophores permanently)
EXTINCTION COEFFICIENTS. basics of beer’s law. assumptions re: additivity of nucleotides. nearest neighbor model for absorption. denatured vs native.
Mass Spec
Detecting large amounts of small molecules in solution
Understand mass spectra (e.g. oligo synthesis)
MethodAnalysis
Experimental Methods: Analysis
Gel electrophoresis
Intercalating dyes vs fluorophores or radiolabeling
Separation quality as a function of gel % and DNA length?
Role of buffer conditions
Pulse field gradient electrophoresis + Other advanced methods (reference)
Capillary electrophoresis
Denaturing Gels
Stacking Gels
Sequencing
Sanger sequencing
NGS - Illumina, PacBio, DNAnanoball,
Nanopore
Microarrays
Chemistry
From molecules to variables
InfoMols
What are information bearing molecules?
nucleic acids (basic features and geometry)
proteins/peptides; Protein geometry (alpha helix, …)
Other (PNA, L-DNA, new nucleotides, polysaccharides, block-copolymers, other supramolecular programmable molecules)
Information in unordered mixtures of small molecules
Glue
The interactions that glue molecules together
Chemical bonds and interactions (a primer in chemistry)
Intramolecular forces via covalent bonds: peptide bonds, ester bonds, delocalised aromatic bonds.
Intermolecular forces and non-covalent bonds: Van der Waals, hydrogen bond and hydrophobicity, stacking interactions, electrostatic interactions, Debye screening. (Table with relative bond strengths)
Entropic forces: depletion, steric, fractionation, crowding, polymers
MolN
Digital description of molecules.
Structure abstraction layers: Primary, Secondary, Tertiary structure
Sequence abstraction layers: domain level vs nucleotide level
Notations: Dot parens plus notation, DU+ notation, …
SMILES string for small molecules
Notions of “valid” conformations (nearest neighbor model)
Intuition behind coarse-grained representations and macrostates
MolV
Digital visualization of molecules
Intuition behind coarse-grained representations and macrostates
How can we represent molecules graphically
Intro into formal concepts for visualization.
Biomolecules
Molecules as construction material
DNA
DNA properties
data sheet to find useful parameters.
Crossover motifs (anti-parallel and parallel)
Special motifs (I motif, G-quads, aptamers, triplex, Holliday junctions)
Non-Watson-Crick-Franklin base-pairing (wobble etc)
Differences between single and double stranded DNA (could be a box) [for example different persistence lengths, inter-basepair lengths].
Biophysical influence of buffer conditions on DNA structure
RNA
RNA properties
data sheet to find useful parameters.
Special motifs (I motif, G-quads, kissing loops, aptamers, ribozymes)
Biophysical influence of buffer conditions on RNA structure and stability
Non-canonical base pairing
DSD
Dynamic processes of nucleic acids
Hybridisation
Toehold-mediated strand displacement via branch migration
4-way strand displacement
Protein
Protein biochemistry
data sheet to find useful parameters.
Enzymatic activity and binding pockets, …
Special motifs of secondary structure (alpha helices, beta sheets), tertiary structure (fold families like b-barrels etc)
Quaternary structure: Multimeric protein complexes
Biophysical influence of buffer conditions on enzyme activity and protein structure
XNA
Non-canonical polymers and interactions (LNA, PNA, XNA, etc)
Comparisons of interactions between canonical polymers (DNA/RNA, Protein/RNA binding)
Protein Non-canonical amino acids
data sheet to find useful parameters…
Biophysics
Interfacing biophysical and computational models for analysis and design.
PP
Polymer physics models for nucleic acids and proteins.
Static properties
Elastic models (FJC, WLC)
Transport phenomena (viscosity, diffusion)
MD
Molecular dynamics models.
Basics of simulation
Nucleic acid models
Protein models
NN
The thermodynamic nearest neighbor model.
single DNA/RNA molecules
multi stranded nucleic acid systems
NLKin
Nearest neighbor level stochastic simulations
Thermodynamic energy landscapes and their application (and limitations) for Gillespie-type kinetic simulations
DLKin
Models of hybridization, dissociation and branch migration.
The ad-hoc approach (based on experimental results)
Molecular Dynamics Models (e.g. oxDNA)
Secondary Structure Kinetics (with and without the nearest neighbor model, e.g. Dave Zhang 2009 and Niranjan 2013 CRN Models, Multistrand)
SeqDesign
DNA/RNA Sequence design.
Why does sequence design work? – properties of landscapes (Lenvinthal’s paradox?)
Sequence design as an artificial evolution to optimize a landscape.
Formulation of a thermodynamic objective function for multistable design.
Formulation of combinatorial WCF base-pairing objectives.
Incorporation of kinetic objectives into sequence design (possibly with reference to kinetic proofreading.)
ProteinDesign
Sequence design of proteins to fold into a specific structure
Rational Protein engineering (David Baker, U Washington )
Box: theory/tools for rational design
Directed Evolution protein engineering (Frances Arnold)
Box: Theory of directed evolution
Alphafold2 box on learned structure prediction and the protein folding grand challenge
SelfAssembly
Introduction into molecular self-assembly theory via natural examples.
NatAssemblyMac
Self-assembly of macroscopic phases
Crystals
Gels and disordered phases
Liquid-liquid phase separation
Lipid membranes and bilayers
NatAssemblyMic
Self-assembly of finite-sized structures
Assembly from distinct units (capsid, enzyme complexes, ribosome…)
Protein folding and cyclisation/looping (eg. hairpins)
NatAssemblyNoneq
Beating equilibrium in self-assembled systems
Molecular templating as a way to drive the formation of non-equilibrium structures (central dogma).
Molecular species such as ATP as a store of high free energy fuel.
Use of fuel to push assembled systems out of equilibrium (kinetic proofreading, chaperones).
Use of fuel to create dynamic non-equilibrium systems (motors, signal-processing architectures, things like microtubules which are a form of "dissipative self-assembly").
DNAstructures
Programmed molecular self-assemblies (experiments)
Origami
DNA Origami
The concept
Design principles, cooperativity
Sequence design constraints
Design and simulation tools
“Wireframe origami” examples 1D / 2D / 3D structures
Production and purification
Custom scaffold design
Thermo/stoichiometry
AssembledDNA
Scaffold-less DNA assemblies
ss Tiles,
dx Tiles
HCR
Simple polyhedra: Seeman cube, tetrahedron, Yamuna’s icosahedron, Mao’s Bucky ball and octahedron.
MultiComponent
Periodic and Multi component assemblies
Shape complementary, base stacking
Self-limiting assemblies (rings)
Fractal assemblies
Lattices, ribbons, nanotubes, and crystals
Interlocked assemblies (e.g. origami rotaxanes) -> connection to mechanics
DynamicDNA
Dynamic rearrangements of structures
DynDNAMotion
DNA tweezers, walkers, and motors
DNA tweezers, DNA Walkers, burnt-bridge motor
DynDNAMech
Nanomechanical devices
Mechanical constructs / active components/ machines / walkers (comparison with molecular motors/enzymes)
ProgrammedRNA
RNA Structures
RNA nanoparticles / Tiles
Cotranscriptional RNA-Origami
Multi-stranded RNA tiling
ProgrammedProtein
Synthetic Protein Structure
Protein Complexes (e.g. nanocage self-assembly)
LiquidDNA
Other phases of DNA structures (Physical properties)
DNA hydrogels
DNA liquids
Coacervates
SurfaceDNA
DNA-grafted structures on surfaces
Colloids
Polymer/DNA brushes (surface/colloid coatings)
Grafted nanoparticles
Computation
Introduction to Computation
Computation_intro
Intro to computation / information processing
conventionalComp
Conventional computation
CompBackground
Background: boolean logic, turing machines, register machines, analog computation nondeterministic finite automata, cellular automata
unconventionalComp
Examples of unconventional computing
naturalComp
Examples of natural computing (e.g. neural networks, gene regulatory networks)
molprogLang
Molecular programming languages
molprogCompil
Compiling molecular programs
CRN
Programming molecular behaviors over time (CRNs)
CRN_intro
Introduction
dCRN
Computing with Deterministic CRNs
Theory examples: circuits, boolean circuits, oscillators, bistability, etc.
Well-mixed CRN’s as example of analog computing
Computing functions (e.g. \(y=kx\))
Approximate majority
Dynamic system: Oscillators, bistability
Biology example: (predator-prey / ecology models)
Computational power of deterministic CRNs
CRNdistribution
Compute with distributions
sCRN
Computing with Stochastic CRNs
Theory example and analysis: min/max and boolean logic programming with CRNs
Biology example: something simple
Computational Power of Stochastic CRNs
Time complexity of stochastic CRNs
NucleicAcidCircuits
Nucleic acids as a universal substrate for molecular programming
NucleicIntro
Introduction (reference Foundations for overlapping topics)
Reference to Structures section on background: nucleic acid hybridization and thermodynamics
Background: nucleic acid branch migration
Toehold-mediated strand displacement
Experiment examples: toehold exchange reaction
StrandDisplacementCascades
Strand displacement cascades
DNAforCRNs
Theory & experiment: DNA as a universal substrate for CRNs
EarlyBooleanDSD
Boolean circuits: an early example
DSDsequenceDesign
Box: DSD sequence design strategies
ApproxMajorityTwoDomain
Approximate majority (two-domain design)
AmplifyCatalyze
Amplification and catalysis
Threshold
Thresholding
SignalRestore
Box: signal restoration comparison
Seesaw
Seesaw circuits
NucleicNeuralNetworks
Neural networks
Oscillator
Oscillators
NucleicAdvanced
Advanced designs in DNA strand displacement cascades
ToeholdActivation
Toehold activation
4wDSD
Theory & Experiment: 4wDSD circuits
NucleicLeakless
Leakless circuits
Why do circuits leak and why leaks are problematic
How to avoid leak. Examples of leakless circuits
EnzymeCircuits
Programming behavior with diverse biomolecules
EnzymeCircuitsIntro
Introduction
Background: enzymatic behaviors
PolymeraseCircuits
DNA polymerase-based circuits
PEN circuits, predator prey
APR, PER circuits (whiplash PCR?)
Shah et al. work on logic gates
TranscriptionalCircuits
Transcribed RNA-based circuits
ToeholdSwitches
Toehold switches
ConditionalCrispr
Conditional crispr
Oritatami
Oritatami
Genelets
Genelets
ProteinCircuits
Protein-based circuits
Protein-protein binding circuits
Phosphorylation based circuits
GeneCircuits
Gene circuits
GeneLogicOscillators
Logic gates and repressilator
GeneNetworks
Theory of synthetic gene regulatory networks (incl. Cell-level circuits)
References to good syn bio resources?
SmMolCircuits
Small molecule cascades
Autocatalytic reactions and applications
SpatiallyOrganizedCircuits
Spatially-Organized Circuits
SpatialIntro
Introduction: advantages of spatial structures
SpatialBackground
Background: compartmentalization in biology
SurfaceCRNs
Surface CRNs
Surface DNA circuits and DNA walkers
DropletComputing
Droplet-based computing / “synthetic compartmentalization”
ReactionDiffusion
Reaction diffusion circuits
MicrofluidicsBox
Microfluidics Breakout Box
AdvancedAssembly
Advanced topics in tile assembly (algorithmic self-assembly)
TileAssembly
Tile self-assembly
Tiling theory as a mathematical theory (Geometry, Wang tiles).
A-tam and K-tam.
Tiles and algorithmic self assembly, example algorithms
Selected models (3D, active tile-assembly, probabilistic…), the problems that led to their creation.
Error correction and proofreading strategies
WangTile
Wang tile ATAM Turing machine implementation
TileComplexity
Tile Complexity of shapes and patterns
ActiveAssembly
Nubots, amoebots, turning machines
CircuitsConclusion
Conclusion