AoMP
« Author Information
Table of Contents
ForewordForeword
HistoryIntroduction & A Brief History of Molecular Computing
FoundationsFoundations
PrimerBioBiology for Molecular Programmers
Central dogma & cell response to external stimuli
Compartmentalization & reactions?
Common DNA/RNA binding macromolecules (Box)
PrimerThermoBasic Thermodynamics
Concept of entropy, enthalpy, Gibbs energy, Chemical Equilibrium
Thermodynamic energy landscapes, partition function
PrimerODE_CRNDynamic Models ODEs and Chemical Reaction Networks
Stochastic Models
Deterministic Models
Box on rule-based systems / process algebras
MethodMolProgExperimental Methods to Program Molecules
Synthesis of DNA
Phosphoramidite (chemical synthesis)
Ligation-based, Gibson assembly
Single-stranded DNA production, e.g. asymmetric PCR, bead pulldown, phagemid production, cell production, RCA, etc
In vivo assembly, e.g. with yeast
Upcoming methods: PER, TdT-based
Scaled production: bacteriophage/cell culture/IVT+RT
Modifications/redox/thiol/maleimide, NHS, succinimide, amine, enzymatic approaches, biotinylation, fluorization, dyes and quenchers etc.
BOX/TABLE: Discussion/comparison of yield, error rates, costs, challenges (e.g. repetitive sequences, modifications) for different ways of writing DNA
Purification (HPLC/agarose and polyacrylamide electrophoresis/centrifugation)
Agarose/PAGE purification
Chromatography (HPLC, LC, maybe capillary?, ion exchange, size exclusion for origami purification, affinity like via his-tags)
Centrifugation (filter, gradient, ultracentrifugation)
Precipitation/binding methods: magnetic beads, PEG precipitation
Experimental methods to produce and modify RNA
Experimental methods to produce and modify proteins
MethodRxnSetupExperimental Methods: Reaction Setup
MethodMicroscExperimental Methods: Microscopy
Microscopy (AFM, TEM/cryoEM/SEM, fluorescence/superresolution - PAINT)
Contrast mechanisms and their drawbacks (stains being inconsistent, AFM always convolutes tip, etc)
Spectroscopy (fluorescence/CD/etc)
Table comparing different methodologies: labelled vs unlabelled, sensitivities, throughput?Background reporter signal?
Quenchers
Photobleaching (things that kill fluorophores permanently)
EXTINCTION COEFFICIENTS. basics of beer’s law. assumptions re: additivity of nucleotides. nearest neighbor model for absorption. denatured vs native.
Mass Spec
Detecting large amounts of small molecules in solution
Understand mass spectra (e.g. oligo synthesis)
MethodAnalysisExperimental Methods: Analysis
Gel electrophoresis
Intercalating dyes vs fluorophores or radiolabeling
Separation quality as a function of gel % and DNA length?
Role of buffer conditions
Pulse field gradient electrophoresis + Other advanced methods (reference)
Capillary electrophoresis
Denaturing Gels
Stacking Gels
Sequencing
Sanger sequencing
NGS - Illumina, PacBio, DNAnanoball,
Nanopore
Microarrays
ChemistryFrom molecules to variables
InfoMolsWhat are information bearing molecules?
nucleic acids (basic features and geometry)
proteins/peptides; Protein geometry (alpha helix, …)
Other (PNA, L-DNA, new nucleotides, polysaccharides, block-copolymers, other supramolecular programmable molecules)
Information in unordered mixtures of small molecules
GlueThe interactions that glue molecules together
Chemical bonds and interactions (a primer in chemistry)
Intramolecular forces via covalent bonds: peptide bonds, ester bonds, delocalised aromatic bonds.
Intermolecular forces and non-covalent bonds: Van der Waals, hydrogen bond and hydrophobicity, stacking interactions, electrostatic interactions, Debye screening. (Table with relative bond strengths)
Entropic forces: depletion, steric, fractionation, crowding, polymers
MolNDigital description of molecules.
Structure abstraction layers: Primary, Secondary, Tertiary structure
Sequence abstraction layers: domain level vs nucleotide level
Notations: Dot parens plus notation, DU+ notation, …
SMILES string for small molecules
Notions of “valid” conformations (nearest neighbor model)
Intuition behind coarse-grained representations and macrostates
MolVDigital visualization of molecules
Intuition behind coarse-grained representations and macrostates
How can we represent molecules graphically
Intro into formal concepts for visualization.
BiomoleculesMolecules as construction material
DNADNA properties
data sheet to find useful parameters.
Crossover motifs (anti-parallel and parallel)
Special motifs (I motif, G-quads, aptamers, triplex, Holliday junctions)
Non-Watson-Crick-Franklin base-pairing (wobble etc)
Differences between single and double stranded DNA (could be a box) [for example different persistence lengths, inter-basepair lengths].
Biophysical influence of buffer conditions on DNA structure
RNARNA properties
data sheet to find useful parameters.
Special motifs (I motif, G-quads, kissing loops, aptamers, ribozymes)
Biophysical influence of buffer conditions on RNA structure and stability
Non-canonical base pairing
DSDDynamic processes of nucleic acids
Hybridisation
Toehold-mediated strand displacement via branch migration
4-way strand displacement
ProteinProtein biochemistry
data sheet to find useful parameters.
Enzymatic activity and binding pockets, …
Special motifs of secondary structure (alpha helices, beta sheets), tertiary structure (fold families like b-barrels etc)
Quaternary structure: Multimeric protein complexes
Biophysical influence of buffer conditions on enzyme activity and protein structure
XNANon-canonical polymers and interactions (LNA, PNA, XNA, etc)
Comparisons of interactions between canonical polymers (DNA/RNA, Protein/RNA binding)
Protein Non-canonical amino acids
data sheet to find useful parameters…
BiophysicsInterfacing biophysical and computational models for analysis and design.
PPPolymer physics models for nucleic acids and proteins.
Static properties
Elastic models (FJC, WLC)
Transport phenomena (viscosity, diffusion)
MDMolecular dynamics models.
Basics of simulation
Nucleic acid models
Protein models
NNThe thermodynamic nearest neighbor model.
single DNA/RNA molecules
multi stranded nucleic acid systems
NLKinNearest neighbor level stochastic simulations
Thermodynamic energy landscapes and their application (and limitations) for Gillespie-type kinetic simulations
DLKinModels of hybridization, dissociation and branch migration.
The ad-hoc approach (based on experimental results)
Molecular Dynamics Models (e.g. oxDNA)
Secondary Structure Kinetics (with and without the nearest neighbor model, e.g. Dave Zhang 2009 and Niranjan 2013 CRN Models, Multistrand)
SeqDesignDNA/RNA Sequence design.
Why does sequence design work? – properties of landscapes (Lenvinthal’s paradox?)
Sequence design as an artificial evolution to optimize a landscape.
Formulation of a thermodynamic objective function for multistable design.
Formulation of combinatorial WCF base-pairing objectives.
Incorporation of kinetic objectives into sequence design (possibly with reference to kinetic proofreading.)
ProteinDesignSequence design of proteins to fold into a specific structure
Rational Protein engineering (David Baker, U Washington )
Box: theory/tools for rational design
Directed Evolution protein engineering (Frances Arnold)
Box: Theory of directed evolution
Alphafold2 box on learned structure prediction and the protein folding grand challenge
SelfAssemblyIntroduction into molecular self-assembly theory via natural examples.
NatAssemblyMacSelf-assembly of macroscopic phases
Crystals
Gels and disordered phases
Liquid-liquid phase separation
Lipid membranes and bilayers
NatAssemblyMicSelf-assembly of finite-sized structures
Assembly from distinct units (capsid, enzyme complexes, ribosome…)
Protein folding and cyclisation/looping (eg. hairpins)
NatAssemblyNoneqBeating equilibrium in self-assembled systems
Molecular templating as a way to drive the formation of non-equilibrium structures (central dogma).
Molecular species such as ATP as a store of high free energy fuel.
Use of fuel to push assembled systems out of equilibrium (kinetic proofreading, chaperones).
Use of fuel to create dynamic non-equilibrium systems (motors, signal-processing architectures, things like microtubules which are a form of "dissipative self-assembly").
DNAstructuresProgrammed molecular self-assemblies (experiments)
OrigamiDNA Origami
The concept
Design principles, cooperativity
Sequence design constraints
Design and simulation tools
“Wireframe origami” examples 1D / 2D / 3D structures
Production and purification
Custom scaffold design
Thermo/stoichiometry
AssembledDNAScaffold-less DNA assemblies
ss Tiles,
dx Tiles
HCR
Simple polyhedra: Seeman cube, tetrahedron, Yamuna’s icosahedron, Mao’s Bucky ball and octahedron.
MultiComponentPeriodic and Multi component assemblies
Shape complementary, base stacking
Self-limiting assemblies (rings)
Fractal assemblies
Lattices, ribbons, nanotubes, and crystals
Interlocked assemblies (e.g. origami rotaxanes) -> connection to mechanics
DynamicDNADynamic rearrangements of structures
DynDNAMotionDNA tweezers, walkers, and motors
DNA tweezers, DNA Walkers, burnt-bridge motor
DynDNAMechNanomechanical devices
Mechanical constructs / active components/ machines / walkers (comparison with molecular motors/enzymes)
ProgrammedRNARNA Structures
RNA nanoparticles / Tiles
Cotranscriptional RNA-Origami
Multi-stranded RNA tiling
ProgrammedProteinSynthetic Protein Structure
Protein Complexes (e.g. nanocage self-assembly)
LiquidDNAOther phases of DNA structures (Physical properties)
DNA hydrogels
DNA liquids
Coacervates
SurfaceDNADNA-grafted structures on surfaces
Colloids
Polymer/DNA brushes (surface/colloid coatings)
Grafted nanoparticles
ComputationIntroduction to Computation
Computation_introIntro to computation / information processing
conventionalCompConventional computation
CompBackgroundBackground: boolean logic, turing machines, register machines, analog computation nondeterministic finite automata, cellular automata
unconventionalCompExamples of unconventional computing
naturalCompExamples of natural computing (e.g. neural networks, gene regulatory networks)
molprogLangMolecular programming languages
molprogCompilCompiling molecular programs
CRNProgramming molecular behaviors over time (CRNs)
CRN_introIntroduction
dCRNComputing with Deterministic CRNs
Theory examples: circuits, boolean circuits, oscillators, bistability, etc.
Well-mixed CRN’s as example of analog computing
Computing functions (e.g. \(y=kx\))
Approximate majority
Dynamic system: Oscillators, bistability
Biology example: (predator-prey / ecology models)
Computational power of deterministic CRNs
CRNdistributionCompute with distributions
sCRNComputing with Stochastic CRNs
Theory example and analysis: min/max and boolean logic programming with CRNs
Biology example: something simple
Computational Power of Stochastic CRNs
Time complexity of stochastic CRNs
NucleicAcidCircuitsNucleic acids as a universal substrate for molecular programming
NucleicIntroIntroduction (reference Foundations for overlapping topics)
Reference to Structures section on background: nucleic acid hybridization and thermodynamics
Background: nucleic acid branch migration
Toehold-mediated strand displacement
Experiment examples: toehold exchange reaction
StrandDisplacementCascadesStrand displacement cascades
DNAforCRNsTheory & experiment: DNA as a universal substrate for CRNs
EarlyBooleanDSDBoolean circuits: an early example
DSDsequenceDesignBox: DSD sequence design strategies
ApproxMajorityTwoDomainApproximate majority (two-domain design)
AmplifyCatalyzeAmplification and catalysis
ThresholdThresholding
SignalRestoreBox: signal restoration comparison
SeesawSeesaw circuits
NucleicNeuralNetworksNeural networks
OscillatorOscillators
NucleicAdvancedAdvanced designs in DNA strand displacement cascades
ToeholdActivationToehold activation
4wDSDTheory & Experiment: 4wDSD circuits
NucleicLeaklessLeakless circuits
Why do circuits leak and why leaks are problematic
How to avoid leak. Examples of leakless circuits
EnzymeCircuitsProgramming behavior with diverse biomolecules
EnzymeCircuitsIntroIntroduction
Background: enzymatic behaviors
PolymeraseCircuitsDNA polymerase-based circuits
PEN circuits, predator prey
APR, PER circuits (whiplash PCR?)
Shah et al. work on logic gates
TranscriptionalCircuitsTranscribed RNA-based circuits
ToeholdSwitchesToehold switches
ConditionalCrisprConditional crispr
OritatamiOritatami
GeneletsGenelets
ProteinCircuitsProtein-based circuits
Protein-protein binding circuits
Phosphorylation based circuits
GeneCircuitsGene circuits
GeneLogicOscillatorsLogic gates and repressilator
GeneNetworksTheory of synthetic gene regulatory networks (incl. Cell-level circuits)
References to good syn bio resources?
SmMolCircuitsSmall molecule cascades
Autocatalytic reactions and applications
SpatiallyOrganizedCircuitsSpatially-Organized Circuits
SpatialIntroIntroduction: advantages of spatial structures
SpatialBackgroundBackground: compartmentalization in biology
SurfaceCRNsSurface CRNs
Surface DNA circuits and DNA walkers
DropletComputingDroplet-based computing / “synthetic compartmentalization”
ReactionDiffusionReaction diffusion circuits
MicrofluidicsBoxMicrofluidics Breakout Box
AdvancedAssemblyAdvanced topics in tile assembly (algorithmic self-assembly)
TileAssemblyTile self-assembly
Tiling theory as a mathematical theory (Geometry, Wang tiles).
A-tam and K-tam.
Tiles and algorithmic self assembly, example algorithms
Selected models (3D, active tile-assembly, probabilistic…), the problems that led to their creation.
Error correction and proofreading strategies
WangTileWang tile ATAM Turing machine implementation
TileComplexityTile Complexity of shapes and patterns
ActiveAssemblyNubots, amoebots, turning machines
CircuitsConclusionConclusion