Detecting analytes using proximity-induced tagmentation, strand invasion, restriction, or ligation is provided herein. In some examples, detecting an analyte includes coupling a donor recognition probe to a first portion of the analyte. The donor recognition probe includes a first recognition element specific to the first portion of the analyte, a first oligonucleotide corresponding to the first portion, and a transposase coupled to the first recognition element and the first oligonucleotide. An acceptor recognition probe is coupled to a second portion of the analyte. The acceptor recognition probe includes a second recognition element specific to the second portion of the analyte and a second oligonucleotide coupled to the second recognition element and corresponding to the second portion. The transposase is used to generate a reporter polynucleotide including the first and second oligonucleotides. The analyte is detected based on the reporter including comprising the first and second oligonucleotides.
This disclosure describes methods, non-transitory computer readable media, and systems that can use a computationally efficient model to determine a corrected methylation-level value for a specific sample nucleotide sequence. For instance, the disclosed systems determine a false positive rate and a false negative rate at which a given methylation sequencing assay converts cytosine bases. Based on the determined false positive rate and false negative rate, the disclosed systems determine a corrected methylation-level value that corrects for a bias of the given methylation sequencing assay.
This disclosure describes methods, non-transitory computer readable media, and systems that can generate genotype calls from a combined pipeline for processing nucleotide reads from multiple read types/sources for robust, accurate genotype calls. For example, the disclosed systems can train and/or utilize a genotype-call-integration machine-learning model to generate predictions for genotype calls based on data associated with a first type of nucleotide reads (e.g., short reads) and a second type of nucleotide reads (e.g., long reads). As disclosed, the disclosed systems can determine sequencing metrics and can utilize a genotype-call-integration machine-learning model to generate predictions (e.g., genotype probabilities, variant call classifications) for generating output genotype calls based on the sequencing metrics. The disclosed system can utilize multiple such genotype-call-integration machine-learning models to generate genotype calls for different variant types, such as SNPs and indels, where the genotype-call-integration machine-learning models generate different predictions for each variant type.
An example of a flow cell includes a substrate and a reaction area defined in or over the substrate. The reaction area includes two angularly offset and non-perpendicular surfaces relative to a planar surface of the substrate, a polymeric hydrogel positioned over at least a portion of each of the two angularly offset and non-perpendicular surfaces; a first primer set attached to the polymeric hydrogel that is positioned over the portion of a first of the two angularly offset and non-perpendicular surfaces; and a second primer set attached to the polymeric hydrogel that is positioned over the portion of a second of the two angularly offset and non-perpendicular surfaces, wherein the first and second primer sets are orthogonal.
This disclosure relates to novel amplification compositions and methods, in particular for use in nucleic acid amplification and sequencing, preferably that do not involve reagents that are thermophilic.
This disclosure relates to novel thermophilic amplification compositions and methods, in particular for use in nucleic acid amplification and sequencing.
Gasket assemblies and related system and methods. An apparatus includes a system, a flow cell, and a plurality of gasket assemblies. The system includes a flow cell interface and the flow cell has one or more channels. Each channel has a first channel opening and a second channel opening. The first channel openings are positioned at a first end of the flow cell and the second channel openings are positioned at a second end of the flow cell. A gasket assembly coupled at each second channel opening. Each gasket assembly includes an adhesive stack and a gasket. The adhesive stack includes a first side bonded to the gasket and a second side bonded to the flow cell. The flow cell interface is engagable with the corresponding gaskets to establish a fluidic coupling between system and the flow cell.
The motion of a mechanical stage may be directed in x-, y-, and/or z-dimensions such that excitation of a resonant frequency f is reduced. In particular, once a resonant frequency f is identified, the acceleration of the stage in the x-, y-, and/or z-dimensions may divided into an even number of acceleration segments or intervals, with the second of each pair of acceleration segments starting 1/(2f) seconds after the start of the initial acceleration segment. The acceleration intervals may be defined by a start time, an amplitude profile, and/or a time duration. In some implementations, the amplitude and time duration of each acceleration pulse may be different. The amplitude and time duration of acceleration steps may be determined and adjusted to compensate for the particular resonance frequency of an individual system, and programmed into a controller for the stage using motor programming controls.
Liquid reservoirs, cartridge assemblies and related systems and methods are disclosed. An example implementation includes an apparatus that includes a body, a cover, and a lid assembly. The body includes a top surface and a storage chamber having an opening at the top surface. The cover covers or is positioned within the opening of the storage chamber. The lid assembly is coupled to the top surface and covers the opening of the storage chamber. The top surface and the first portion define a plenum. The cover is at least one of piercable, breakable, or movable to allow the storage chamber to be fluidly coupled to the plenum without venting the plenum to atmosphere.
This disclosure describes methods, non-transitory computer readable media, and systems that can utilize a machine-learning model to refine structural variant calls of a call generation model. For example, the disclosed systems can train and utilize a structural variant refinement machine-learning model to reduce false positives and/or false negatives. Indeed, the disclosed systems can improve or refine structural variant calls (e.g., between 50-200 base pairs in length) determined by a call generation model by training and utilizing the structural variant refinement machine-learning model. As disclosed, the systems can determine sequencing metrics and can customize training data for a structural variant refinement machine-learning model to generate modified structural variant calls.
Genome-wide association studies may allow for detection of variants that are statistically significantly associated with disease risk. However, inferring which are the genes underlying these variant associations may be difficult. The presently disclosed approaches utilize machine learning techniques to predict genes from genome-wide association study summary statistics that substantially improves causal gene identification in terms of both precision and recall compared to other techniques.
Presented herein are techniques for indexing of nucleic acid, e.g., for use in conjunction with sequencing. The techniques include generating indexed nucleic acid fragments from an individual sample, whereby the index sequence incorporated into each index site of the nucleic acid fragment is selected from a plurality of distinguishable of index sequences and such that the population of generated nucleic acid fragments represents each index sequence from the plurality. In this manner, the generated indexed nucleic acid fragments from a single sample are indexed with a diverse mix of index sequences that reduce misassignment due to index read errors associated with low sequence diversity.
The disclosure relates to methods, compositions, and kits for improving seeding efficiency of flow cells with polynucleotides, and applications thereof, including for sequencing.
Reusable flow cells for sequencing which exhibit signal intensity retention over numerous use cycles, the active surface of which contains poly-azide functional moieties, methods of treating flow cells surfaces with reagents to provide such poly-azide functional moieties, and reagents therefor.
C12Q 1/6874 - Methods for sequencing involving nucleic acid arrays, e.g. sequencing by hybridisation [SBH]
C07D 207/46 - Heterocyclic compounds containing five-membered rings not condensed with other rings, with one nitrogen atom as the only ring hetero atom with hetero atoms directly attached to the ring nitrogen atom
C12Q 1/6806 - Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
15.
NANOGEL PARTICLES HAVING DUAL FUNCTIONALITY AND TEMPERATURE RESPONSIVENESS FOR PARTICLE CLUSTERING IN NUCLEIC ACID SEQUENCING SYSTEMS
In some examples, novel nanogel particles are described having dual functionality, temperature responsiveness and pH responsiveness. For nucleic acid sequencing, amplification primers are grafted to nanogel particles to form primer-grafted nanogel particles, and the primer-grafted nanogel particles are captured onto surfaces within a flow cell. Within flow cells such as used in SBS nucleic acid sequencing, each primer-grafted nanogel particle functions as a nano-well in the flow cell, thus eliminating the need for nano-wells in some examples.
A microarray is designed to capture one or more molecules of interest at each of a plurality of sites on a substrate. The sites comprise base pads, such as polymer base pads, that promote the attachment of the molecules at the sites. The microarray may be made by one or more patterning techniques to create a layout of base pads in a desired pattern. Further, the microarrays may include features to encourage clonality at the sites.
C40B 50/18 - Solid phase synthesis, i.e. wherein one or more library building blocks are bound to a solid support during library creation; Particular methods of cleavage from the solid support using a particular method of attachment to the solid support
17.
BIOSENSORS FOR BIOLOGICAL OR CHEMICAL ANALYSIS AND SYSTEMS AND METHODS FOR SAME
A biosensor is provided including a detection device and a flow cell mounted to the detection device. The detection device has a detector surface with a plurality of reaction sites. The detection device also includes a filter layer. A method is providing including obtaining signal data from an array of light detectors; determining a crosstalk function for each of the light detectors of the array of light detectors; and determining characteristics of analytes of interest based on the signal data using the crosstalk functions.
The present invention provides a novel approach for storing, analyzing, and/or accessing biological data in a cloud computing environment. Sequence data generated by a particular sequencing device may be uploaded to the cloud computing environment during a sequencing run, which reduces the on-site storage needs for the sequence data. Analysis of the data may also be performed in the cloud computing environment, and the instructions for such analysis may be set at the originating sequencing device. The sequence data in the cloud computing environment may be shared according to permissions. Further, the sequence data may be modified or annotated by authorized secondary users.
G16H 40/67 - ICT specially adapted for the management or administration of healthcare resources or facilities; ICT specially adapted for the management or operation of medical equipment or devices for the operation of medical equipment or devices for remote operation
G06F 21/62 - Protecting access to data via a platform, e.g. using keys or access control rules
G16B 50/30 - Data warehousing; Computing architectures
G16H 40/40 - ICT specially adapted for the management or administration of healthcare resources or facilities; ICT specially adapted for the management or operation of medical equipment or devices for the management of medical equipment or devices, e.g. scheduling maintenance or upgrades
G16H 40/63 - ICT specially adapted for the management or administration of healthcare resources or facilities; ICT specially adapted for the management or operation of medical equipment or devices for the operation of medical equipment or devices for local operation
Sequencing systems and methods are provided that include a nanopore well that includes a cis well associated with a cis electrode and a trans well associated with a trans electrode, a membrane separating the cis well and the trans well, and a nanopore well embedded in the membrane providing a channel through the membrane; a command node connected directly to the nanopore well. The command node is configured to apply a potential across the nanopore well and a command pulse. The system further includes an amplifier with a feedback loop coupled to the nanopore well and a switch disposed between the amplifier and the nanopore well. The switch is driven by a clock pulse and configured to ground an inverting input of the amplifier.
The present disclosure relates to systems, non-transitory computer-readable media, and methods for generating a target-variant-reference panel comprising a target-variant position with target-variant indicators or using the target-variant-reference panel to impute a genotype call for the corresponding target variant. In particular, in one or more embodiments, the disclosed systems generate an initial reference panel including a variety of phased genomic samples of different haplotypes. The disclosed systems further add a target-variant position to the initial reference panel to indicate a presence or absence of a target variant, thereby creating a target-variant-reference panel comprising a target-variant position with target-variant indicators. Additionally or alternatively, the disclosed systems can utilize the target-variant-reference panel to impute genotype calls indicating a presence or absence of a target variant within a target genomic sample based on a comparison of (i) haplotypes represented in the target-variant-reference panel and (ii) nucleotide reads corresponding to the target genomic sample.
A method of processing sequence data comprising a known location of the start of a copy number variant breakpoint to generate a prediction for the location of the end of the copy number variant breakpoint. The method comprises an encoder and a copy number variation (CNV) caller guide. The encoder processes an anchor sequence and corresponding subject candidate sequence to generate a learned representation of the anchor sequence and a learned representation of the corresponding subject candidate sequence. The CNV caller guide determines a similarity between the learned representation of the anchor sequence and a learned representation of the corresponding subject candidate sequence. Similarity between anchor sequence and subject candidate sequence is used as a proxy for likelihood that the end of the CNV breakpoint is located on the subject candidate sequence.
G16B 40/00 - ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
An apparatus includes a flow cell, an imaging assembly, and a processor. The flow cell includes a channel and a plurality of reaction sites. The imaging assembly is operable to receive light emitted from the reaction sites in response to an excitation light. The processor is configured to drive relative movement between at least a portion of the imaging assembly and the flow cell along a continuous range of motion to thereby enable the imaging assembly to capture images along the length of the channel. The processor is also configured to activate the imaging assembly to capture one or more calibration images of one or more calibration regions of the channel, during a first portion of the continuous range of motion. The processor is also configured to activate the imaging assembly to capture images of the reaction sites during a second portion of the continuous range of motion.
Some implementations of the disclosure relate to an imaging system including one or more image sensors and a Z-stage. The imaging system is configured to perform operations including: capturing, using the one or more image sensors, a first image of a first pair of spots projected at a first sample location of a sample; determining whether or not the first image of the first pair of spots is valid; and when the first image is determined to be valid: obtaining, based on the first image, a current separation distance measurement of the first pair of spots; and controlling, based at least on the current separation distance measurement, the Z-stage to focus the imaging system at the first sample location.
The presently described techniques relate generally to providing motion feedback (e.g., motion system calibration and/or sample alignment) in the context of an imaging system (such as a time delay and integration (TDI) based imaging system). The architecture and techniques discussed may achieve nanoscale control and calibration of a movement feedback system without a high-resolution encoder subsystem or, in the alternative embodiments, with a lower resolution (and correspondingly less expensive) encoder subsystem than might otherwise be employed. By way of example, certain embodiments described herein relate to ascertaining or calibrating linear motion of a sample holder surface using nanoscale features (e.g., sample sites or nanowells or lithographically patterned features) provided on a surface of the sample holder.
An example flow cell includes a patterned substrate having an active region and a bonding region that at least partially surrounds the active region. The active region includes first depressions defined in a layer of the patterned substrate, surface chemistry positioned in the first depressions, and first interstitial regions surrounding the first depressions. The bonding region includes second depressions defined in the layer and second interstitial regions surrounding the second depressions. An adhesive is positioned over the second depressions and over the second interstitial regions. A cover is attached to the adhesive such that a flow channel is defined between a portion of the cover and the active region.
A method includes flowing an incorporation reagent through a reagent management system and a flow cell of an instrument. The flow cell having a first polynucleotide positioned therein. The incorporation reagent adding a first base onto a sequence of bases. The sequence of bases includes a second polynucleotide complementary to the first polynucleotide. An image of an identification signal emanating from the first base is captured after the first base has been added onto the second polynucleotide. A cleavage reagent is flowed through the reagent management system and flow cell to remove a first terminator from the first base in order to enable a subsequent base in the sequence of bases to be added to the second polynucleotide. A buffer reagent is flowed through the reagent management system and flow cell in a plurality of cycles of consecutive forward and reverse flow directions.
A system and method for imaging biological samples on multiple surfaces of a support structure are disclosed. The support structure may be a flow cell through which a reagent fluid is allowed to flow and interact with the biological samples. Excitation radiation from at least one radiation source may be used to excite the biological samples on multiple surfaces. In this manner, fluorescent emission radiation may be generated from the biological samples and subsequently captured and detected by detection optics and at least one detector. The detected fluorescent emission radiation may then be used to generate image data. This imaging of multiple surfaces may be accomplished either sequentially or simultaneously. In addition, the techniques of the present invention may be used with any type of imaging system. For instance, both epifluorescent and total internal reflection methods may benefit from the techniques of the present invention.
Systems, methods, and apparatus are described herein for performing sequencing of one or more biological samples in at least two flow cells on a sequencing device. A sequencing system may comprise one or more of a scheduling engine, the sequencing device, and a display. The scheduling engine may maintain scheduling information of a state of compute resources and non-compute resources. The sequencing device may receive the scheduling information from the scheduling engine; determine the state of the compute resources and non-compute resources; determine a sequencing analysis priority associated with performing analysis of the at least two flow cells on the sequencing device; and perform the sequencing task related to the one or more biological samples in the at least two flow cells according to the sequencing analysis priority. The display may display real-time feedback associated with completion of the sequencing task for each flow cell.
In one aspect, the disclosed technology relates to systems and methods for sequencing polynucleotides. In one embodiment, the disclosed technology relates to a nanopore sensor device for identifying nucleotides, the nanopore sensor device including: one or more cis wells; one or more cis electrodes associated with the one or more cis wells; a plurality of trans wells, each of the plurality of trans wells separated from the one or more cis wells by a lipid or solid-state membrane having a nanopore; a plurality of field effect transistors (FETs), each of the plurality of FETs associated with one of the plurality of trans wells; an electrical source configured to provide alternating current (AC) inputs between the one or more cis electrodes and the source terminals of the plurality of FETs; and a controller operably coupled to the plurality of FETs, the controller configured to measure AC responses of the plurality of FETs, wherein the AC responses depend on the identities of the nucleotides within or near the nanopores.
Versions of a sequencing system may be monitored to enable changing of a version of a server subsystem operating the sequencing system to service requests from client subsystems for performing analysis of sequencing data. A monitor subsystem may be utilized for receiving and authorizing requests from client subsystems. The monitor subsystem may identify a version associated with a server subsystem operating the sequencing system to be implemented for servicing the request. The monitor subsystem may allow the server subsystem to be accessed for servicing the request from the client subsystem when the version associated with the client subsystem is compatible with the version associated with the server subsystem. The monitor subsystem may prevent the server subsystem from being accessed when the version associated with the client subsystem is incompatible with the version associated with the server subsystem.
H04L 67/00 - Network arrangements or protocols for supporting network services or applications
G06F 8/71 - Version control ; Configuration management
H04L 41/082 - Configuration setting characterised by the conditions triggering a change of settings the condition being updates or upgrades of network functionality
There is set forth herein a device comprising structure defining a detector surface configured for supporting biological or chemical substances, and a sensor array comprising light sensors and circuitry to transmit data signals using photons detected by the light sensors. The device can include one or more features for reducing fluorescence range noise in a detection band of the sensor array.
The disclosed embodiments concern methods, apparatus, systems and computer program products for determining sequences of interest using unique molecular index (UMI) sequences that are uniquely associable with individual polynucleotide fragments, including sequences with low allele frequencies and long sequence length. In some implementations, the UMIs include both physical UMIs and virtual UMIs. In some implementations, the unique molecular index sequences include non-random sequences. System, apparatus, and computer program products are also provided for determining a sequence of interest implementing the methods disclosed.
Barriers including crosslinked amphiphilic molecules, and methods of making the same, are provided herein. In some examples, a barrier between first and second fluids includes at least one layer comprising a plurality of amphiphilic molecules. Amphiphilic molecules of the plurality of amphiphilic molecules are crosslinked to one another.
C07K 14/195 - Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
C08F 293/00 - Macromolecular compounds obtained by polymerisation on to a macromolecule having groups capable of inducing the formation of new polymer chains bound exclusively at one or both ends of the starting macromolecule
37.
METHODS AND COMPOSITIONS FOR SELECTIVE CLEAVAGE OF NUCLEIC ACIDS WITH RECOMBINANT NUCLEASES
Some embodiments of the methods and compositions provided herein relate to the selective cleavage of a target nucleic acid. Some such embodiments include the selective cleavage of a target nucleic acid that is associated with a DNA-binding protein or comprises a methylated CpG island, with a recombinant nuclease. In some embodiments, the DNA-binding protein comprises a chromatin protein. Some embodiments also include the enrichment of non-target nucleic acids in a sample by selective cleavage of target nucleic acids in the sample, and removal of the cleaved target nucleic acids from the sample.
C12Q 1/6806 - Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
C12N 15/66 - General methods for inserting a gene into a vector to form a recombinant vector using cleavage and ligation; Use of non-functional linkers or adaptors, e.g. linkers containing the sequence for a restriction endonuclease
A method for spatially tagging nucleic acids of a biological specimen, including steps of (a) providing a solid support comprising different nucleic acid probes that are randomly located on the solid support, wherein the different nucleic acid probes each includes a barcode sequence that differs from the barcode sequence of other randomly located probes on the solid support; (b) performing a nucleic acid detection reaction on the solid support to locate the barcode sequences on the solid support; (c) contacting a biological specimen with the solid support that has the randomly located probes; (d) hybridizing the randomly located probes to target nucleic acids from portions of the biological specimen; and (e) modifying the randomly located probes that are hybridized to the target nucleic acids, thereby producing modified probes that include the barcode sequences and a target specific modification, thereby spatially tagging the nucleic acids of the biological specimen.
The technology disclosed assigns quality scores to bases called by a neural network-based base caller by (i) quantizing classification scores of predicted base calls produced by the neural network-based base caller in response to processing training data during training, (ii) selecting a set of quantized classification scores, (iii) for each quantized classification score in the set, determining a base calling error rate by comparing its predicted base calls to corresponding ground truth base calls, (iv) determining a fit between the quantized classification scores and their base calling error rates, and (v) correlating the quality scores to the quantized classification scores based on the fit.
G06F 18/23211 - Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with adaptive number of clusters
G06F 18/2415 - Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
G06N 3/084 - Backpropagation, e.g. using gradient descent
G06N 7/01 - Probabilistic graphical models, e.g. probabilistic networks
G06V 10/44 - Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
G06V 10/75 - Image or video pattern matching; Proximity measures in feature spaces using context analysis; Selection of dictionaries
G06V 10/762 - Arrangements for image or video recognition or understanding using pattern recognition or machine learning using clustering, e.g. of similar faces in social networks
G06V 10/764 - Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
G06V 10/77 - Arrangements for image or video recognition or understanding using pattern recognition or machine learning using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
G06V 10/778 - Active pattern-learning, e.g. online learning of image or video features
G06V 10/82 - Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
G06V 10/98 - Detection or correction of errors, e.g. by rescanning the pattern or by human intervention; Evaluation of the quality of the acquired patterns
G16B 40/00 - ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
Methods, systems, and computer programs for compressing nucleic acid sequence data. A method can include obtaining nucleic acid sequence data representing: (i) a read sequence, and (ii) a plurality of quality scores, determining whether the read sequence includes at least one “N” base, based on a determination that the read sequence includes at least one “N” base, generating, by one or more computers, a first encoding data set by using a first encoding process to encode each set of four quality scores of the read sequence into a single byte of memory, and using a second encoding process to encode the first encoded data set, thereby compressing the data to be compressed.
Methods, systems, and apparatuses, including computer programs for generating and using a hash table configured to improve mapping of reads are disclosed that include obtaining a first seed of K nucleotides from a reference sequence, generating a seed extension tree having a nodes, wherein each node of the nodes corresponds to (i) an extended seed that is an extension of the first seed and has a nucleotide length of K* and (ii) one or more locations, in a seed extension table, that include data describing reference sequence locations that match the extended seed, and for each node: storing interval information at a location of the hash table that corresponds to an index key for the extended seed, wherein the interval information references one or more locations in the seed extension table that include reference sequence locations that match the extended seed associated with the node.
Methods are used for obtaining, cataloguing, and/or storing data derived from a biological source using a flow cell body, electrodes, and an imaging assembly. The data may include DNA and/or RNA obtained from a biological source, such as from the cells of an organism. The methods may be used to obtain, catalog, and/or store data such as DNA or RNA sequence from a pathogen such as a virus and/or a bacteria, human health data over time, and immune system information from an individual. The data obtained using the disclosed methods may be used for a variety of different purposes, including the manufacture of vaccine compositions, and for restoring the immune system of an individual who has undergone an immune system depleting event. The methods may be used for storage of biological cells, which may be used for the screening of compounds, such as small molecules with potential for therapeutic indications.
There is set forth herein a light energy exciter that can include one or more light sources. A light energy exciter can emit excitation light directed toward a detector surface that can support biological or chemical samples.
An integrated detection, flow cell and photonics (DFP) device is provided that comprises a substrate having an array of pixel elements that sense photons during active periods. The substrate and pixel elements form an IC photon detection layer. At least one wave guide is formed on the IC photo detection layer as a photonics layer. An optical isolation layer is formed over at least a portion of the wave guide. A collection of photo resist (PR) walls patterned to define at least one flow cell channel that is configured to direct fluid along a fluid flow path. The wave guides align to extend along the fluid flow path. The flow cell channel is configured to receive samples at sample sites that align with the array of pixel elements.
G01N 21/77 - Systems in which material is subjected to a chemical reaction, the progress or the result of the reaction being investigated by observing the effect on a chemical indicator
H01L 31/107 - Devices sensitive to infrared, visible or ultraviolet radiation characterised by only one potential barrier or surface barrier the potential barrier working in avalanche mode, e.g. avalanche photodiode
G01S 7/4863 - Detector arrays, e.g. charge-transfer gates
H01L 31/055 - Optical elements directly associated or integrated with the PV cell, e.g. light-reflecting means or light-concentrating means where light is absorbed and re-emitted at a different wavelength by the optical element directly associated or integrated with the PV cell, e.g. by using luminescent material, fluorescent concentrators or up-conversion arrangements
G01S 7/4865 - Time delay measurement, e.g. time-of-flight measurement, time of arrival measurement or determining the exact position of a peak
B01L 3/00 - Containers or dishes for laboratory use, e.g. laboratory glassware; Droppers
The technology disclosed relates to splice site prediction and aberrant splicing detection. In particular, it relates to a splice site predictor that includes a convolutional neural network trained on training examples of donor splice sites, acceptor splice sites, and non-splicing sites. An input stage of the convolutional neural network feeds an input sequence of nucleotides for evaluation of target nucleotides in the input sequence. An output stage of the convolutional neural network translates analysis by the convolutional neural network into classification scores for likelihoods that each of the target nucleotides is a donor splice site, an acceptor splice site, and a non-splicing site.
G16B 30/00 - ICT specially adapted for sequence analysis involving nucleotides or amino acids
G06N 3/04 - Architecture, e.g. interconnection topology
G16B 40/00 - ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
The technology disclosed relates to artificial intelligence-based base calling. The technology disclosed relates to accessing a progression of per-cycle analyte channel sets generated for sequencing cycles of a sequencing run, processing, through a neural network-based base caller (NNBC), windows of per-cycle analyte channel sets in the progression for the windows of sequencing cycles of the sequencing run such that the NNBC processes a subject window of per-cycle analyte channel sets in the progression for the subject window of sequencing cycles of the sequencing run and generates provisional base call predictions for three or more sequencing cycles in the subject window of sequencing cycles, from multiple windows in which a particular sequencing cycle appeared at different positions, using the NNBC to generate provisional base call predictions for the particular sequencing cycle, and determining a base call for the particular sequencing cycle based on the plurality of base call predictions.
Non-contact dispensers and related systems and methods are disclosed. In accordance with an implementation, an apparatus includes a pump having a body that defines an inlet, an outlet, and a flow path fluidly coupling the inlet and the outlet. A first displacement member is movable from a first position to a second position within the flow path to urge a first volume of the fluid out of the outlet. A second displacement member is movable from a first position to a second position within the flow path to urge a second volume of the fluid out of the outlet.
Actuation systems and methods are disclosed. An apparatus includes a system including a flow cell receptacle and a valve drive assembly including a shape memory alloy actuator including a pair of shape memory alloy wires and a flow cell disposable within the flow cell receptacle and having a membrane valve. The system actuates the membrane valve, via the shape memory alloy actuator, by causing a voltage to be applied to a first one of the shape memory alloy wires and the system not applying the voltage to a second one of the shape memory alloy wires.
F03G 7/06 - Mechanical-power-producing mechanisms, not otherwise provided for or using energy sources not otherwise provided for using expansion or contraction of bodies due to heating, cooling, moistening, drying, or the like
49.
NUCLEIC ACID SEQUENCE ANALYSIS AND CONFIGURABLE REPORT GENERATION
The presently described techniques relate generally to configuration and use of a software platform that provides tools for users to store, arrange, and visualize genetic data, such as may be derived from a nucleic acid sequencing device. In addition, such a software platform may include one or more tools that allow a user to annotate genetic data with information available from external and/or internal genetic databases and to create custom reports based on such information. In practice, the software platform may be generic with respect to the sequencing device generating the sequence data, one or more upstream analytic packages, such as may perform variant identification or calling, and one or more external or internal data stores (e.g., knowledge bases or databases) used to access information about the sequence and/or variants identified therein.
The presently described techniques relate generally to providing motion feedback (e.g., motion system calibration and/or sample alignment) in the context of an imaging system (such as a time delay and integration (TDI) based imaging system). The architecture and techniques discussed may achieve nanoscale control and calibration of a movement feedback system without a high-resolution encoder subsystem or, in the alternative embodiments, with a lower resolution (and correspondingly less expensive) encoder subsystem than might otherwise be employed. By way of example, certain embodiments described herein relate to ascertaining or calibrating linear motion of a sample holder surface using nanoscale features (e.g., sample sites or nanowells or lithographically patterned features) provided on a surface of the sample holder.
This disclosure describes methods, non-transitory-computer readable media, and systems that can use a single executable file to run a single-cell multiomics analysis that (i) aligns multiomics reads with a reference genome and (ii) jointly filters cellular barcode sequences for cells based on feature-specific, single-cell read counts. To run such an assay, the disclosed systems identify transcriptomic reads and genomic reads for a sample, where such reads comprise different sets of cellular barcode sequences. In some cases, the disclosed systems further use separate invocations of a configurable processor to align the transcriptomic reads and genomics reads with a reference genome. Based on single-cell counts of aligned transcriptomic reads and aligned genomic reads for target nucleotide sequences, the disclosed systems select a subset of candidate cells corresponding to a subset of cellular barcode sequences. The disclosed systems further generate, for the sample, single-cell multiomics outputs based on the counts of aligned reads.
G06K 7/14 - Methods or arrangements for sensing record carriers by corpuscular radiation using light without selection of wavelength, e.g. sensing reflected white light
52.
Methods of Preparing Directional Tagmentation Sequencing Libraries Using Transposon-Based Technology with Unique Molecular Identifiers for Error Correction
Materials and methods for preparing nucleic acid libraries for next-generation sequencing are described herein. A variety of approaches are described relating to the use of unique molecular identifiers with transposon-based technology in the preparation of sequencing libraries. Also described herein are sequencing materials and methods for identifying and correcting amplification and sequencing errors.
A method comprises: directing, using an objective and a first reflective surface, first autofocus light toward a sensor, the first autofocus light reflected from a first surface of a substrate; preventing second autofocus light from reaching the sensor, the second autofocus light reflected from a second surface of the substrate; and directing, using the objective and a second reflective surface, emission light toward the sensor, the emission light originating from a sample at the substrate.
Described herein are standards and methods of normalizing amplicon size bias. These standards may comprise unique molecular identifiers. In some embodiments, the standards and methods are for use with next generation sequencing (NGS) assays. Also described herein are methods for quantifying DNA damage in a sample comprising DNA using fluorescence or for determining the presence of DNA damage in a library.
Techniques are described for dynamically correcting image distortion during imaging of a patterned sample having repeating spots. Different sets of image distortion correction coefficients may be calculated for different regions of a sample during a first imaging cycle of a multicycle imaging run and subsequently applied in real time to image data generated during subsequent cycles. In one implementation, image distortion correction coefficients may be calculated for an image of a patterned sample having repeated spots by: estimating an affine transform of the image; sharpening the image; and iteratively searching for an optimal set of distortion correction coefficients for the sharpened image, where iteratively searching for the optimal set of distortion correction coefficients for the sharpened image includes calculating a mean chastity for spot locations in the image, and where the estimated affine transform is applied during each iteration of the search.
H04N 1/401 - Compensating positionaly unequal response of the pick-up or reproducing head
H04N 1/409 - Edge or detail enhancement; Noise or error suppression
G06V 20/69 - Microscopic objects, e.g. biological cells or cellular parts
H04N 25/61 - Noise processing, e.g. detecting, correcting, reducing or removing noise the noise originating only from the lens unit, e.g. flare, shading, vignetting or "cos4"
G06V 10/24 - Aligning, centring, orientation detection or correction of the image
G01B 11/16 - Measuring arrangements characterised by the use of optical techniques for measuring the deformation in a solid, e.g. optical strain gauge
G06T 3/00 - Geometric image transformation in the plane of the image
H04N 1/387 - Composing, repositioning or otherwise modifying originals
56.
Display screen or portion thereof with animated graphical user interface
Embodiments of systems, methods, and compositions provided herein relate to on bead tagmentation and droplet indexing. Some embodiments include performing co-assays on partitioned beads, including nucleic acid sequencing, indexed PCR, preparing nucleic acid libraries, determining methylation status, identifying genomic variants, or protein analysis.
A biosensor is provided including a detection device and a flow cell mounted to the detection device. The detection device has a detector surface with a plurality of reaction sites. The detection device also includes a filter layer. A method is providing including obtaining signal data from an array of light detectors; determining a crosstalk function for each of the light detectors of the array of light detectors; and determining characteristics of analytes of interest based on the signal data using the crosstalk functions.
The technology disclosed relates to splice site prediction and aberrant splicing detection. In particular, it relates to a splice site predictor that includes a convolutional neural network trained on training examples of donor splice sites, acceptor splice sites, and non-splicing sites. An input stage of the convolutional neural network feeds an input sequence of nucleotides for evaluation of target nucleotides in the input sequence. An output stage of the convolutional neural network translates analysis by the convolutional neural network into classification scores for likelihoods that each of the target nucleotides is a donor splice site, an acceptor splice site, and a non-splicing site.
G16B 20/00 - ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
G16B 40/00 - ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
G16B 50/00 - ICT programming tools or database systems specially adapted for bioinformatics
The disclosure provides detection apparatus having one or more nanopores, methods for making apparatus having one or more nanopore and methods for using apparatus having one or more nanopores. Uses include, but are not limited to detection and sequencing of nucleic acids.
A composition for capturing target polynucleotides at a surface of a substrate is provided. The composition may include a plurality of capture primers coupled to the surface of the substrate and including modified nucleic acids; and a plurality of orthogonal capture primers coupled to the surface of the substrate and including modified nucleic acids. The modified nucleic acids of the capture primers may include locked nucleic acid (LNA), peptide nucleic acid (PNA), or super T. The modified nucleic acids of the orthogonal capture primers may include locked nucleic acid (LNA), peptide nucleic acid (PNA), or super T.
Provided herein include various examples of a flow cell and methods for forming aspects of flow cell. The method may include applying a first adhesive to a substrate. The method may include orienting a die on the first adhesive. The method may also include orienting a package on the first adhesive. The package includes a die and a top surface of the die comprises an active surface and electrical contact points. Surfaces adjacent to the active surface on at least two opposing sides of the active surface form fanout regions for utilization in a fluidic path of the flow cell. The method further may include applying a second adhesive to a part of the package and attaching a lid to the second adhesive to define a fluidic flow-cell cavity below the lid and above a surface comprising the active surface and the fanout regions.
A method may be implemented for in-tip flow-through magnetic bead processing. A biological solution may include a plurality of magnetic beads suspended therein. The biological solution may be introduced to a tube via an opening in a tip portion of the tube. The tip portion of the tube may include a magnetizable material arranged in a flow path of the biological solution. The magnetizable material may include a ferromagnetic matrix or a wire within the tip portion. A magnetic field may be applied proximate to the tip portion of the tube using an electromagnetic coil. The electromagnetic coil may be wound around the tip portion. The biological solution may be removed from the tube, for example, via the opening in the tip portion. The plurality of magnetic beads may be captured within the magnetizable material in the tip portion using the magnetic field.
The disclosed embodiments concern methods, apparatus, systems and computer program products for determining sequences of interest using unique molecular index sequences that are uniquely associable with individual polynucleotide fragments, including sequences with low allele frequencies and long sequence length. In some implementations, the unique molecular index sequences include variable-length nonrandom sequences. In some implementations, the unique molecular index sequences are associated with the individual polynucleotide fragments based on alignment scores indicating similarity between the unique molecular index sequences and subsequences of sequence reads obtained from the individual polynucleotide fragments. System, apparatus, and computer program products are also provided for determining a sequence of interest implementing the methods disclosed.
A system includes an image sensor structure and a flow cell. The image sensor structure includes an image layer disposed over a base substrate. A device stack is disposed over the image layer. A bond pad is disposed in the device stack. A passivation stack is disposed over the device stack and the bond pad. An array of nanowells is disposed in a top layer of the passivation stack. A through-silicon via (TSV) is in electrical contact with the bond pad. The TSV extends through the base substrate. A redistribution layer (RDL) is disposed on a bottom surface of the base substrate. The RDL is in electrical contact with the TSV. The flow cell is disposed upon the top layer of the passivation stack to form a flow channel therebetween. The flow channel is disposed over the array of nanowells and the bond pad.
H01L 25/065 - Assemblies consisting of a plurality of individual semiconductor or other solid state devices all the devices being of a type provided for in the same subgroup of groups , or in a single subclass of , , e.g. assemblies of rectifier diodes the devices not having separate containers the devices being of a type provided for in group
H01L 23/31 - Encapsulation, e.g. encapsulating layers, coatings characterised by the arrangement
H01L 23/00 - SEMICONDUCTOR DEVICES NOT COVERED BY CLASS - Details of semiconductor or other solid state devices
67.
NON-CONTACT DISPENSERS AND RELATED SYSTEMS AND METHODS
Non-contact dispensers and related systems and methods are disclosed. In accordance with an implementation, an apparatus includes a non-contact dispenser includes a body defining an inlet, an outlet, and a flow path fluidly coupling the inlet and the outlet. The non-contact dispenser also includes a first valve to control flow into a portion of the flow path, a second valve to control flow out of the outlet, and a pump positioned between the first valve and the second valve.
Systems and methods for conducting designated reactions that include a fluidic network having a sample channel, a reaction chamber, and a reservoir. The sample channel is in flow communication with a sample port. The system also includes a rotary valve that has a flow channel and is configured to rotate between first and second valve positions. The flow channel fluidically couples the reaction chamber and the sample channel when the rotary valve is in the first valve position and fluidically couples the reservoir and the reaction chamber when the rotary valve is in the second valve position. A pump assembly induces a flow of a biological sample toward the reaction chamber when the rotary valve is in the first valve position and induces a flow of a reaction component from the reservoir toward the reaction chamber when the rotary valve is in the second valve position.
B01L 3/00 - Containers or dishes for laboratory use, e.g. laboratory glassware; Droppers
B01L 7/00 - Heating or cooling apparatus; Heat insulating devices
F16K 99/00 - Subject matter not provided for in other groups of this subclass
G01N 35/08 - Automatic analysis not limited to methods or materials provided for in any single one of groups ; Handling materials therefor using a stream of discrete samples flowing along a tube system, e.g. flow injection analysis
The current document discusses a detection system comprising a mechanical-change sensor that exhibits one or more mechanical changes when specifically interacting with entities within a target, each entity having a type, a mechanical-change-to-signal transducer that transduces the one or more mechanical changes into a signal, and an analysis subsystem that determines the types of entities within the target using the signal.
Presented herein are methods and compositions for analyzing rare nucleic acid species. Some methods presented herein use DNA reassociation kinetics following thermal denaturation to define populations of nucleic acid sequences, e.g., highly abundant (e.g., cDNA from rRNA), moderately abundant, and less abundant or rare sequences (e.g., cDNA from mRNA).
C40B 30/04 - Methods of screening libraries by measuring the ability to specifically bind a target molecule, e.g. antibody-antigen binding, receptor-ligand binding
C12N 15/10 - Processes for the isolation, preparation or purification of DNA or RNA
C12Q 1/6806 - Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
C12Q 1/6874 - Methods for sequencing involving nucleic acid arrays, e.g. sequencing by hybridisation [SBH]
71.
HYDRATION AND HOMOGENIZATION OF LYOPHILIZED REAGENTS
Provided are systems and methods including, under control of control circuitry implementing a hydration and homogenization protocol, hydrating lyophilized reagents and homogenizing the hydrated reagents. Lyophilized reagent nozzle sippers, including distal tips, extend into lyophilized reagent wells such that the distal tips do not contact the associated lyophilized reagent, designated amounts of hydration fluid are automatically aspirated from the corresponding hydration reservoir by corresponding sippers and discharged into the lyophilized reagent well based on the hydration and homogenization protocol implemented by the control circuitry. The method may also include extending the lyophilized reagent nozzle sippers into lyophilized reagent wells such that the distal tips contact the hydrated reagent, and automatically aspirating and discharging the hydrated reagent based on the hydration and homogenization protocol implemented by the control circuitry.
A method of characterizing candidate agents including steps of (a) providing a library of candidate agents attached to nucleic acid tags; (b) contacting the library with a solid support to attach the candidate agents to the solid support, whereby an array of candidate agents is formed; (c) contacting the array with a screening agent, wherein one or more candidate agents in the array react with the screening agent; (d) detecting the array to determine that at least one candidate agent in the array reacts with the screening agent; (e) sequencing the nucleic acid tag to determine the tag sequences attached to candidate agents in the array; and (f) identifying the at least one candidate agent in the array that reacts with the screening agent based on the tag sequence that is attached to the at least one candidate agent.
C40B 50/16 - Solid phase synthesis, i.e. wherein one or more library building blocks are bound to a solid support during library creation; Particular methods of cleavage from the solid support involving encoding steps
B01L 3/00 - Containers or dishes for laboratory use, e.g. laboratory glassware; Droppers
B01J 19/00 - Chemical, physical or physico-chemical processes in general; Their relevant apparatus
C40B 20/04 - Identifying library members by means of a tag, label, or other readable or detectable entity associated with the library members, e.g. decoding processes
C12N 15/10 - Processes for the isolation, preparation or purification of DNA or RNA
73.
SUBSTRATES FOR PERFORMING QUANTIFICATION PROCESSES AND RELATED SYSTEMS AND METHODS
Substrates for performing quantification processes and related systems and methods are disclosed. In an implementation, an apparatus includes a substrate and an imaging system. The substrate includes a pair of plates and a plurality of spacers positioned between the plates to define a gap between the pair of plates. A portion of a sample is to be received within the gap of the substrate and the imaging system is to obtain image data of the portion of the sample. The image data is to be used to determine a concentration of the portion of the sample.
The present disclosure relates to systems, non-transitory computer-readable media, and methods for efficiently identifying and selecting split groups corresponding to one or more nucleotide reads. Generally, split groups comprise chains of fragments forming split-alignments of one read. The disclosed system utilizes dynamic programming to generate and evaluate candidate split groups. The disclosed system can generate split group scores for each of the candidate split groups. To generate the split group scores, the disclosed system considers fragment alignment scores and geometries of fragment alignments within the candidate split groups. The disclosed systems select a predicted split group from the candidate split groups based on the split group scores.
G16B 40/00 - ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
Provided herein include various examples of an apparatus, a sensor system and examples of a method for manufacturing aspects of an apparatus, a sensor system. The apparatus may include a die. The apparatus may also include a substrate comprising a cavity. The die may be oriented in a portion of the cavity in the substrate, where the orientation defines a first space in the cavity adjacent to a first edge of the upper surface of the die and a second space in the cavity adjacent to the second edge of the upper surface of the die. The apparatus may further include fluidics fan-out regions comprising a first cured material deposited in the first space and the second space, a surface of the fluidics fan-out regions being contiguous with the upper surface of the die.
H01L 23/053 - Containers; Seals characterised by the shape the container being a hollow construction and having an insulating base as a mounting for the semiconductor body
H01L 23/31 - Encapsulation, e.g. encapsulating layers, coatings characterised by the arrangement
H01L 23/00 - SEMICONDUCTOR DEVICES NOT COVERED BY CLASS - Details of semiconductor or other solid state devices
H01L 23/48 - Arrangements for conducting electric current to or from the solid state body in operation, e.g. leads or terminal arrangements
B81B 1/00 - Devices without movable or flexible elements, e.g. microcapillary devices
Light detection devices and related methods are provided. The devices may comprise a reaction structure for containing a reaction solution with a relatively high or low pH and a plurality of reaction sites that generate light emissions. The devices may comprise a device base comprising a plurality of light sensors, device circuitry coupled to the light sensors, and a plurality of light guides that block excitation light but permit the light emissions to pass to a light sensor. The device base may also include a shield layer extending about each light guide between each light guide and the device circuitry, and a protection layer that is chemically inert with respect to the reaction solution extending about each light guide between each light guide and the shield layer. The protection layer prevents reaction solution that passes through the reaction structure and the light guide from interacting with the device circuitry.
Systems, methods, and apparatus are described herein for identifying callable regions and performing variant calling while operating within allocated memory. A sequencing subsystem may comprise a variant caller or variant caller subsystem. The variant caller may include a calling subsystem configured to identify callable regions and may send the callable regions to a downstream genotyping subsystem of the variant caller. The calling subsystem of the variant caller may be configured to detect a callable region of the sequencing data when a depth of the plurality of reads is above a callable region depth threshold. The calling subsystem of the variant caller may monitor memory used by the callable region and, when the memory used exceeds a memory threshold of a total amount of memory allocated, the calling subsystem may split or spill at least a portion of the callable region to operate within the total amount of allocated memory.
Methods and systems for decreasing amplification bias and primer-dimer formation in amplification reactions and for amplifying a plurality of target polynucleotides from a sample in a single reaction and for sequencing the target polynucleotides where samples can include forensic samples and where target polynucleotides can include identity- or ancestry-informative markers, short tandem repeats (STRs) and single nucleotide polymorphisms (SNPs). Methods of determining a nucleotide spacer sequence for disrupting primer dimer formation can include: receiving a set of primer sequences; determining a plurality of candidate spacers between an adapter sequence and a gene-specific portion of the primer sequence, the determined plurality of candidate spacers comprises sequences that disrupt stable interactions between sequences of the set of primer sequences; ranking candidate spacers that meet a predetermined threshold value of stable interactions in the extension sequences; and outputting a set of the ranked spacers that meet the predetermined threshold.
A co-polymer includes a plurality of a first monomer including a terminal functional group that is to attach to at least two different primers; a plurality of a second monomer including a second functional group that is different from the terminal functional group, and that is selected from the group consisting of a phenyl group, methoxy propyl, glycosyl, vinyl pyrrolidone, and an imidazole group; and a plurality of a third monomer that is different from the first and second monomers. This co-polymer may be used in a flow cell, and may enhance the clustering efficiency and kinetics.
This disclosure describes methods, non-transitory computer readable media, and systems that can generate a structural variation graph genome with alternate contiguous sequences representing structural variant haplotypes. For instance, the disclosed systems can identify candidate structural variants that satisfy an occurrence threshold within a genomic sample database. From among the candidate structural variants, the systems select structural variant haplotypes based on one or both of the structural variant haplotypes satisfying a relative haplotype frequency and finding flanking variants adjacent to particular structural variant haplotypes. The systems can likewise select reference haplotypes corresponding to the selected structural variant haplotypes from a reference genome. Based on the selected haplotypes, the disclosed systems generate a structural variation graph genome comprising both alternate contiguous sequences representing the structural variant haplotypes and reference sequences representing the reference haplotypes.
This disclosure describes methods, non-transitory computer readable media, and systems that can determine allele likelihoods of a genomic region exhibiting certain haplotype alleles using one or both of consolidated computations and data exchanges across specialized hardware. For instance, the disclosed systems can determine an intermediate allele likelihood of a genomic region comprising a haplotype allele by running a single-pass-concurrent-multiplication operation. In some cases, the disclosed systems determine and store subsets of intermediate allele likelihoods corresponding to marker-variant groups and extemporaneously generate sets of intermediate allele likelihoods for a set of marker variants by using the intermediate-allele-likelihood subsets as hot-start points. In further embodiments, the disclosed systems determine running sums of intermediate allele likelihoods of a genomic region exhibiting haplotype alleles for haplotypes given one marker variant and use the running sums as inputs to determine intermediate allele likelihoods of the genomic region exhibiting the haplotype alleles given another marker variant.
G16B 20/20 - Allele or variant detection, e.g. single nucleotide polymorphism [SNP] detection
G16B 40/00 - ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
Defocus is introduced during sequencing by synthesis by tilt of a flow cell and by variations in flatness of the flow cell. Effects of the defocus are reduced, and base calling quality is improved using techniques relating to dependence of base calling on flow cell tilt. For example, the flow cell surface height is measured throughout the flow cell. A focal height of an imager having a sensor for the sequencing is set, optionally adaptively, one or more times during the sequencing. Each image captured by the sensor is partitioned, e.g., based on differences between focal height and the measured flow cell surface height across areas of the sensor. Filters, e.g., related to defocus correction, are selected based at least in part on the difference between the focal height and the measured flow cell surface height at a particular area of the image being corrected for defocus.
Embodiments of the present disclosure include methods and compositions for functionalizing molecules, such as oligonucleotides, with functional groups, including polyhistidine tags useful in affinity methods. Some embodiments include methods for modifying and purifying complex mixtures of molecules by exchange of functional tags.
C12Q 1/6811 - Selection methods for production or design of target specific oligonucleotides or binding molecules
C07H 21/00 - Compounds containing two or more mononucleotide units having separate phosphate or polyphosphate groups linked by saccharide radicals of nucleoside groups, e.g. nucleic acids
C12N 15/10 - Processes for the isolation, preparation or purification of DNA or RNA
This disclosure describes methods, non-transitory computer readable media, and systems that can introduce short calibration sequences into a sequencing device and run calibration cycles to adjust or otherwise determine a sequencing parameter corresponding to the sequencing device. For instance, the disclosed systems can detect a flow cell (or other sample-nucleotide slide) with calibration sequences incorporated into samples' library fragments or into a surface of the sample-nucleotide slide. By running one or more calibration cycles to incorporate nucleobases on oligonucleotides corresponding to calibration sequences and capture corresponding images for calibration sequences—separate from genomic sequencing cycles for sample genomic sequences—the disclosed systems can determine a sequencing parameter corresponding to the sequencing device.
An example of a nanopore sensing system includes an application specific integrated circuit (ASIC) sensor mounted on a printed circuit board having an electrical interface with the ASIC sensor; and a nanopore sequencer formed on the ASIC sensor. The nanopore sequencer includes a redox mediator chamber having a cis electrode positioned therein; a cis well; a membrane positioned between the cis well and the redox mediator chamber, the membrane to confine a redox mediator species in the redox mediator chamber and to allow an ionic species to pass between the redox mediator chamber and the cis well; a plurality of trans wells, each including a trans electrode positioned therein; and a plurality of nanopores respectively fluidically connecting the cis well to each of the plurality of trans wells.
Disclosed herein are systems and methods for performing secondary analyses of nucleotide sequencing data in a time-efficient manner. Some embodiments include performing a secondary analysis iteratively while sequence reads are generated by a sequencing system. Secondary analyses can encompass both alignment of sequence reads to a reference sequence (e.g., the human reference genome sequence) and utilization of this alignment to detect differences between a sample and the reference. Secondary analysis can enable detection of genetic differences, variant detection and genotyping, identification of single nucleotide polymorphisms (SNPs), small insertions and deletion (indels) and structural changes in the DNA, such as copy number variants (CNVs) and chromosomal rearrangements.
Described herein is a polynucleotide for use as a sequencing template comprising multiple inserts. Also described herein are method of generating and using these polynucleotides and methods of use of such templates, including analysis of contiguity information. Further, sequencing templates comprising an insert sequence and a copy of the insert sequence can be used to correct for random errors generated during sequencing or amplification or to identify nucleobase damage or other mutation that leads to non-canonical base pairing in a double-stranded nucleic acid. Methods of performing methylation analysis are also described herein.
A nanopore sensing system includes a cis well, a trans well, and a metal based membrane positioned between the cis and trans wells so that a channel defined in the metal based membrane fluidically connects the cis and trans wells. The metal based membrane has a thickness ranging from about 1 nm to about 3 nm and is selected from the group consisting of a metal oxide, a metal sulfide, a metal nitride, a metal phosphide, a metal arsenide, a metal antimonide, a metal selenide, and a metal telluride.
Methods of inserting a nanopore into a polymeric membrane are provided herein. The membrane may be destabilized using a chaotropic solvent. The nanopore may be inserted into the destabilized polymer membrane. The chaotropic solvent may be removed to stabilize the polymer membrane with the nanopore inserted therein.
A flow cell includes a support and a heteropolymer attached to the support. The heteropolymer includes an acrylamide monomer including an attachment group to react with a functional group attached to a primer, and a monomer including a stimuli-responsive functional group. The monomer including the stimuli-responsive functional group may be pH-responsive, temperature-responsive, saccharide-responsive, nucleophile-responsive, and/or salt-responsive.
The present invention includes methods and materials for use in the detection preeclampsia and/or determining an increased risk for preeclampsia in a pregnant female, the method including identifying in a biosample obtained from the pregnant women circulating RNA (C-RNA) molecules associated with preeclampsia.
C12Q 1/6883 - Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
C12Q 1/6806 - Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
92.
APPARATUS AND METHOD OF ESTIMATING VALUES FROM IMAGES
A method is used to generate a distortion model for a structured illumination microscopy (SIM) optical system. A sliding window is moved in relation to a plurality of images to define a plurality of sub-tiles. Each sub-tile represents a portion of the corresponding image. Parameters are estimated for each sub-tiles. The parameters include two or more parameters selected from the group consisting of modulation, angle, spacing, phase offset, and phase deviation. A full width at half maximum (FWHM) value associated with each sub-tile is estimated. A distortion model is estimated, based at least in part on a combination of the estimated parameters and FWHM values stored in the predetermined format and an estimated center window parameter. A two-dimensional image may be generated, based at least in part on the estimated distortion model. The two-dimensional image may include representations indicating where distortions occur in the optical system.
The present disclosure relates to methods, compositions, and kits for treating target nucleic acids, including methods and compositions for fragmenting and tagging nucleic acid (e.g., DNA) using transposome complexes bound to a solid support.
Barriers including molecules covalently bonded to amphiphilic molecules, and methods of making the same, are provided herein. In some examples, a barrier between first and second fluids includes one or more layers comprising a plurality of amphiphilic molecules; and a first layer comprising a plurality of molecules covalently bonded to amphiphilic molecules of the plurality of amphiphilic molecules.
B29C 39/14 - Shaping by casting, i.e. introducing the moulding material into a mould or between confining surfaces without significant moulding pressure; Apparatus therefor for making articles of indefinite length
95.
EQUALIZER-BASED INTENSITY CORRECTION FOR BASE CALLING
The technology disclosed relates to equalizer-based intensity correction for base calling. In particular, the technology disclosed relates to accessing an image whose pixels depict intensity emissions from a target cluster and intensity emissions from additional adjacent clusters, selecting a lookup table that contains pixel coefficients that are configured to increase a signal-to-noise ratio, applying the pixel coefficients to intensity values of the pixels in the image to produce an output, and base calling the target cluster based on the output.
G16B 40/10 - Signal processing, e.g. from mass spectrometry [MS] or from PCR
G06V 10/762 - Arrangements for image or video recognition or understanding using pattern recognition or machine learning using clustering, e.g. of similar faces in social networks
G06V 20/69 - Microscopic objects, e.g. biological cells or cellular parts
96.
COMPOSITIONS AND METHODS FOR NUCLEIC ACID SEQUENCING
Embodiments of the present disclosure relate to kits, compositions, and methods for nucleic acid sequencing, for example, two-channel nucleic acid sequencing by synthesis using blue and green light excitation. In particular, unlabeled nucleotides for incorporation may be used in conjunction with affinity reagents containing detectable labels excitable by blue and/or green lights, for specific binding to each type of nucleotides incorporated.
An example of a flow cell includes a substrate, a plurality of chambers defined on or in the substrate, and a plurality of depressions defined in the substrate and within a perimeter of each of the plurality of chambers. The depressions are separated by interstitial regions. Primers are attached within each of the plurality of depressions, and a capture site is located within each of the plurality of chambers.
B01J 19/00 - Chemical, physical or physico-chemical processes in general; Their relevant apparatus
C12N 15/10 - Processes for the isolation, preparation or purification of DNA or RNA
C40B 40/08 - Libraries containing RNA or DNA which encodes proteins, e.g. gene libraries
B01L 3/00 - Containers or dishes for laboratory use, e.g. laboratory glassware; Droppers
C40B 50/18 - Solid phase synthesis, i.e. wherein one or more library building blocks are bound to a solid support during library creation; Particular methods of cleavage from the solid support using a particular method of attachment to the solid support
98.
TARGETED CALLING OF OVERLAPPING COPY NUMBER VARIANTS
Disclosed herein include systems, devices, and methods for calling overlapping copy number variants (CNVs) of a gene. The gene can comprise a plurality of regions. The gene can have a plurality of CNVs. Two alleles of the gene of a subject can be determined based on a number of copies of each region of the plurality of regions and all CNVs of the plurality of CNVs of the gene comprising the region.
The technology disclosed directly operates on sequencing data and derives its own feature filters. It processes a plurality of aligned reads that span a target base position. It combines elegant encoding of the reads with a lightweight analysis to produce good recall and precision using lightweight hardware. For instance, one million training examples of target base variant sites with 50 to 100 reads each can be trained on a single GPU card in less than 10 hours with good recall and precision. A single GPU card is desirable because it a computer with a single GPU is inexpensive, almost universally within reach for users looking at genetic data. It is readily available on could-based platforms.
G16B 40/00 - ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
G16B 20/00 - ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
G06F 9/38 - Concurrent instruction execution, e.g. pipeline, look ahead
G06N 3/04 - Architecture, e.g. interconnection topology
G06N 3/084 - Backpropagation, e.g. using gradient descent
Provided is a method, including stretching a polynucleotide over a substrate including a plurality of equally spaced cleavage regions including a plurality of transposases, cleaving the polynucleotide with two or more of the plurality of transposases to form a plurality of polynucleotide fragments, and separating, within the plurality of polynucleotide fragments, a population of longer polynucleotide fragments from a population of shorter polynucleotide fragments. Also provided is a method including stretching a polynucleotide over a substrate including a plurality of equally spaced cleavage regions including a plurality of transposases, cleaving the polynucleotide with two or more of the plurality of transposases to form a plurality of polynucleotide fragments, and separating, within the plurality of polynucleotide fragments, a population of longer polynucleotide fragments from a population of shorter polynucleotide fragments.