Systematic discovery of recombinases for efficient integration of large DNA sequences into the human genomeLarge serine recombinases (LSRs) are DNA integrases that facilitate the site-specific integration of mobile genetic elements into bacterial genomes. Only a few LSRs, such as Bxb1 and PhiC31, have been characterized to date, with limited efficiency as tools for DNA integration in human cells. In this study, we developed a computational approach to identify thousands of LSRs and their DNA attachment sites, expanding known LSR diversity by >100-fold and enabling the prediction of their insertion site specificities. We tested their recombination activity in human cells, classifying them as landing pad, genome-targeting or multi-targeting LSRs. Overall, we achieved up to seven-fold higher recombination than Bxb1 and genome integration efficiencies of 40-75% with cargo sizes over 7 kb. We also demonstrate virus-free, direct integration of plasmid or amplicon libraries for improved functional genomics applications. This systematic discovery of recombinases directly from microbial sequencing data provides a resource of over 60 LSRs experimentally characterized in human cells for large-payload genome insertion without exposed DNA double-stranded breaks.
Rapid detection of SARS-CoV-2 RNA in saliva via Cas13Deep learning and CRISPR-Cas13d ortholog discovery for optimized RNA targetingRapid, point-of-care molecular diagnostics with Cas13Rapid nucleic acid testing is a critical component of a robust infrastructure for increased disease surveillance. Here, we report a microfluidic platform for point-of-care, CRISPR-based molecular diagnostics. We first developed a nucleic acid test which pairs distinct mechanisms of DNA and RNA amplification optimized for high sensitivity and rapid kinetics, linked to Cas13 detection for specificity. We combined this workflow with an extraction-free sample lysis protocol using shelf-stable reagents that are widely available at low cost, and a multiplexed human gene control for calling negative test results. As a proof-of-concept, we demonstrate sensitivity down to 40 copies/μL of SARS-CoV-2 in unextracted saliva within 35 minutes, and validated the test on total RNA extracted from patient nasal swabs with a range of qPCR Ct values from 13-35. To enable sample-to-answer testing, we integrated this diagnostic reaction with a single-use, gravity-driven microfluidic cartridge followed by real-time fluorescent detection in a compact companion instrument. We envision this approach for Diagnostics with Coronavirus Enzymatic Reporting (DISCoVER) will incentivize frequent, fast, and easy testing.
Deep learning and CRISPR-Cas13d ortholog discovery for optimized RNA targetingJingyi Wei, Peter Lotfy, Kian Faizi et al.|bioRxiv (Cold Spring Harbor Laboratory)|2021 Abstract Transcriptome engineering technologies that can effectively and precisely perturb mammalian RNAs are needed to accelerate biological discovery and RNA therapeutics. However, the broad utility of programmable CRISPR-Cas13 ribonucleases has been hampered by an incomplete understanding of the design rules governing guide RNA activity as well as cellular toxicity resulting from off-target or collateral RNA cleavage. Here, we sought to characterize and develop Cas13d systems for efficient and specific RNA knockdown with low cellular toxicity in human cells. We first quantified the performance of over 127,000 RfxCas13d (CasRx) guide RNAs in the largest-scale screen to date and systematically evaluated three linear, two ensemble, and two deep learning models to build a guide efficiency prediction algorithm validated across multiple human cell types in orthogonal validation experiments ( https://www.RNAtargeting.org ). Deep learning model interpretation revealed specific sequence motifs at spacer position 15-24 along with favored secondary features for highly efficient guides. We next identified 46 novel Cas13d orthologs through metagenomic mining for activity and cytotoxicity screening, discovering that the metagenome-derived DjCas13d ortholog achieves low cellular toxicity and high transcriptome-wide specificity when deployed against high abundance transcripts or in sensitive cell types, including human embryonic stem cells, neural progenitor cells, and neurons. Finally, our Cas13d guide efficiency model successfully generalized to DjCas13d, highlighting the utility of a comprehensive approach combining machine learning with ortholog discovery to advance RNA targeting in human cells.