Data and analysis code for an MS on SK VOC genomes phenotyping/neutralisation assays

Overview

Description

image

Summary of phylogenomic methods and analyses used in "Immunogenicity of convalescent and vaccinated sera against clinical isolates of ancestral SARS-CoV-2, Beta, Delta, and Omicron variants"

Methods

Raw reads underwent adapter/quality trimming (trim-galore v0.6.5 [citation: https://github.com/FelixKrueger/TrimGalore]), host filtering and read mapping to reference (bwa v0.7.17 [citation: arXiv:1303.3997v2 ], samtools v.1.7 [citation: 10.1093/bioinformatics/btp352]) trimming of primers (iVar v1.3 [citation:10.1186/s13059-018-1618-7]) and variant/consensus calling (freebayes v1.3.2 [citation: arXiv:1207.3907]) using the SIGNAL workflow (https://github.com/jaleezyy/covid-19-signal) v1.4.4dev (#60dd466) [citation: doi.org/10.3390/v12080895] with the ARTICv4 amplicon scheme (from https://github.com/artic-network/artic-ncov2019) and the MN908947.3 SARS-CoV-2 reference genome and annotations. Additional quality control and variant effect annotation (SnpEff v5.0-0 [citation:0.4161/fly.19695]) was performed using the ncov-tools v1.8.0 (https://github.com/jts/ncov-tools/). Finally, PANGO lineages were assigned to consensus sequences using pangolin v3.1.17 (with the PangoLEARN v2021-12-06 models) [citation:10.1093/ve/veab064], scorpio v0.3.16 (with constellations v0.1.1) [citation: https://github.com/cov-lineages/scorpio], and PANGO-designations v1.2.117 [citation:10.1038/s41564-020-0770-5]. Variants were summarised using PyVCF v0.6.8 [citation:https://github.com/jamescasbon/PyVCF] and pandas v1.2.4 [citation:10.25080/Majora-92bf1922-00a]. Phylogenetic analysis was performed using augur v13.1.0 [citation: 10.21105/joss.02906] with IQTree (v2.2.0beta) [citation:10.1093/molbev/msaa015] and the resulting phylogenetic figure generated using ETE v3.1.2 [citation: 10.1093/molbev/msw046]. Contexual sequences were incorporated into the phylogenetic analysis by using Nexstrain's ingested GISAID metadata and pandas to randomly sample a representative subset of sequences (jointly deposited in NCBI and GISAID) that belonged to lineages observed in Canada (see sequences_used_in_tree_with_acknowledgements.tsv for metadata and acknowledgements).

File Description

  • 20220101_MN01513_WGS114_DEC31SRI_CK_summary_valid_negative_pass_only.tsv ncov-tools generate QC summary

  • sk_variant_summary.ipynb notebook containing code to summarise variants (tables/variant_percentage_read_support_protein_nonsynonymous_only.tsv and graphic figures/intermediate/spike_mutation_table_styled.png) and subsample representative genomes phlyogeny/seqs/open_context_genomes.fasta from GISAID (nextstrain ingested fasta and metadata from 2021-12-31: metadata_2021-12-31_17-29.tsv.gz and sequences_fasta_2022_01_03.tar.xz)

  • genomes/ Consensus sequences generated by FreeBayes via SIGNAL.

  • variants/ ncov-tools SnpEff annotated SIGNAL FreeBayes VCFs

  • phylogeny data used to generate annotated phylogeny with augur

  • phylogeny/tree.sh script used to generate phylogeny

  • phylogeny/seqs sequences used for phlyogeny

  • phylogeny/data reference data for phylogeny

  • phylogeny/augur phylogeny and intermediate files

  • phlyogeny/viz_tree.py ete3 based script to generate phylogeny figure (tree.svg)

  • figure files for generating result plot

  • figure/phylo_variant_figure.* final figure combining tree.svg and spike_mutation_table_styled.png

  • figure/intermediate/tree.svg rendered SVG of phylogeny

  • figure/intermediate/spike_mutation_table_styled.png rendered summary of variants

  • tables set of tables for manuscript

  • tables/sequences_used_in_tree_with_acknowledgements.tsv ncov-ingest metadata with acknowledgements

  • tables/variant_percentage_read_support_protein_nonsynonymous_only.tsv summary of variants

You might also like...
Vertical Federated Principal Component Analysis and Its Kernel Extension on Feature-wise Distributed Data based on Pytorch Framework
Vertical Federated Principal Component Analysis and Its Kernel Extension on Feature-wise Distributed Data based on Pytorch Framework

VFedPCA+VFedAKPCA This is the official source code for the Paper: Vertical Federated Principal Component Analysis and Its Kernel Extension on Feature-

BisQue is a web-based platform designed to provide researchers with organizational and quantitative analysis tools for 5D image data. Users can extend BisQue by implementing containerized ML workflows.
BisQue is a web-based platform designed to provide researchers with organizational and quantitative analysis tools for 5D image data. Users can extend BisQue by implementing containerized ML workflows.

Overview BisQue is a web-based platform specifically designed to provide researchers with organizational and quantitative analysis tools for up to 5D

Easily pull telemetry data and create beautiful visualizations for analysis.
Easily pull telemetry data and create beautiful visualizations for analysis.

This repository is a work in progress. Anything and everything is subject to change. Porpo Table of Contents Porpo Table of Contents General Informati

Code for reproducing our analysis in the paper titled: Image Cropping on Twitter: Fairness Metrics, their Limitations, and the Importance of Representation, Design, and Agency
Code for reproducing our analysis in the paper titled: Image Cropping on Twitter: Fairness Metrics, their Limitations, and the Importance of Representation, Design, and Agency

Image Crop Analysis This is a repo for the code used for reproducing our Image Crop Analysis paper as shared on our blog post. If you plan to use this

Request execution of Galaxy SARS-CoV-2 variation analysis workflows on input data you provide.
Request execution of Galaxy SARS-CoV-2 variation analysis workflows on input data you provide.

SARS-CoV-2 processing requests Request execution of Galaxy SARS-CoV-2 variation analysis workflows on input data you provide. Prerequisites This autom

 TagLab: an image segmentation tool oriented to marine data analysis
TagLab: an image segmentation tool oriented to marine data analysis

TagLab: an image segmentation tool oriented to marine data analysis TagLab was created to support the activity of annotation and extraction of statist

Deep Learning applied to Integral data analysis

DeepIntegralCompton Deep Learning applied to Integral data analysis Module installation Move to the root directory of the project and execute : pip in

The source code for the Cutoff data augmentation approach proposed in this paper: "A Simple but Tough-to-Beat Data Augmentation Approach for Natural Language Understanding and Generation".

Cutoff: A Simple Data Augmentation Approach for Natural Language This repository contains source code necessary to reproduce the results presented in

Code for the paper A Theoretical Analysis of the Repetition Problem in Text Generation
Code for the paper A Theoretical Analysis of the Repetition Problem in Text Generation

A Theoretical Analysis of the Repetition Problem in Text Generation This repository share the code for the paper "A Theoretical Analysis of the Repeti

Releases(v0.1.1)
Owner
Finlay Maguire
Assistant Professor (Computer Science & Epidemiology). Working on infectious disease genomic epidemiology & data-driven solutions to social crises
Finlay Maguire
Open standard for machine learning interoperability

Open Neural Network Exchange (ONNX) is an open ecosystem that empowers AI developers to choose the right tools as their project evolves. ONNX provides

Open Neural Network Exchange 13.9k Dec 30, 2022
Meandering In Networks of Entities to Reach Verisimilar Answers

MINERVA Meandering In Networks of Entities to Reach Verisimilar Answers Code and models for the paper Go for a Walk and Arrive at the Answer - Reasoni

Shehzaad Dhuliawala 271 Dec 13, 2022
FastFace: Lightweight Face Detection Framework

Light Face Detection using PyTorch Lightning

Ömer BORHAN 75 Dec 05, 2022
Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations"

FAME: Feature-based Adversarial Meta-Embeddings This is the companion code for the experiments reported in the paper "FAME: Feature-Based Adversarial

Bosch Research 11 Nov 27, 2022
Official repository of "Investigating Tradeoffs in Real-World Video Super-Resolution"

RealBasicVSR [Paper] This is the official repository of "Investigating Tradeoffs in Real-World Video Super-Resolution, arXiv". This repository contain

Kelvin C.K. Chan 566 Dec 28, 2022
TensorFlow implementation of the algorithm in the paper "Decoupled Low-light Image Enhancement"

Decoupled Low-light Image Enhancement Shijie Hao1,2*, Xu Han1,2, Yanrong Guo1,2 & Meng Wang1,2 1Key Laboratory of Knowledge Engineering with Big Data

17 Apr 25, 2022
PyTorch implementation of "VRT: A Video Restoration Transformer"

VRT: A Video Restoration Transformer Jingyun Liang, Jiezhang Cao, Yuchen Fan, Kai Zhang, Rakesh Ranjan, Yawei Li, Radu Timofte, Luc Van Gool Computer

Jingyun Liang 837 Jan 09, 2023
The official code for PRIMER: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization

PRIMER The official code for PRIMER: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization. PRIMER is a pre-trained model for mu

AI2 114 Jan 06, 2023
This repository includes different versions of the prescribed-time controller as Simulink blocks and MATLAB script codes for engineering applications.

Prescribed-time Control Prescribed-time control (PTC) blocks in Simulink environment, MATLAB R2020b. For more theoretical details, refer to the papers

Amir Shakouri 1 Mar 11, 2022
Learning Pixel-level Semantic Affinity with Image-level Supervision for Weakly Supervised Semantic Segmentation, CVPR 2018

Learning Pixel-level Semantic Affinity with Image-level Supervision This code is deprecated. Please see https://github.com/jiwoon-ahn/irn instead. Int

Jiwoon Ahn 337 Dec 15, 2022
AugLy is a data augmentations library that currently supports four modalities (audio, image, text & video) and over 100 augmentations

AugLy is a data augmentations library that currently supports four modalities (audio, image, text & video) and over 100 augmentations. Each modality’s augmentations are contained within its own sub-l

Facebook Research 4.6k Jan 09, 2023
Nicely is a real-time Feedback and Intervention Program Depression is a prevalent issue across all age groups, socioeconomic classes, and cultural identities.

Nicely is a real-time Feedback and Intervention Program Depression is a prevalent issue across all age groups, socioeconomic classes, and cultural identities.

1 Jan 16, 2022
Deep generative models of 3D grids for structure-based drug discovery

What is liGAN? liGAN is a research codebase for training and evaluating deep generative models for de novo drug design based on 3D atomic density grid

Matt Ragoza 152 Jan 03, 2023
An algorithm study of the 6th iOS 10 set of Boost Camp Web Mobile

알고리즘 스터디 🔥 부스트캠프 웹모바일 6기 iOS 10조의 알고리즘 스터디 입니다. 개인적인 사정 등으로 S034, S055만 참가하였습니다. 스터디 목적 상진: 코테 합격 + 부캠끝나고 아침에 일어나기 위해 필요한 사이클 기완: 꾸준하게 자리에 앉아 공부하기 +

2 Jan 11, 2022
A Python multilingual toolkit for Sentiment Analysis and Social NLP tasks

pysentimiento: A Python toolkit for Sentiment Analysis and Social NLP tasks A Transformer-based library for SocialNLP classification tasks. Currently

298 Jan 07, 2023
Improving 3D Object Detection with Channel-wise Transformer

"Improving 3D Object Detection with Channel-wise Transformer" Thanks for the OpenPCDet, this implementation of the CT3D is mainly based on the pcdet v

Hualian Sheng 107 Dec 20, 2022
A basic reminder tool written in Python.

A simple Python Reminder Here's a basic reminder tool written in Python that speaks to the user and sends a notification. Run pip3 install pyttsx3 w

Sachit Yadav 4 Feb 05, 2022
SphereFace: Deep Hypersphere Embedding for Face Recognition

SphereFace: Deep Hypersphere Embedding for Face Recognition By Weiyang Liu, Yandong Wen, Zhiding Yu, Ming Li, Bhiksha Raj and Le Song License SphereFa

Weiyang Liu 1.5k Dec 29, 2022
The MATH Dataset

Measuring Mathematical Problem Solving With the MATH Dataset This is the repository for Measuring Mathematical Problem Solving With the MATH Dataset b

Dan Hendrycks 267 Dec 26, 2022
Anonymous implementation of KSL

k-Step Latent (KSL) Implementation of k-Step Latent (KSL) in PyTorch. Representation Learning for Data-Efficient Reinforcement Learning [Paper] Code i

1 Nov 10, 2021