DSLab: Data Science Laboratory

One line command to pre-process the files
A.) Awk command to split a large multi-fasta file in to small files.

1. awk 'BEGIN {n_seq=0;} /^>/ {if(n_seq%200==0){file=sprintf("myseq%d.fa",n_seq);} print >> file; n_seq++; next;} { print >> file; }' < input_file
myseq = output file name. input_file = Name of multi-fasta input file. 200 = maximum no. of sequence in each output fasta file

B.) Perl command to replace/substitute the desire word into other word from a file.

2. perl -pi -e 's/query/result/g' file_name
query = the word to be replace. result = the new word to be placed. file_name = name of the input file

3. Convert xls to csv file command line
ssconvert file.xls file.csv

follow me on facebook

Latest News

1. A crowdsourced analysis to identify ab initio molecular signatures predictive of susceptibility to viral infection Details

2. The Immune Epitope Database (IEDB): 2018 update Details

3. A toolkit for caste differentiation Detail

4. The opium poppy genome and morphinan production Detail

5. PvaxDB: a comprehensive structural repository of Plasmodium vivax proteome Detail

Latest Jobs

Scientist-E, National Institute of Animal Biotechnology (NIAB), Hyderabad Last Date 11 Feb. 2019

Consultant Public Health at National Centre for Disease Control, Delhi Last Date 16 Jan. 2019

CSIR-Young Scientist Award-2019, CSIR, Delhi Last Date 31 Jan. 2019

Research Scientist, Research Associate, JRF at University of Hyderabad, Hyderabad Last Date 20 Jan. 2019

Research Officer, at AIIMS, Bhopal Last Date 25 Jan. 2019

Scientist C, Executive officer at Delhi University, Delhi Last Date 18 Feb. 2019

All Jobs

Web-Stat traffic analytics