DSLab: Data Science Laboratory

One line command to pre-process the files
A.) Awk command to split a large multi-fasta file in to small files.

1. awk 'BEGIN {n_seq=0;} /^>/ {if(n_seq%200==0){file=sprintf("myseq%d.fa",n_seq);} print >> file; n_seq++; next;} { print >> file; }' < input_file
myseq = output file name. input_file = Name of multi-fasta input file. 200 = maximum no. of sequence in each output fasta file

B.) Perl command to replace/substitute the desire word into other word from a file.

2. perl -pi -e 's/query/result/g' file_name
query = the word to be replace. result = the new word to be placed. file_name = name of the input file

3. Convert xls to csv file command line
ssconvert file.xls file.csv

follow me on facebook

Latest News

1. A crowdsourced analysis to identify ab initio molecular signatures predictive of susceptibility to viral infection Details

2. The Immune Epitope Database (IEDB): 2018 update Details

3. A toolkit for caste differentiation Detail

4. The opium poppy genome and morphinan production Detail

5. PvaxDB: a comprehensive structural repository of Plasmodium vivax proteome Detail

Latest Jobs

DST-Young Scientists, Department of Science & Technology (DST), Delhi Last Date 20 April. 2019

SwarnaJayanti Fellowship, Department of Science & Technology (DST), Delhi Last Date 31 Mar. 2019

Scientist-B, ICMR-Vector Control Research Centre, Puducherry Last Date 8 Apr. 2019

Scientist, CSIR-IMTECH, Chandigarh Last Date 8 Apr. 2019

Technical Assistant, ICMR-Vector Control Research Centre, Puducherry Last Date 15 Apr. 2019

Technical Assistant, ICMR-National Institute of Malaria Research, Delhi Last Date 19 Apr. 2019

Staff Scientist at DBT-National Institute of Immunology, Delhi Last Date Rolling Advertisement

All Jobs

Web-Stat traffic analytics