DSLab: Data Science Laboratory

One line command to pre-process the files
A.) Awk command to split a large multi-fasta file in to small files.

1. awk 'BEGIN {n_seq=0;} /^>/ {if(n_seq%200==0){file=sprintf("myseq%d.fa",n_seq);} print >> file; n_seq++; next;} { print >> file; }' < input_file
myseq = output file name. input_file = Name of multi-fasta input file. 200 = maximum no. of sequence in each output fasta file

B.) Perl command to replace/substitute the desire word into other word from a file.

2. perl -pi -e 's/query/result/g' file_name
query = the word to be replace. result = the new word to be placed. file_name = name of the input file

3. Convert xls to csv file command line
ssconvert file.xls file.csv

follow me on facebook

Latest News

1. A crowdsourced analysis to identify ab initio molecular signatures predictive of susceptibility to viral infection Details

2. The Immune Epitope Database (IEDB): 2018 update Details

3. A toolkit for caste differentiation Detail

4. The opium poppy genome and morphinan production Detail

5. PvaxDB: a comprehensive structural repository of Plasmodium vivax proteome Detail

Latest Jobs

Research Associate, Delhi University, Delhi Last Date 28 May 2019

Domain Expert and Analyst, NCCS, PuneLast Date 28 May 2019

Manager-Scientific Data, NCCS, Pune Last Date 28 May 2019

Senior Domain Expert and Domain Expert, IISER, Pune Last Date 8 June 2019

Multiple Project Assistant Positions, IHBT, Palampur Last Date 24 May 2019

ICMR PostDoc Fellowship, Delhi Last Date 30 June 2019

All Jobs

Web-Stat traffic analytics