DSLab: Data Science Laboratory

One line command to pre-process the files
A.) Awk command to split a large multi-fasta file in to small files.

1. awk 'BEGIN {n_seq=0;} /^>/ {if(n_seq%200==0){file=sprintf("myseq%d.fa",n_seq);} print >> file; n_seq++; next;} { print >> file; }' < input_file
myseq = output file name. input_file = Name of multi-fasta input file. 200 = maximum no. of sequence in each output fasta file

B.) Perl command to replace/substitute the desire word into other word from a file.

2. perl -pi -e 's/query/result/g' file_name
query = the word to be replace. result = the new word to be placed. file_name = name of the input file

3. Convert xls to csv file command line
ssconvert file.xls file.csv

follow me on facebook

Latest News

1. Genome-wide identification and analysis of GRAS transcription factors in the bottle gourd genome Details

2. VacPred: Sequence-based prediction of plant vacuole proteins using machine-learning techniques Details

3. A toolkit for caste differentiation Detail

4. The opium poppy genome and morphinan production Detail

5. PvaxDB: a comprehensive structural repository of Plasmodium vivax proteome Detail

Latest Jobs

Scientist, CSIR-NEIST, Assam Last Date 07 Aug. 2019

Scientist, CSIR-Central Salt & Marine Chemicals Research Institute, Gujrat Last Date 9 Aug. 2019

Scientist, DRDO, Delhi Last Date 20 Aug. 2019

DBT-INSPIRE Faculty at DBT, Delhi Last Date 31 Aug. 2019

Research Associate at IARI, Delhi Last Date 26 Aug. 2019

Research Associate at National Institute of Pathology, Delhi Last Date 29 Aug. 2019

Scientist at National Institute of Biologicals, Delhi Last Date 6 Oct. 2019

Web-Stat traffic analytics