DSLab: Data Science Laboratory

One line command to pre-process the files
A.) Awk command to split a large multi-fasta file in to small files.

1. awk 'BEGIN {n_seq=0;} /^>/ {if(n_seq%200==0){file=sprintf("myseq%d.fa",n_seq);} print >> file; n_seq++; next;} { print >> file; }' < input_file
myseq = output file name. input_file = Name of multi-fasta input file. 200 = maximum no. of sequence in each output fasta file

B.) Perl command to replace/substitute the desire word into other word from a file.

2. perl -pi -e 's/query/result/g' file_name
query = the word to be replace. result = the new word to be placed. file_name = name of the input file

3. Convert xls to csv file command line
ssconvert file.xls file.csv

follow me on facebook

Latest News

1. A crowdsourced analysis to identify ab initio molecular signatures predictive of susceptibility to viral infection Details

2. The Immune Epitope Database (IEDB): 2018 update Details

3. A toolkit for caste differentiation Detail

4. The opium poppy genome and morphinan production Detail

5. PvaxDB: a comprehensive structural repository of Plasmodium vivax proteome Detail

Latest Jobs

DST Women Scientist (WOS-B), Department of Science & Technology (DST), Delhi Last Date 16 Nov. 2018

Scientist, Project Assistant, JRF at Jamia Hamdard Univeristy, Delhi Last Date 20 Nov. 2018

Information Officer, Maharaja Sayajirao University of Baroda, Gujarat Last Date 30 Nov. 2018

Scientist-B, C, D, Rajendra Memorial Research Institute of Medical Sciences (RMRIMS), Patna Last Date 12 Dec. 2018

Scientist-C, Scientist-D, Consultant, ICMR, Delhi Last Date 19 Nov. 2018

Assistant Professor, Central University of Jharkhand, Ranchi Last Date 27 Nov. 2018

Visiting Postdoctoral Scholar in Bioinformatics, TIGS-InStem, Bangalore Last Date 01 Dec. 2018

Scientist-D, E, F at ICMR, Delhi Last Date 7 Dec. 2018

All Jobs

Web-Stat traffic analytics