Sponsored Content
Top Forums UNIX for Beginners Questions & Answers Clustering for beginners. Few questions. Post 302999337 by protos27 on Saturday 17th of June 2017 08:02:40 AM
Old 06-17-2017
hehSmilie understood , but idea of building cluster from 2 or more nodes is just the beginning. I have to do something more - this is obvious. The question there is what can You recomend to search? I consider You as well exp admin so your advice will be helpful as well. For example some scenario about discarding storage (pendrive) etc what do You think? I have specialisation about security. Your ideas are welcome!
 

9 More Discussions You Might Find Interesting

1. Programming

X-programming for beginners

Good morning. Thanks for the very valuable hard-to-find information I get from you guys. Can anybody give any suggested websites or references for anyone who wants to begin learning on programming applications in X? Thanks to anyone in advance... (1 Reply)
Discussion started by: jfsuminist
1 Replies

2. UNIX Desktop Questions & Answers

unix course for beginners

does anyone know of a course for unix beginners (1 Reply)
Discussion started by: moose
1 Replies

3. UNIX Desktop Questions & Answers

UNIX for beginners

I am new to non Windows operating systems. Does anyone have advice on which UNIX OS vendor would be good for learning purposes. I was looking for a version of UNIX that runs on the Intel platform. Do you have any recommendations on where to purchase the software? Thank you. (14 Replies)
Discussion started by: jmy113437
14 Replies

4. Shell Programming and Scripting

Practice examples for beginners

Hi , I am new to shell scripting . I have been go through many sites and ready the material for shell scripting. But I am not getting complete examples for practice. Can any one suggest me any site that contains lots of ready examples for shell scripting ?? Regards (0 Replies)
Discussion started by: himvat
0 Replies

5. Programming

Beginners question about fork

Hi everyone: I'm developing a dynamic library for notifications, this library is used for a daemon that i've programmed, when something goes wrong the library should send an email to an administrator, but since sending an email is a non-vital process then it can fail (it should work as an... (4 Replies)
Discussion started by: edgarvm
4 Replies

6. UNIX Desktop Questions & Answers

Which UNIX version for beginners

1. I would like to know which UNIX version I should use as a beginner and from I can get it? 2. How much hard disk space(drive space...in case of dual boot) does it require? 3. Can I run it from USB or Live CD? 4. Is there any need of installing UNIX if I have Linux... ie does Linux... (3 Replies)
Discussion started by: ArpitRaj
3 Replies

7. UNIX for Dummies Questions & Answers

UNIX for beginners

i'm just a beginner in unix environment- please help which book to read and which os to use!!! :confused: seriously i've no idea what is unix or how much capable it is!! (1 Reply)
Discussion started by: gaurav singh
1 Replies

8. UNIX for Dummies Questions & Answers

UNIX ebook for beginners

hi all, Can you suggest me a ebook for unix beginners. I am new to unix. (2 Replies)
Discussion started by: rajasingam
2 Replies

9. UNIX for Beginners Questions & Answers

Which Linux Certification comes first for beginners?

I want to get a Linux Certification to start my Career in IT sector. My Qualification is MCS from a top university. Suggest me some Linux Certifications. (1 Reply)
Discussion started by: alexwatson1711
1 Replies
PSI-CD-HIT-2D.PL(1)						   User Commands					       PSI-CD-HIT-2D.PL(1)

NAME
psi-cd-hit-2d.pl - runs similar algorithm like CD-HIT but using BLAST to calculate similarities in db1 or db2 format DESCRIPTION
Usage psi-cd-hit-2d [Options] Options -i in_dbname, required -o out_dbname, required -c clustering threshold (sequence identity), default 0.3 -ce clustering threshold (blast expect), default -1, it means by default it doesn't use expect threshold, but with positive value, the program cluster seqs if similarities meet either identity threshold or expect threshold -L coverage of shorter sequence ( aligned / full), default 0.0 -M coverage of longer sequence ( aligned / full), default 0.0 -R (1/0) use psi-blast profile? default 0 perform psi-blast / pdb-blast type search -G (1/0) use global identity? default 1 sequence identity calculated as total identical residues of local alignments / length of shorter seq if you prefer to use -G 0, it is suggested that you also use -L, such as -L 0.8, to prevent very short matches. -d length of description line in the .clstr file, default 30 if set to 0, it takes the fasta defline and stops at first space -l length_of_throw_away_sequences, default 10 -p profile search para, default "-a 2 -d nr80 -j 3 -F F -e 0.001 -b 500 -v 500" -bfdb profile database, default nr80 -s blast search para, default "-F F -e 0.000001 -b 100000 -v 100000" -be blast expect cutoff, default 0.000001 -b filename of list of hosts to run this program in parallel with ssh calls, you need provide a list of hosts -pbs No of jobs to send each time by PBS querying system you can not use both ssh and pbs at same time -k (1/0) keep blast raw output file, default 1 -rs steps of save restart file and clustering output, default 5000 everytime after process 5000 sequences, program write a restart file and current clustering information -restart restart file, readin a restart file if program crash, stoped, termitated, you can restart it by add a option "-restart sth.restart" -rf steps of re format blast database, default 200,000 if program clustered 200,000 seqs, it remove them from seq pool, and re format blast db to save time -local dir of local blast db, when run in parallel with ssh (not pbs), I can copy blast dbs to local drives on each node to save blast db reading time BUT, IT MAY NOT FASTER -J job, job_file, exe specific jobs like parse blast outonly DON'T use it, it is only used by this program itself -single files of ids those you known that they are singletons so I won't run them as queries -i2 second input database -blastn run blastn, default 0 -lo how long can seq in db2 > db1 in a cluster, default 0 means, that seq in db2 should <= seqs in db1 in a cluster ============================== by Weizhong Li, liwz@sdsc.edu ============================== If you find cd-hit useful, please kindly cite: "Clustering of highly homologous sequences to reduce thesize of large protein database", Weizhong Li, Lukasz Jaroszewski & Adam GodzikBioinformatics, (2001) 17:282-283 "Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences", Weizhong Li & Adam Godzik Bioinformatics, (2006) 22:1658-1659 psi-cd-hit-2d.pl 4.6-2012-04-25 April 2012 PSI-CD-HIT-2D.PL(1)
All times are GMT -4. The time now is 03:33 AM.
Unix & Linux Forums Content Copyright 1993-2022. All Rights Reserved.
Privacy Policy