Please take our Survey
logo       

Choosing A Webhost:
A web hosting service is a type of Internet hosting service that allows individuals and organizations to provide their own website accessible via the World Wide Web. Web hosts are companies that provide space on a server they own for use by their clients as well as providing Internet connectivity, typically in a data center. Web hosts can also provide data center space and connectivity to the Internet for servers they do not own to be located in their data center, called colocation. more...

Re: Parsing MegaBLAST output files?: msg#00034

java.bio.general

Subject: Re: Parsing MegaBLAST output files?

Hello -

MegaBLAST is not offcially supported. This doesn't mean it won't work it
just means we don't know if it will work. If it isn't too different from
normal blast it probably will.

The BlastLikeSAXParser has two modes. Lazy and Strict. If you call
setModeLazy() before parsing it won't care if it doesn't recognise the
format as one that is tried and tested and will attempt to parse it
anyway. You should carefully check a few results though to make sure it is
going well. If things work let us know so we can add MegaBLAST to the list
of trusted programs.

Hope this helps,

Mark





James Diggans <jdiggans@xxxxxxxxxxxxxxxxx>
Sent by: biojava-l-bounces@xxxxxxxxxxxxxxxxxxx
11/22/2004 02:38 PM


To: BioJava <biojava-l@xxxxxxxxxxx>
cc: (bcc: Mark Schreiber/GP/Novartis)
Subject: [Biojava-l] Parsing MegaBLAST output files?


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1



All, I'm attempting to use BioJava to parse the output from NCBI's
commandline MegaBLAST and receiving an error:

'Could not recognise the format of this file as one supported by the
framework.'

in a SAXException thrown by BlastLikeSAXParser. An old post to the
mailing list:

http://www.biojava.org/pipermail/biojava-dev/2002-October/000150.html

seems to indicate that this was fixed long ago via this commit to CVS:

http://cvs.biojava.org/cgi-bin/viewcvs/viewcvs.cgi/biojava-live/src/org/biojava/bio/program/ssbind/HeaderStAXHandler.java.diff?r1=1.3&r2=1.4&cvsroot=biojava

The MegaBLAST file I'm trying to parse is clean and my attempt at a
parse consists of (largely pulled from the recipe from BioJava in Anger):

- ------------------
InputStream is = new FileInputStream(blastResult);

BlastLikeSAXParser parser = new BlastLikeSAXParser();
SeqSimilarityAdapter adapter = new SeqSimilarityAdapter();
parser.setContentHandler(adapter);

alignmentResults = new ArrayList();
SearchContentHandler builder = new
BlastLikeSearchBuilder(alignmentResults,
~ new DummySequenceDB("queries"),
new DummySequenceDBInstallation());

adapter.setSearchContentHandler(builder);

parser.parse(new InputSource(is));
- ------------------

Any ideas on why I'm getting the SAXException? Thanks ...
- -j

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.3-nr1 (Windows XP)
Comment: Using GnuPG with Thunderbird - http://enigmail.mozdev.org

iD8DBQFBoYlc75jgGJzUhNkRAu8zAJ9gTNoPouk4/29EDpWKcQVx5EB34gCg2MkD
DndldC3zi3bD2QKWgqMNOxs=
=TS47
-----END PGP SIGNATURE-----
_______________________________________________
Biojava-l mailing list - Biojava-l@xxxxxxxxxxx
http://biojava.org/mailman/listinfo/biojava-l



_______________________________________________
Biojava-l mailing list - Biojava-l@xxxxxxxxxxx
http://biojava.org/mailman/listinfo/biojava-l



<Prev in Thread] Current Thread [Next in Thread>
Google Custom Search

Free Magazines

Cisco News
Receive a free quarterly e-newsletter with exclusive articles on how Cisco IT uses its own products and solutions to enable the business.
subscribe

Systems Management News, the newspaper for IT systems administration and data center managers! Each issue of Systems Management News is chock-full of news and analysis to help you understand what's happening in your field.
subscribe

The Enterprise Newsweekly eWeek is the essential technology information source for builders of e-business.
subscribe

Oracle Magazine Oracle Magazine contains technology strategy articles, sample code, tips, Oracle and partner news, how to articles for developers and DBAs, and more. Oracle (NASDAQ: ORCL) is the world's largest enterprise software company.
subscribe

Total Telecom Total Telecom is "The Economist of the communications industry".
subscribe

Navigation

Home | advertise | OSDir is an inevitable website. super tiny logo