Bioinformatics algorithm library-NeoBio
NeoBio is a library of bioinformatics algorithms implemented in Java..
What algorithms? The current version consists mainly of (pairwise) sequence alignment algorithms such as the classical dynamic programming methods of Needleman & Wunsch (global alignment) and Smith & Waterman (local alignment).
Anything else? Yes, a more efficient approach, due to Crochemore, Landau and Ziv-Ukelson is also available. It uses Lempel-Ziv compression to speed-up the computation of the dynamic programming matrix. It also relies on the SMAWK algorithm, due to Aggarwal et al., that computes all column maxima of a totally monotone matrix in linear time.
Hum... And all sequence alignment algorithms support simple scoring schemes as well as substitution matrices such as standard BLOSUM and PAM matrices. But so far they support constant gap penalty functions only. Future versions may contain related algorithms such as multiple sequence alignment, database search and protein structure prediction.
Wow...Last but not least, NeoBio also provides a simple GUI and command line based tools to run the sequence alignment algorithms on DNA and protein sequences.
NeoBio project is hosted at SourceForge.net and is available in two distributions. The executable JAR file can be used to easily run the library's main utility as described in the next section.
The ZIP file can be used to compile the source code as described in the compiling section. It contains:
What algorithms? The current version consists mainly of (pairwise) sequence alignment algorithms such as the classical dynamic programming methods of Needleman & Wunsch (global alignment) and Smith & Waterman (local alignment).
Anything else? Yes, a more efficient approach, due to Crochemore, Landau and Ziv-Ukelson is also available. It uses Lempel-Ziv compression to speed-up the computation of the dynamic programming matrix. It also relies on the SMAWK algorithm, due to Aggarwal et al., that computes all column maxima of a totally monotone matrix in linear time.
Hum... And all sequence alignment algorithms support simple scoring schemes as well as substitution matrices such as standard BLOSUM and PAM matrices. But so far they support constant gap penalty functions only. Future versions may contain related algorithms such as multiple sequence alignment, database search and protein structure prediction.
Wow...Last but not least, NeoBio also provides a simple GUI and command line based tools to run the sequence alignment algorithms on DNA and protein sequences.
NeoBio project is hosted at SourceForge.net and is available in two distributions. The executable JAR file can be used to easily run the library's main utility as described in the next section.
The ZIP file can be used to compile the source code as described in the compiling section. It contains:
- source code - complete Java source code released under the GNU General Public License;
- binary code - compiled with Sun's Java 2 SDK, Standard Edition, v 1.4.0;
- test cases - includes several DNA and protein sequences as well as common BLOSUM and PAM substitution matrices;
- documentation - API documentation produced with Javadoc.
0 comments:
Post a Comment