The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

BIE::App::PacBio - An application for QC of PacBio CCS sequencing data.

SYNOPSIS

It is very easy to use. After installation, just call "CCSQC.pl" followed by path of bas.h5 file.

        CCSQC.pl pacbio.bas.h5

DESCRIPTION

This module installs an application (or more in future) to check sequencing data quality produced by PacBio RS system. PacBio RS is a 3rd-generation sequencing technology which presents novel exciting features. Here this module summarizes our experiences in dealing with PacBio data. Currently it diggs raw data and shows interesting figures for researchers to have ideas about data quality. Besides the usage mentioned above, user could also utilize functions in this package in order to customize scripts for particular questions.

INSTALLATION

There are two ways to install BIE::App::PacBio. User could install it in a working directory, which is the usual way for many researchers who have no hardware rights; another option is for administrator to install it for all users.

PREREQUISITES

Unfortunately, as every software, there may be some annoying installations you must have prior to using this module. They could all get installed with "cpan".

  • Moose

  • namespace::autoclean

  • PDL, PDL::Graphics::PLplot, Cairo

FOR ORDINARY USER

  1. Go to our website and download the zip file.

  2. Unzip the downloaded file and enter the created directory.

  3. Type "make". A executable script will be here. Remember to open another terminal to use it. Ask your administrator for help if you unluckily get error about lacking some prerequisites.

FOR POWER USER

Start a terminal, type "cpan" and press return, then type "install BIE::App::PacBio". That's it.

ATTRIBUTES AND METHODS

Following is simple introduction of involved attributes and methods in this module. Users don't have to know these unless tweaking is wanted.

  • "file": The HDF5 file name. It is the only argument to construct a PacBio object.

  • "h5": A HDF5 object.

  • "content": A list of all datasets in HDF5 file.

  • "data": A hash contains all datasets in HDF5 file, which may occupy huge memories. Don't use it without a reasonable purpose.

  • "hitIdx": The index of hit smart cell holes.

  • "lens": The read lengths of sequencing data.

  • "read": Given a dataset, "read" return corresponding data.

SEE ALSO

There is an example data here.

AUTHOR

Xin Zheng, <zhengxin@mail.nih.gov>

COPYRIGHT AND LICENSE

Copyright (C) 2012 by LIB/SAIC-Frederick at Frederick National Laboratory for Cancer Research.

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.14.2 or, at your option, any later version of Perl 5 you may have available.

By the way, FNL has no responsibility for any unexpected result related with BIE::App::PacBio. The only one to be blamed is listed above.