The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.
 FASTA searches a protein or DNA sequence data bank
 version 3.3t08 Jan. 17, 2001
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

t/data/cysprot1.fa: 343 aa
 >CYS1_DICDI
 vs  /data_2/jason/blastdb/ecoli.aa library
searching /data_2/jason/blastdb/ecoli.aa library

       opt      E()
< 20     0     0:
  22     0     0:           one = represents 7 library sequences
  24     0     0:
  26     0     0:
  28     0     1:*
  30     4     6:*
  32    13    23:== *
  34    62    62:========*
  36   130   127:==================*
  38   252   210:=============================*======
  40   310   293:=========================================*===
  42   405   359:===================================================*======
  44   401   396:========================================================*=
  46   386   403:======================================================== *
  48   348   386:==================================================     *
  50   360   352:==================================================*=
  52   290   309:==========================================  *
  54   264   264:=====================================*
  56   215   221:===============================*
  58   145   181:=====================    *
  60   144   147:====================*
  62   119   118:================*
  64    96    94:=============*
  66    72    74:==========*
  68    65    58:========*=
  70    54    46:======*=
  72    30    36:=====*
  74    19    28:===*
  76    26    22:===*
  78    18    17:==*
  80    19    13:=*=
  82    14    10:=*
  84     8     8:=*
  86     4     6:*
  88     2     5:*          inset = represents 1 library sequences
  90     4     4:*
  92     3     3:*         :==*
  94     2     2:*         :=*
  96     1     2:*         :=*
  98     1     1:*         :*
 100     0     1:*         :*
 102     0     1:*         :*
 104     2     1:*         :*=
 106     0     0:          *
 108     1     0:=         *=
 110     0     0:          *
 112     0     0:          *
 114     0     0:          *
 116     0     0:          *
 118     0     0:          *
>120     0     0:          *
1358987 residues in  4289 sequences
  Expectation_n fit: rho(ln(x))= 5.9493+/-0.00202; mu= 2.7408+/- 0.115
 mean_var=77.5610+/-17.011, 0's: 0 Z-trim: 0  B-trim: 2 in 1/41
 Lambda= 0.1456
 Kolmogorov-Smirnov  statistic: 0.0234 (N=29) at  44

FASTA (3.36 June 2000) function [optimized, BL50 matrix (15:-5)] ktup: 2
 join: 37, opt: 25, gap-pen: -12/ -2, width:  16
 Scan time:  1.110
The best scores are:                                       opt bits E(4289)
gi|1787478|gb|AAC74309.1| (AE000221) nitrate redu  ( 512)   92   29     1.2
gi|1790635|gb|AAC77148.1| (AE000491) putative DEO  ( 251)   84   27     2.1
gi|1786590|gb|AAC73494.1| (AE000145) orf, hypothe  (  94)   78   26     2.1
gi|1790853|gb|AAC77345.1| (AE000509) soluble lyti  ( 654)   84   28     4.8
gi|1789307|gb|AAC75975.1| (AE000377) biosynthetic  ( 658)   83   27     5.6
gi|1788174|gb|AAC74937.1| (AE000280) orf, hypothe  ( 199)   74   25     7.4
gi|1789138|gb|AAC75818.1| (AE000361) putative kin  ( 492)   79   26     7.8
gi|1789427|gb|AAC76084.1| (AE000386) orf, hypothe  ( 354)   76   26     9.1

>>gi|1787478|gb|AAC74309.1| (AE000221) nitrate reductase  (512 aa)
 initn:  35 init1:  35 opt:  92  Z-score: 109.2  bits: 29.2 E():  1.2
Smith-Waterman score: 92;  23.936% identity (26.012% ungapped) in 188 aa overlap (125-305:2-181)

          100       110       120       130       140        150   
CYS1_D NKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTT-GNV--
                                     . :. :  : :  .: .: . :.:  ::  
gi|178                              MKIRSQVGMVLNLDKCIGCHTCSVTCKNVWT
                                            10        20        30 

               160       170        180       190       200        
CYS1_D --EGQHFISQNKLVSLSEQNLVDCDHECME-YEGEEACDEGCNGGLQPNAYNYIIKNGGI
         :: ..   :.. .   :..   : : .: :.:     .  :: :::   :  .  : :
gi|178 SREGVEYAWFNNVETKPGQGF-PTDWENQEKYKGGWI--RKINGKLQPRMGNRAMLLGKI
              40        50         60          70        80        

      210       220       230       240       250        260       
CYS1_D QTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGP-LAIAADAVEW
        ..   :   .     .:.  :. .   .     :.. .     . ::  .:    . .:
gi|178 FANPHLPGIDDYYEPFDFDYQNLHTAPEG----SKSQPIARPRSLITGERMAKIEKGPNW
       90       100       110           120       130       140    

       270       280       290       300       310       320       
CYS1_D QFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRR
       .  .:: ::   . ...:. :  . ::  .. :   .:                      
gi|178 EDDLGGEFDKLAKDKNFDN-IQKAMYSQFENTFMMYLPRLCEHCLNPACVATCPSGAIYK
          150       160        170       180       190       200   

       330       340                                               
CYS1_D GKNTCGVSNFVSTSII                                            
                                                                   
gi|178 REEDGIVLIDQDKCRGWRMCITGCPYKKIYFNWKSGKSEKCIFCYPRIEAGQPTVCSETC
           210       220       230       240       250       260   

>>gi|1790635|gb|AAC77148.1| (AE000491) putative DEOR-typ  (251 aa)
 initn:  46 init1:  46 opt:  84  Z-score: 104.9  bits: 27.4 E():  2.1
Smith-Waterman score: 84;  22.078% identity (23.288% ungapped) in 77 aa overlap (99-171:119-195)

       70        80        90       100       110       120        
CYS1_D HKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRG
                                     :.:. ::.:.:: :.  .:.       ...
gi|179 QLVNPGESVVINCGSTAFLLGREMCGKPVQIITNYLPLANYLIDQEHDSVIIMGGQYNKS
       90       100       110       120       130       140        

      130       140           150       160       170       180    
CYS1_D AVTPVKNQGQCGSC----WSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEE
           .. ::. .:     : :..  .. .. . . . :....::....            
gi|179 QSITLSPQGSENSLYAGHWMFTSGKGLTAEGLYKTDMLTAMAEQKMLSVVGKLVVLVDSS
      150       160       170       180       190       200        

          190       200       210       220       230       240    
CYS1_D ACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKN
                                                                   
gi|179 KIGERAGMLFSRADQIDMLITGKNANPEILQQLEAQGVSILRV                 
      210       220       230       240       250                  

>>gi|1786590|gb|AAC73494.1| (AE000145) orf, hypothetical  (94 aa)
 initn:  37 init1:  37 opt:  78  Z-score: 104.8  bits: 25.9 E():  2.1
Smith-Waterman score: 78;  36.842% identity (43.750% ungapped) in 38 aa overlap (242-278:42-74)

             220       230       240       250       260       270 
CYS1_D SSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFY-
                                     :.. ::..: .    :     ::..:: : 
gi|178 VKSIGFSSSSTGRASVGVMVEGEYTFSTAEPEEMTVISGALNVLLP-----DATDWQVYE
              20        30        40        50             60      

              280       290       300       310       320       330
CYS1_D IGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKN
        :.::..:                                                    
gi|178 AGSVFNVPGHSEFHLQVAEPTSYLCRYL                                
         70        80        90                                    

>>gi|1790853|gb|AAC77345.1| (AE000509) soluble lytic mur  (654 aa)
 initn:  61 init1:  61 opt:  84  Z-score: 98.5  bits: 27.6 E():  4.8
Smith-Waterman score: 84;  32.692% identity (34.694% ungapped) in 52 aa overlap (104-152:104-155)

            80        90       100       110       120       130   
CYS1_D KFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPV
                                     :: :  :...:.: .    :::   : .: 
gi|179 YPYLEYRQITDDLMNQPAVTVTNFVRANPTLPPARTLQSRFVNELARREDWRGLLAFSPE
            80        90       100       110       120       130   

              140       150       160       170       180       190
CYS1_D K---NQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGC
       :   ...::.  ..  .::. :                                      
gi|179 KPGTTEAQCNYYYAKWNTGQSEEAWQGAKELWLTGKSQPNACDKLFSVWRASGKQDPLAY
           140       150       160       170       180       190   

>>gi|1789307|gb|AAC75975.1| (AE000377) biosynthetic argi  (658 aa)
 initn:  41 init1:  41 opt:  83  Z-score: 97.3  bits: 27.4 E():  5.6
Smith-Waterman score: 83;  23.913% identity (24.176% ungapped) in 92 aa overlap (178-268:315-406)

       150       160       170       180        190       200      
CYS1_D TGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEA-CDEGCNGGLQPNAYNYIIKNG
                                     ..::: ..  : . : ::.  : : :   :
gi|178 TGVRESARFYVELHKLGVNIQCFDVGGGLGVDYEGTRSQSDCSVNYGLNEYANNIIWAIG
          290       300       310       320       330       340    

        210       220       230       240       250       260      
CYS1_D GIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVE
           :.. :. .    .    .:.  . .::.  . .:: ..    .  .: :. .    
gi|178 DACEENGLPHPTVITESGRAVTAHHTVLVSNIIGVERNEYTVPTAPAEDAPRALQSMWET
          350       360       370       380       390       400    

        270       280       290       300       310       320      
CYS1_D WQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLR
       ::                                                          
gi|178 WQEMHEPGTRRSLREWLHDSQMDLHDIHIGYSSGIFSLQERAWAEQLYLSMCHEVQKQLD
          410       420       430       440       450       460    

>>gi|1788174|gb|AAC74937.1| (AE000280) orf, hypothetical  (199 aa)
 initn:  46 init1:  46 opt:  74  Z-score: 95.2  bits: 25.2 E():  7.4
Smith-Waterman score: 74;  43.750% identity (50.000% ungapped) in 32 aa overlap (308-335:110-141)

       280       290       300       310       320        330      
CYS1_D PCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRR-GKNT---CG
                                     :.: .::: .: .  . ::: : .:   ::
gi|178 PVDAPSPAKVLPENWWQHPAALGATDSDIEIIKRQWGAFYGTDLELQLRRRGIDTIVLCG
      80        90       100       110       120       130         

           340                                                     
CYS1_D VSNFVSTSII                                                  
       .:.                                                         
gi|178 ISTNIGVESTARNAWELGFNLVIAEDACSAASAEQHNNSINHIYPRIARVRSVEEILNAL
     140       150       160       170       180       190         

>>gi|1789138|gb|AAC75818.1| (AE000361) putative kinase [  (492 aa)
 initn:  36 init1:  36 opt:  79  Z-score: 94.7  bits: 26.5 E():  7.8
Smith-Waterman score: 84;  19.136% identity (21.233% ungapped) in 162 aa overlap (34-192:165-313)

            10        20        30        40        50        60   
CYS1_D ILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELN
                                     ::::     ...:   ..  . ::.:    
gi|178 GEFKDNIANYFGQWPVDYKSWAWSEDAAVMDKFNIP---RHMLFDVQMPGTVLGHITPQA
          140       150       160       170          180       190 

            70        80        90       100       110       120   
CYS1_D LIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFD
        .: .  :     :   .:   . .    :... :...    .: ... . . . :.:. 
gi|178 ALATHFPAGLPV-VCTTSDKPVEALGAGLLDDETAVISLGTYIALMMNGKALPKDPVAY-
             200        210       220       230       240          

           130          140       150       160       170       180
CYS1_D WRTRGAVTPV---KNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEY
       :   ...  .   .. :   . :. :   .. :. .:.. .  .:: ..:..    :.  
gi|178 WPIMSSIPQTLLYEGYGIRKGMWTVSWLRDMLGESLIQDARAQDLSPEDLLNKKASCVP-
     250       260       270       280       290       300         

              190       200       210       220       230       240
CYS1_D EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTM
               ::::                                                
gi|178 -------PGCNGLMTVLDWLTNPWEPYKRGIMIGFDSSMDYAWIYRSILESVALTLKNNY
             310       320       330       340       350       360 

>>gi|1789427|gb|AAC76084.1| (AE000386) orf, hypothetical  (354 aa)
 initn:  65 init1:  40 opt:  76  Z-score: 93.5  bits: 25.8 E():  9.1
Smith-Waterman score: 76;  22.619% identity (23.899% ungapped) in 168 aa overlap (141-303:81-244)

              120       130       140       150       160       170
CYS1_D DDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNL
                                     : :.      . :: : .::. .     : 
gi|178 GDKIWQSSEYFMNVFCNNALPGPSPGEEYPSAWANIMMLLASGQDFYNQNSYTFGVTYNG
               60        70        80        90       100       110

              180       190       200        210       220         
CYS1_D VDCDHECMEYEGEEACDEGCNGGLQPNAYNY-IIKNGGIQTESSYPYTAETGTQCNFNSA
       :: :       .  .: .  ..:   :.:.   . .:: . . :  . ...  :  .. :
gi|178 VDYDSTSPLPIAAPVCIDIKGAGTFGNGYKKPAVCSGGPEPQLSVTFPVRV--QLYIKLA
              120       130       140       150       160          

     230       240       250       260       270          280      
CYS1_D NIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDI---PCNPN-SLD
       . . :...  ..: .: .   .   .:  :: .:  .  : : :. .:    :  : .:.
gi|178 KNANKVNKKLVLP-DEYIALEFKGMSGAGAIEVDK-NLTFRIRGLNNIHVLDCFVNVDLE
      170       180        190       200        210       220      

         290       300       310       320       330       340     
CYS1_D HGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII  
        .  .: ..  :.   ::                                          
gi|178 PADGVVDFGKINSRTIKNTSVSETFSVVMTKDPGAACTEQFNILGSFFTTDILSDYSHLD
        230       240       250       260       270       280      



343 residues in 1 query   sequences
1358987 residues in 4289 library sequences
 Scomplib [33t08]
 start: Sat Dec  8 11:43:36 2001 done: Sat Dec  8 11:43:37 2001
 Scan time:  1.110 Display time:  0.090

Function used was FASTA [version 3.3t08 Jan. 17, 2001]