The pattern must match at least
                     
                
                     Option in command line format: -C%/CM
 
                    
                        % of the sequences
                         sequences
                      
                 
             
            
                
                    Max pattern length
                     Option in command line format: -PL
G-G-[PS]-L-x(1,3)-R  has a length of 8 (1+1+1+1+3+1).
  
                 
                
                     
             
            
                
                    Max number of different pattern symbols
                     Option in command line format: -PN
G-G-[PS]-L-x(1,3)-R  has 4 different symbols: G, [PS], L and R .
  
                 
                
                     
             
            
                
                    Max length of wildcards (x)
                     Option in command line format: -PX
x  has a length of 1, x(7)  has a length of 7
and x(1,3)  has a length of 3.
  
                 
                
                     
             
            
                
                    Max number of flexible wildcards (x)
                     Option in command line format: -FN
A-x(2)-P-x-G-x(0,2)-D-x(3,5)-S  contains 2 flexible wildcards:
x(0,2)  and x(3,5) .
  
                 
                
                     
             
            
                
                    Max flexibility of wildcards (x)
                     Option in command line format: -FL
x(3)  has a flexibility of 0,
x(1,2)  has a flexibility of 1 and x(0,2)  has a flexibility of 2.
  
                 
                
                     
             
            
                
                    Max product of wildcard (x) flexibility
                     Option in command line format: -FP
(flexibility of wildcard_1 + 1) * ... * (flexibility of wildcard_n + 1) 
For instance, for C-x(2,4)-[DE]-x(10)-F  the product is
(2+1) * (0+1)  = 3
C-x(2,4)-[DE]-x(10,14)-F  the product is
(2+1) * (4+1)  = 15.
  
                 
                
                     
             
            
                
                    Maximum number of pattern symbols used in the initial search
                     Option in command line format: -BN
G 
followed by ambiguous symbols of amino acids sharing some physico-chemical properties like [DE] .
 
                 
                
                     
             
            
                
                    Pattern scoring method
                     Option in command line format: -S
    
        info : patterns are scored by their information content as defined in
        Jonassen, I., Collins, J. F. and Higgins, D. G. (1995) .
         
    
        mdl : this scoring method is derived from a Minimum Description Length (mdl) principle.
        This method is related to the 'info' scheme but the number of sequences matched is taken into account,
        i.e. patterns scoring few sequences are penalized in comparison with patterns scoring many.
     
  
                 
                
                    
                        info
                         mdl