Complexity decomposition of genetic texts by modified Lempel-Ziv method

 

 Smart set of program parameters for DNA sequence analysis

To use complete menu please click here 

To calculate complexity profile only use  LZprofile program

To search low complexity regions by other methods (entropy estimations, linguistic complexity)

use new Low Complexity program

 

DNA   Standard alphabet {A,T,G,C}                       

Reduced DNA alphabets:   Weak/Strong  [AT][GC] Purine/Pyrimidine  [AG][TC]        

 

Input sequence(s) here (FASTA format or plain text) (cut & paste)

or from File:

Calculation method:

profile - complexity in sliding window (default)

Profile parameters: Sliding window size    Profile step (shift of sliding window)


Decompose whole file by repeated fragments (sequences joined)   

Decompose each sequences in file separately       

Complexity decomposition of sequence(s) by other sequence(s)

 (second file required, input it below) 

 

  Input sequences here (FASTA format or plain text) (cut & paste)

or from File:

 

    Copying operations while sequence generation using:

Repeats: Direct Copy     Symmetric Copy

Complementary repeats:  Invert Copy     Direct Complementary Copy

 

    Output parameters:

No decomposition report (only complexity value, by default)

full detailed report (all components of decomposition)

economic report  (selected long components only) Show sequence in frame in 'economic' report if its complexity <      

statistical report (statistics of components lengths)  

User-defined length of long repeats >

(for economic and statistical reports)

                

Help       Example      Review of complexity analysis methods     Publications   

 

Recent findings by the program in eukaryotic promoter complexity,  

complete bacterial genomes complexity comparison

 

Old realization of DNA-oriented  Lempel-Ziv algorithm by (Babenko et al., 1999): Complexity profile_builder

Related complexity analysis algorithms: Complexity by context tree source, SIMPLE, Linguistic Complexity, Transformation  Distance

Graphical presentation of sequence regularities: OligoRep system, Verbumculus, DeBruijn graphs

The Institute of Cytology and Genetics (Russia)

This resource has been developed in Institute of Cytology and Genetics, based on methods developed in Sobolev Institute of Mathematics, Novosibirsk, Russia 
Authors: Yu.L.Orlov, V.D.Gusev, L.A.Miroshnichenko(Nemytikova)
Contributors: S.V.Lavryushev, V.P.Filippov, D.A.Grigorovich     Leader: N.A.Kolchanov

The research was partially supported by the Russian Foundation for Basic Research (RFBR), INTAS, Ministry of Education (E02-6.0-250), NATO (LST.CLG 979815) and Siberian Branch of the Russian Academy of Sciences (Integration project No. 119).