Complexity decomposition of genetic texts by a modified Lempel-Ziv method


 Smart set of program parameters for DNA sequence analysis


To calculate only the complexity profile, use the LZprofile program.

To search for low-complexity regions by other methods (entropy estimates, linguistic complexity), use the new Low Complexity program.


DNA standard alphabet: {A,T,G,C}

Reduced DNA alphabets: Weak/Strong [AT][GC]; Purine/Pyrimidine [AG][TC]
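
The reduced alphabets above can be illustrated with a short sketch. It assumes the usual IUPAC letters (W/S for weak/strong, R/Y for purine/pyrimidine) as the two-symbol output; the function name `reduce_alphabet` and the choice of output letters are illustrative assumptions, not necessarily what the program itself uses.

```python
# Translation tables for the two reduced alphabets (output letters are an
# assumption: IUPAC codes W/S and R/Y).
WEAK_STRONG = str.maketrans("ATGC", "WWSS")        # [AT] -> W, [GC] -> S
PURINE_PYRIMIDINE = str.maketrans("AGTC", "RRYY")  # [AG] -> R, [TC] -> Y


def reduce_alphabet(seq, table):
    """Recode a DNA string over {A,T,G,C} into a two-letter alphabet."""
    return seq.upper().translate(table)


print(reduce_alphabet("ATGCGA", WEAK_STRONG))        # WWSSSW
print(reduce_alphabet("ATGCGA", PURINE_PYRIMIDINE))  # RYRYRR
```

Recoding before the analysis means that repeats are detected at the level of the chosen nucleotide classes rather than exact bases.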


Input sequence(s) here (FASTA format or plain text) (cut & paste)

or from File:

Calculation method:

Profile: complexity in a sliding window (default)

Profile parameters: sliding window size; profile step (shift of the sliding window)
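
How the window size and the profile step interact can be sketched as follows. This is a minimal illustration that uses a plain Lempel-Ziv (1976-style) component count with direct copies only, as a stand-in for the modified method of this resource; the names `lz_complexity` and `complexity_profile` are hypothetical.

```python
def lz_complexity(s):
    """Count components of a simple Lempel-Ziv parsing (direct copies only).

    Each component is the longest fragment that already occurs starting at
    an earlier position (overlap allowed), extended by one new symbol.
    """
    i, count, n = 0, 0, len(s)
    while i < n:
        length = 1
        # Grow the fragment while it can be copied from a start position < i.
        while i + length <= n and s.find(s[i:i + length], 0, i + length - 1) != -1:
            length += 1
        count += 1
        i += length
    return count


def complexity_profile(seq, window, step):
    """Return (window start, complexity) pairs for a sliding window."""
    return [(start, lz_complexity(seq[start:start + window]))
            for start in range(0, len(seq) - window + 1, step)]


for start, value in complexity_profile("ACGTACGTAAAAAAAA", window=8, step=4):
    print(start, value)
```

A smaller step gives a smoother profile at the cost of more windows to evaluate; windows covering a repetitive region receive lower values.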

Decompose the whole file by repeated fragments (sequences joined)

Decompose each sequence in the file separately

Complexity decomposition of sequence(s) by other sequence(s) (a second file is required; input it below)


  Input sequences here (FASTA format or plain text) (cut & paste)

or from File:


    Copying operations used during sequence generation:

Repeats: Direct Copy     Symmetric Copy

Complementary repeats:  Invert Copy     Direct Complementary Copy
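
The four copying operations can be pictured as four ways of reproducing an earlier fragment. The sketch below assumes the usual reading of these names (symmetric = reversed, inverted = reverse complement, direct complementary = complement without reversal); the function `copy_variants` is hypothetical and only illustrates that interpretation.

```python
# Watson-Crick complement table for the standard DNA alphabet.
COMPLEMENT = str.maketrans("ATGC", "TACG")


def copy_variants(fragment):
    """Return the fragment as each copy type would reproduce it (assumed meanings)."""
    complemented = fragment.translate(COMPLEMENT)
    return {
        "direct": fragment,                    # same text, same direction
        "symmetric": fragment[::-1],           # same text, read backwards
        "inverted": complemented[::-1],        # reverse complement
        "direct_complementary": complemented,  # complement, same direction
    }


for name, text in copy_variants("AACG").items():
    print(name, text)
```

Enabling more copy types lets a decomposition explain more of the sequence as copies, so the resulting complexity value can only stay the same or decrease.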


    Output parameters:

No decomposition report (only complexity value, by default)

Full detailed report (all components of the decomposition)

Economic report (selected long components only)

Show the sequence in a frame in the 'economic' report if its complexity <

Statistical report (statistics of component lengths)

User-defined length of long repeats > (for economic and statistical reports)


Help       Example      Review of complexity analysis methods     Publications   


Recent findings obtained with the program: eukaryotic promoter complexity, comparison of complete bacterial genome complexity


Earlier implementation of the DNA-oriented Lempel-Ziv algorithm (Babenko et al., 1999): Complexity profile_builder

Related complexity analysis algorithms: Complexity by context tree source, SIMPLE, Linguistic Complexity, Transformation  Distance

Graphical presentation of sequence regularities: OligoRep system, Verbumculus, De Bruijn graphs

The Institute of Cytology and Genetics (Russia)

This resource was developed at the Institute of Cytology and Genetics, based on methods developed at the Sobolev Institute of Mathematics, Novosibirsk, Russia.
Authors: Yu.L.Orlov, V.D.Gusev, L.A.Miroshnichenko (Nemytikova)
Contributors: S.V.Lavryushev, V.P.Filippov, D.A.Grigorovich
Leader: N.A.Kolchanov

The research was partially supported by the Russian Foundation for Basic Research (RFBR), INTAS, the Ministry of Education (E02-6.0-250), NATO (LST.CLG 979815) and the Siberian Branch of the Russian Academy of Sciences (Integration project No. 119).