Example of using the program. 

Complexity decomposition of genetic texts by modified Lempel-Ziv method

 

The target sequences should be presented in FASTA-format. 

This example contains promoter sequence, 600 bp. Let us select 'whole file decomposition' and 'full report'.

Press "Execute" and you will see complexity decomposition of the text that looks like:

1 ([1:1],1,NW,'a')
2 ([1:1],1,I<,'t')
3 ([1:1],1,NW,'g')
4 ([1:3],1,D>,'g')
5 ([1:2],1,I<,'a')
6 ([1:3],1,D>,'g')
7 ([1:3],3,I<,'tcc')
10 ([1:2],1,D>,'t')
11 ([1:2],2,D>,'tg')
13 ([1:4],3,I<,'ctc')
16 ([1:10],2,D>,'tt')
18 ([1:10],2,I<,'aa')
20 ([1:9],2,D>,'ct')
22 ([1:19],3,D>,'act')
25 ([1:7],2,I<,'ga')
27 ([1:19],2,D>,'ac')
29 ([1:10],3,I<,'caa')
32 ([1:13],4,D>,'ctct')
36 ([1:8],2,D>,'cc')
38 ([1:8],1,D>,'c')
39 ([1:6],5,D>,'gtcct')
...
Counted items:146
Whole file complexity - 146

This means the number of components in decomposition (complexity) is 146.

All the components (repeated fragments are shown.

Change sliding parameters to analyze more. See also example of profile analysis.

 

---------------------------------------------------------------

Smart set of program parameters for DNA sequence analysis. 

To use complete menu please click here 

DNA   Standard alphabet {A,T,G,C}                       

Reduced DNA alphabets:   Weak/Strong  [AT][GC] Purine/Pyrimidine  [AG][TC]        

 

Input sequence(s) here (FASTA format or plain text) (cut & paste)

or from File:

Calculation method:

profile - complexity in sliding window (default)

Profile parameters: Sliding window size    Profile step (shift of sliding window)


Decompose whole file by repeated fragments (sequences joined)   

Decompose each sequences in file separately       

Complexity decomposition of sequence(s) by other sequence(s)

 (second file required, input it below) 

 

  Input sequences here (FASTA format or plain text) (cut & paste)

or from File:

 

    Copying operations while sequence generation using:

Repeats: Direct Copy     Symmetric Copy

Complementary repeats:  Invert Copy     Direct Complementary Copy

 

    Output parameters:

No decomposition report (only complexity value, by default)

full detailed report (all components of decomposition)

economic report  (selected long components only) Show sequence in frame in 'economic' report if its complexity <      

statistical report (statistics of components lengths)  

Table distance between repeats for statistical reports w=

(only statistics of long repeats (for complete decomposition)  and low complexity regions (in profile) will be shown) 

tandem report (statistics of tandem repeats in decomposition)

User-defined length of long repeats >

(for economic and statistical reports)

                

Help          Review of complexity analysis methods     Publications   

 

Recent findings by the program in eukaryotic promoter complexity,  

complete bacterial genomes complexity comparison