Amino acid dipeptide frequency for Curvularia thermal tolerance virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
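As a minimal sketch of how such a frequency could be computed, the Python snippet below counts overlapping pairs of adjacent residues and reports them in per mille. The function name `dipeptide_per_mille`, the example sequences, and the choice to count pairs only within each protein are assumptions made for illustration; the exact procedure used for this page is not described here.

```python
from collections import Counter

# Three-letter codes for the 20 standard amino acids; anything else maps to Xaa.
THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Pairs are counted within each protein only (an assumption of this sketch).
    """
    counts = Counter()
    for seq in proteins:
        codes = [THREE_LETTER.get(res, "Xaa") for res in seq.upper()]
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not taken from this proteome.
freqs = dipeptide_per_mille(["MAAL", "AAK"])
print(freqs["AlaAla"])  # 400.0 (2 of the 5 counted pairs)
```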

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
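The arithmetic behind the expected frequency and the per mille unit can be sketched as follows. The helper names and the simple three-way classification are assumptions for illustration; the exact threshold the page uses for its red and blue marking is not stated.

```python
N_STANDARD = 20   # standard amino acids -> 400 dipeptides
N_WITH_XAA = 21   # with Xaa as a 21st symbol -> 441 dipeptides


def expected_per_mille(alphabet_size):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(observed_per_mille, alphabet_size=N_STANDARD):
    """Compare an observed value with the uniform expectation."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


print(expected_per_mille(N_STANDARD))   # 2.5 per mille, i.e. 0.25%
print(classify(11.295))                 # over  (AlaAla in the table)
print(classify(0.869))                  # under (AlaTyr in the table)
```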

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.295 ± 2.988
AlaCys: 0.0 ± 0.0
AlaAsp: 5.213 ± 1.998
AlaGlu: 9.557 ± 2.998
AlaPhe: 1.738 ± 1.445
AlaGly: 6.082 ± 1.576
AlaHis: 0.0 ± 0.0
AlaIle: 3.475 ± 1.037
AlaLys: 2.606 ± 2.361
AlaLeu: 5.213 ± 0.983
AlaMet: 2.606 ± 1.443
AlaAsn: 4.344 ± 1.002
AlaPro: 5.213 ± 1.78
AlaGln: 7.819 ± 2.683
AlaArg: 5.213 ± 2.386
AlaSer: 5.213 ± 2.574
AlaThr: 6.95 ± 3.762
AlaVal: 4.344 ± 0.637
AlaTrp: 2.606 ± 1.409
AlaTyr: 0.869 ± 0.722
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 0.869 ± 0.722
CysMet: 0.0 ± 0.0
CysAsn: 0.869 ± 1.028
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 1.738 ± 1.445
CysSer: 0.0 ± 0.0
CysThr: 0.869 ± 0.744
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.869 ± 0.744
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.688 ± 2.592
AspCys: 0.0 ± 0.0
AspAsp: 2.606 ± 2.233
AspGlu: 5.213 ± 1.998
AspPhe: 3.475 ± 2.034
AspGly: 2.606 ± 0.54
AspHis: 0.869 ± 0.787
AspIle: 0.869 ± 1.028
AspLys: 4.344 ± 1.76
AspLeu: 7.819 ± 1.226
AspMet: 3.475 ± 0.936
AspAsn: 2.606 ± 0.54
AspPro: 3.475 ± 2.066
AspGln: 1.738 ± 0.865
AspArg: 4.344 ± 1.784
AspSer: 2.606 ± 0.888
AspThr: 1.738 ± 1.574
AspVal: 2.606 ± 0.54
AspTrp: 2.606 ± 1.411
AspTyr: 1.738 ± 1.488
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.213 ± 1.661
GluCys: 0.0 ± 0.0
GluAsp: 1.738 ± 0.865
GluGlu: 6.95 ± 2.473
GluPhe: 2.606 ± 1.487
GluGly: 6.95 ± 2.774
GluHis: 3.475 ± 1.79
GluIle: 1.738 ± 1.143
GluLys: 3.475 ± 1.037
GluLeu: 3.475 ± 2.889
GluMet: 4.344 ± 1.407
GluAsn: 0.869 ± 0.787
GluPro: 4.344 ± 0.884
GluGln: 4.344 ± 1.903
GluArg: 6.082 ± 1.265
GluSer: 4.344 ± 1.549
GluThr: 3.475 ± 2.066
GluVal: 7.819 ± 4.986
GluTrp: 1.738 ± 1.055
GluTyr: 4.344 ± 1.796
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.344 ± 1.583
PheCys: 0.0 ± 0.0
PheAsp: 1.738 ± 0.698
PheGlu: 0.869 ± 0.722
PhePhe: 0.0 ± 0.0
PheGly: 1.738 ± 0.867
PheHis: 0.0 ± 0.0
PheIle: 0.869 ± 1.028
PheLys: 3.475 ± 1.343
PheLeu: 3.475 ± 1.396
PheMet: 0.869 ± 0.744
PheAsn: 0.0 ± 0.0
PhePro: 0.869 ± 0.722
PheGln: 2.606 ± 1.411
PheArg: 1.738 ± 1.143
PheSer: 0.869 ± 0.787
PheThr: 3.475 ± 1.111
PheVal: 3.475 ± 2.889
PheTrp: 0.0 ± 0.0
PheTyr: 0.869 ± 0.722
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.606 ± 2.361
GlyCys: 0.0 ± 0.0
GlyAsp: 3.475 ± 1.853
GlyGlu: 2.606 ± 0.888
GlyPhe: 1.738 ± 1.143
GlyGly: 5.213 ± 1.73
GlyHis: 2.606 ± 0.54
GlyIle: 4.344 ± 2.792
GlyLys: 5.213 ± 1.78
GlyLeu: 6.082 ± 2.793
GlyMet: 3.475 ± 2.73
GlyAsn: 4.344 ± 1.796
GlyPro: 1.738 ± 1.178
GlyGln: 1.738 ± 1.574
GlyArg: 6.082 ± 1.877
GlySer: 3.475 ± 0.537
GlyThr: 8.688 ± 2.064
GlyVal: 4.344 ± 2.294
GlyTrp: 0.869 ± 0.787
GlyTyr: 1.738 ± 1.143
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.869 ± 0.787
HisCys: 0.0 ± 0.0
HisAsp: 0.869 ± 0.722
HisGlu: 1.738 ± 1.055
HisPhe: 0.869 ± 0.787
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.869 ± 0.787
HisLeu: 0.869 ± 0.722
HisMet: 0.0 ± 0.0
HisAsn: 0.869 ± 0.744
HisPro: 0.869 ± 0.744
HisGln: 0.0 ± 0.0
HisArg: 1.738 ± 1.445
HisSer: 1.738 ± 0.865
HisThr: 0.869 ± 0.722
HisVal: 0.869 ± 0.787
HisTrp: 0.869 ± 0.787
HisTyr: 1.738 ± 2.057
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.738 ± 1.055
IleCys: 2.606 ± 1.409
IleAsp: 2.606 ± 1.102
IleGlu: 4.344 ± 0.946
IlePhe: 0.0 ± 0.0
IleGly: 3.475 ± 2.206
IleHis: 0.0 ± 0.0
IleIle: 0.869 ± 0.787
IleLys: 1.738 ± 0.867
IleLeu: 1.738 ± 1.445
IleMet: 0.869 ± 0.722
IleAsn: 2.606 ± 1.301
IlePro: 3.475 ± 0.537
IleGln: 2.606 ± 1.183
IleArg: 1.738 ± 0.867
IleSer: 2.606 ± 1.487
IleThr: 0.869 ± 0.744
IleVal: 0.869 ± 0.744
IleTrp: 0.0 ± 0.0
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 9.557 ± 2.974
LysCys: 0.0 ± 0.0
LysAsp: 2.606 ± 1.301
LysGlu: 3.475 ± 0.537
LysPhe: 1.738 ± 1.445
LysGly: 0.869 ± 0.787
LysHis: 0.0 ± 0.0
LysIle: 2.606 ± 1.183
LysLys: 3.475 ± 2.206
LysLeu: 5.213 ± 1.73
LysMet: 1.738 ± 1.178
LysAsn: 1.738 ± 0.698
LysPro: 0.0 ± 0.0
LysGln: 0.869 ± 0.787
LysArg: 5.213 ± 1.406
LysSer: 0.869 ± 0.744
LysThr: 2.606 ± 1.183
LysVal: 1.738 ± 0.698
LysTrp: 0.869 ± 0.744
LysTyr: 2.606 ± 0.54
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.213 ± 1.73
LeuCys: 0.869 ± 1.028
LeuAsp: 4.344 ± 2.26
LeuGlu: 6.95 ± 1.073
LeuPhe: 1.738 ± 1.143
LeuGly: 7.819 ± 1.142
LeuHis: 0.0 ± 0.0
LeuIle: 2.606 ± 1.409
LeuLys: 4.344 ± 3.612
LeuLeu: 11.295 ± 1.642
LeuMet: 1.738 ± 0.865
LeuAsn: 2.606 ± 1.183
LeuPro: 5.213 ± 1.78
LeuGln: 6.082 ± 3.48
LeuArg: 6.082 ± 1.232
LeuSer: 8.688 ± 1.273
LeuThr: 3.475 ± 0.936
LeuVal: 7.819 ± 2.695
LeuTrp: 2.606 ± 2.043
LeuTyr: 4.344 ± 1.209
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.738 ± 1.488
MetCys: 0.869 ± 0.744
MetAsp: 3.475 ± 2.066
MetGlu: 0.0 ± 0.0
MetPhe: 1.738 ± 0.867
MetGly: 1.738 ± 1.055
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 1.738 ± 1.574
MetLeu: 4.344 ± 1.222
MetMet: 0.0 ± 0.0
MetAsn: 0.869 ± 0.722
MetPro: 2.606 ± 0.54
MetGln: 0.0 ± 0.0
MetArg: 4.344 ± 1.222
MetSer: 2.606 ± 0.54
MetThr: 2.606 ± 1.102
MetVal: 0.869 ± 0.744
MetTrp: 0.869 ± 0.787
MetTyr: 0.869 ± 1.028
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.738 ± 1.178
AsnCys: 0.0 ± 0.0
AsnAsp: 2.606 ± 1.411
AsnGlu: 2.606 ± 0.888
AsnPhe: 2.606 ± 1.409
AsnGly: 0.869 ± 0.787
AsnHis: 0.869 ± 0.722
AsnIle: 2.606 ± 0.54
AsnLys: 0.869 ± 0.787
AsnLeu: 1.738 ± 0.865
AsnMet: 1.738 ± 0.698
AsnAsn: 0.0 ± 0.0
AsnPro: 3.475 ± 0.537
AsnGln: 0.0 ± 0.0
AsnArg: 0.869 ± 0.787
AsnSer: 0.0 ± 0.0
AsnThr: 1.738 ± 0.865
AsnVal: 0.869 ± 0.787
AsnTrp: 1.738 ± 1.055
AsnTyr: 0.869 ± 0.744
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.475 ± 1.305
ProCys: 0.0 ± 0.0
ProAsp: 2.606 ± 1.409
ProGlu: 6.95 ± 2.074
ProPhe: 1.738 ± 0.865
ProGly: 0.869 ± 1.028
ProHis: 0.869 ± 0.722
ProIle: 3.475 ± 1.831
ProLys: 0.0 ± 0.0
ProLeu: 5.213 ± 1.616
ProMet: 2.606 ± 0.888
ProAsn: 0.869 ± 0.744
ProPro: 5.213 ± 1.406
ProGln: 3.475 ± 1.734
ProArg: 3.475 ± 1.098
ProSer: 5.213 ± 1.586
ProThr: 3.475 ± 1.305
ProVal: 5.213 ± 0.827
ProTrp: 0.869 ± 0.722
ProTyr: 3.475 ± 1.343
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.475 ± 2.034
GlnCys: 0.0 ± 0.0
GlnAsp: 3.475 ± 2.401
GlnGlu: 1.738 ± 1.178
GlnPhe: 2.606 ± 1.409
GlnGly: 3.475 ± 1.734
GlnHis: 1.738 ± 0.698
GlnIle: 2.606 ± 1.409
GlnLys: 0.869 ± 0.744
GlnLeu: 4.344 ± 1.582
GlnMet: 0.869 ± 0.722
GlnAsn: 0.869 ± 1.028
GlnPro: 3.475 ± 1.037
GlnGln: 0.869 ± 0.722
GlnArg: 1.738 ± 0.867
GlnSer: 1.738 ± 0.865
GlnThr: 3.475 ± 2.206
GlnVal: 2.606 ± 1.443
GlnTrp: 0.869 ± 0.744
GlnTyr: 1.738 ± 0.865
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.95 ± 3.116
ArgCys: 0.0 ± 0.0
ArgAsp: 8.688 ± 1.826
ArgGlu: 7.819 ± 1.332
ArgPhe: 4.344 ± 1.784
ArgGly: 5.213 ± 2.365
ArgHis: 0.0 ± 0.0
ArgIle: 0.0 ± 0.0
ArgLys: 3.475 ± 1.111
ArgLeu: 6.95 ± 3.46
ArgMet: 3.475 ± 1.466
ArgAsn: 0.869 ± 0.722
ArgPro: 4.344 ± 1.209
ArgGln: 0.869 ± 0.744
ArgArg: 10.426 ± 2.969
ArgSer: 2.606 ± 2.361
ArgThr: 3.475 ± 1.853
ArgVal: 3.475 ± 1.853
ArgTrp: 0.0 ± 0.0
ArgTyr: 0.869 ± 0.722
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.344 ± 2.039
SerCys: 0.0 ± 0.0
SerAsp: 3.475 ± 1.343
SerGlu: 3.475 ± 1.79
SerPhe: 1.738 ± 0.867
SerGly: 7.819 ± 1.142
SerHis: 1.738 ± 0.865
SerIle: 0.869 ± 0.744
SerLys: 1.738 ± 0.698
SerLeu: 5.213 ± 0.983
SerMet: 0.0 ± 0.0
SerAsn: 2.606 ± 1.102
SerPro: 5.213 ± 2.051
SerGln: 0.869 ± 0.722
SerArg: 3.475 ± 2.206
SerSer: 6.082 ± 3.664
SerThr: 1.738 ± 1.445
SerVal: 3.475 ± 1.305
SerTrp: 2.606 ± 1.017
SerTyr: 0.869 ± 1.028
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.213 ± 1.78
ThrCys: 0.0 ± 0.0
ThrAsp: 6.082 ± 3.639
ThrGlu: 5.213 ± 2.545
ThrPhe: 1.738 ± 1.445
ThrGly: 6.082 ± 2.556
ThrHis: 1.738 ± 1.143
ThrIle: 3.475 ± 2.034
ThrLys: 3.475 ± 0.936
ThrLeu: 5.213 ± 1.616
ThrMet: 0.0 ± 0.0
ThrAsn: 0.0 ± 0.0
ThrPro: 3.475 ± 1.396
ThrGln: 3.475 ± 2.087
ThrArg: 3.475 ± 2.066
ThrSer: 2.606 ± 1.409
ThrThr: 5.213 ± 1.661
ThrVal: 1.738 ± 1.143
ThrTrp: 0.869 ± 0.744
ThrTyr: 3.475 ± 1.537
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.819 ± 1.142
ValCys: 0.0 ± 0.0
ValAsp: 2.606 ± 1.301
ValGlu: 5.213 ± 1.406
ValPhe: 0.0 ± 0.0
ValGly: 6.082 ± 1.232
ValHis: 0.869 ± 0.787
ValIle: 3.475 ± 1.305
ValLys: 2.606 ± 1.487
ValLeu: 6.082 ± 2.458
ValMet: 1.738 ± 1.407
ValAsn: 0.869 ± 1.028
ValPro: 2.606 ± 1.954
ValGln: 2.606 ± 0.54
ValArg: 2.606 ± 2.167
ValSer: 2.606 ± 1.102
ValThr: 2.606 ± 0.54
ValVal: 5.213 ± 2.521
ValTrp: 1.738 ± 1.143
ValTyr: 0.869 ± 0.722
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.738 ± 0.698
TrpCys: 0.0 ± 0.0
TrpAsp: 3.475 ± 0.936
TrpGlu: 0.869 ± 1.028
TrpPhe: 0.0 ± 0.0
TrpGly: 0.869 ± 1.028
TrpHis: 0.0 ± 0.0
TrpIle: 0.869 ± 1.028
TrpLys: 0.869 ± 0.744
TrpLeu: 4.344 ± 2.26
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.869 ± 0.744
TrpGln: 0.869 ± 0.787
TrpArg: 1.738 ± 1.488
TrpSer: 1.738 ± 1.143
TrpThr: 2.606 ± 1.954
TrpVal: 0.869 ± 1.028
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.738 ± 0.698
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.344 ± 1.222
TyrCys: 0.869 ± 0.722
TyrAsp: 1.738 ± 1.445
TyrGlu: 1.738 ± 1.488
TyrPhe: 0.869 ± 0.744
TyrGly: 3.475 ± 1.466
TyrHis: 0.869 ± 0.722
TyrIle: 0.0 ± 0.0
TyrLys: 2.606 ± 1.301
TyrLeu: 4.344 ± 2.717
TyrMet: 0.869 ± 0.645
TyrAsn: 0.0 ± 0.0
TyrPro: 2.606 ± 1.443
TyrGln: 1.738 ± 0.698
TyrArg: 1.738 ± 1.488
TyrSer: 1.738 ± 0.865
TyrThr: 2.606 ± 1.954
TyrVal: 0.0 ± 0.0
TyrTrp: 1.738 ± 1.055
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
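For readers who want to work with the listing above programmatically, here is a minimal parsing sketch. It assumes each entry ends in the form `AlaAla: 11.295 ± 2.988` and also tolerates a leading repeat of the value, as found in some text exports of the table. The names `ENTRY` and `parse_entry` are hypothetical.

```python
import re

# One table entry: optional repeated value, six-letter dipeptide, value ± error.
ENTRY = re.compile(
    r"^(?:\d+(?:\.\d+)?)?"                    # optional duplicated leading value
    r"(?P<pair>(?:[A-Z][a-z]{2}){2}): "       # e.g. AlaAla
    r"(?P<value>\d+(?:\.\d+)?) ± (?P<error>\d+(?:\.\d+)?)$"
)


def parse_entry(line):
    """Return (dipeptide, per_mille, error) or None for non-entry lines."""
    match = ENTRY.match(line.strip())
    if match is None:
        return None
    return (
        match.group("pair"),
        float(match.group("value")),
        float(match.group("error")),
    )


print(parse_entry("AlaAla: 11.295 ± 2.988"))  # ('AlaAla', 11.295, 2.988)
print(parse_entry("Ala"))                      # None (row label, not an entry)
```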
Statistics based on 4 proteins (1152 amino acids)
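With so few residues, every value in the table corresponds to a small whole number of occurrences. The nonzero values appear to be integer multiples of roughly 0.869‰, which is what a single occurrence among about 1151 counted pairs would give (1/1151 ≈ 0.869‰). The page does not state the denominator, so `TOTAL_PAIRS = 1151` below is an inference from the numbers, and `estimate_count` is a hypothetical helper.

```python
# Assumption: inferred from the 0.869 per mille step size, not stated by the source.
TOTAL_PAIRS = 1151


def estimate_count(per_mille, total_pairs=TOTAL_PAIRS):
    """Convert a per mille frequency back to an approximate occurrence count."""
    return round(per_mille / 1000.0 * total_pairs)


for value in (0.869, 2.606, 11.295):
    print(value, "->", estimate_count(value))
```

Under this assumption the most frequent dipeptides (AlaAla and LeuLeu at 11.295‰) would each occur about 13 times, which helps explain why the bootstrap errors are large relative to the values.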

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
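A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is taken as the error. This is only an illustration under stated assumptions. The function names, the use of one-letter codes, the fixed seed, the sample standard deviation, and the toy sequences are all hypothetical, and the exact procedure behind the table is not described on this page.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Per mille frequency of one two-letter pair (e.g. "AA") over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome of 4 short sequences.
toy = ["MAAL", "AAKAA", "GGSG", "LLAAV"]
print(pair_per_mille(toy, "AA"), "±", bootstrap_error(toy, "AA"))
```

With only four proteins, each resample can differ substantially from the original set, so wide error bars are to be expected.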

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski