Amino acid dipeptide frequency for Trichoderma harzianum bipartite mycovirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those which are underrepresented are marked in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
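
The calculation described above can be sketched as follows. This is only a minimal illustration, assuming that dipeptides are counted as overlapping residue pairs within each protein and that one-letter amino acid codes are used; the function name `dipeptide_per_mille` and the toy sequences are hypothetical and are not taken from the Proteome-pI pipeline.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: pairs are counted inside each protein and never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not the proteins of this proteome.
toy = ["MAALLA", "ALLK"]
freqs = dipeptide_per_mille(toy)

# Uniform expectation for 400 possible dipeptides: 1/400 = 0.25% = 2.5 per mille.
expected = 1000.0 / 400

# Dipeptides above the uniform expectation (the ones the table marks in red).
overrepresented = sorted(pair for pair, f in freqs.items() if f > expected)
```

With so few residues every observed pair exceeds 2.5‰, which is why the comparison is only informative for longer sequences.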

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.653 ± 8.849
AlaCys: 1.059 ± 0.55
AlaAsp: 4.237 ± 0.543
AlaGlu: 7.415 ± 0.537
AlaPhe: 2.119 ± 1.099
AlaGly: 5.297 ± 2.217
AlaHis: 4.237 ± 0.543
AlaIle: 3.178 ± 0.006
AlaLys: 3.178 ± 0.006
AlaLeu: 9.534 ± 4.983
AlaMet: 3.178 ± 0.006
AlaAsn: 2.119 ± 0.556
AlaPro: 3.178 ± 1.649
AlaGln: 8.475 ± 5.533
AlaArg: 4.237 ± 4.422
AlaSer: 4.237 ± 0.543
AlaThr: 5.297 ± 3.872
AlaVal: 5.297 ± 2.217
AlaTrp: 1.059 ± 1.105
AlaTyr: 3.178 ± 1.649
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 1.059 ± 0.55
CysGly: 2.119 ± 1.099
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 1.059 ± 0.55
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 1.059 ± 0.55
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 9.534 ± 0.018
AspCys: 0.0 ± 0.0
AspAsp: 4.237 ± 2.199
AspGlu: 3.178 ± 0.006
AspPhe: 1.059 ± 1.105
AspGly: 2.119 ± 1.099
AspHis: 0.0 ± 0.0
AspIle: 3.178 ± 0.006
AspLys: 3.178 ± 0.006
AspLeu: 5.297 ± 1.093
AspMet: 1.059 ± 0.55
AspAsn: 1.059 ± 0.55
AspPro: 5.297 ± 2.217
AspGln: 1.059 ± 1.105
AspArg: 7.415 ± 2.192
AspSer: 3.178 ± 0.006
AspThr: 0.0 ± 0.0
AspVal: 2.119 ± 1.099
AspTrp: 2.119 ± 0.556
AspTyr: 4.237 ± 0.543
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.178 ± 0.006
GluCys: 0.0 ± 0.0
GluAsp: 3.178 ± 0.006
GluGlu: 10.593 ± 1.124
GluPhe: 1.059 ± 0.55
GluGly: 2.119 ± 0.556
GluHis: 0.0 ± 0.0
GluIle: 6.356 ± 1.667
GluLys: 0.0 ± 0.0
GluLeu: 4.237 ± 2.767
GluMet: 6.356 ± 0.012
GluAsn: 3.178 ± 0.006
GluPro: 2.119 ± 2.211
GluGln: 3.178 ± 0.006
GluArg: 2.119 ± 0.556
GluSer: 4.237 ± 2.199
GluThr: 6.356 ± 1.643
GluVal: 4.237 ± 0.543
GluTrp: 1.059 ± 0.55
GluTyr: 7.415 ± 3.847
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 1.059 ± 0.55
PheAsp: 7.415 ± 2.192
PheGlu: 2.119 ± 0.556
PhePhe: 2.119 ± 1.099
PheGly: 2.119 ± 0.556
PheHis: 1.059 ± 0.55
PheIle: 0.0 ± 0.0
PheLys: 4.237 ± 2.199
PheLeu: 3.178 ± 0.006
PheMet: 0.0 ± 0.0
PheAsn: 1.059 ± 0.55
PhePro: 3.178 ± 1.649
PheGln: 3.178 ± 1.649
PheArg: 2.119 ± 1.099
PheSer: 1.059 ± 1.105
PheThr: 0.0 ± 0.0
PheVal: 4.237 ± 0.543
PheTrp: 0.0 ± 0.0
PheTyr: 1.059 ± 0.55
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.415 ± 2.773
GlyCys: 0.0 ± 0.0
GlyAsp: 4.237 ± 1.112
GlyGlu: 5.297 ± 0.562
GlyPhe: 1.059 ± 0.55
GlyGly: 5.297 ± 0.562
GlyHis: 2.119 ± 1.099
GlyIle: 2.119 ± 1.099
GlyLys: 6.356 ± 0.012
GlyLeu: 4.237 ± 0.543
GlyMet: 1.059 ± 1.105
GlyAsn: 3.178 ± 1.649
GlyPro: 2.119 ± 0.556
GlyGln: 1.059 ± 0.55
GlyArg: 9.534 ± 1.673
GlySer: 1.059 ± 0.55
GlyThr: 2.119 ± 0.556
GlyVal: 3.178 ± 1.649
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.059 ± 1.105
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.059 ± 1.105
HisCys: 1.059 ± 0.55
HisAsp: 1.059 ± 0.55
HisGlu: 1.059 ± 0.55
HisPhe: 1.059 ± 0.55
HisGly: 2.119 ± 1.099
HisHis: 1.059 ± 0.55
HisIle: 1.059 ± 0.55
HisLys: 0.0 ± 0.0
HisLeu: 1.059 ± 0.55
HisMet: 0.0 ± 0.0
HisAsn: 2.119 ± 1.099
HisPro: 1.059 ± 0.55
HisGln: 1.059 ± 0.55
HisArg: 0.0 ± 0.0
HisSer: 1.059 ± 0.55
HisThr: 4.237 ± 2.199
HisVal: 2.119 ± 0.556
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.237 ± 0.543
IleCys: 0.0 ± 0.0
IleAsp: 2.119 ± 0.556
IleGlu: 3.178 ± 1.649
IlePhe: 1.059 ± 0.55
IleGly: 5.297 ± 2.217
IleHis: 0.0 ± 0.0
IleIle: 3.178 ± 0.006
IleLys: 0.0 ± 0.0
IleLeu: 5.297 ± 2.748
IleMet: 1.059 ± 1.105
IleAsn: 4.237 ± 1.112
IlePro: 6.356 ± 1.667
IleGln: 3.178 ± 1.661
IleArg: 1.059 ± 0.55
IleSer: 1.059 ± 0.55
IleThr: 2.119 ± 1.099
IleVal: 3.178 ± 0.006
IleTrp: 0.0 ± 0.0
IleTyr: 1.059 ± 0.55
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.237 ± 2.767
LysCys: 0.0 ± 0.0
LysAsp: 2.119 ± 2.211
LysGlu: 2.119 ± 0.556
LysPhe: 2.119 ± 1.099
LysGly: 1.059 ± 0.55
LysHis: 2.119 ± 1.099
LysIle: 2.119 ± 2.211
LysLys: 5.297 ± 2.217
LysLeu: 6.356 ± 1.643
LysMet: 1.059 ± 1.105
LysAsn: 1.059 ± 1.105
LysPro: 5.297 ± 2.748
LysGln: 1.059 ± 0.55
LysArg: 4.237 ± 0.543
LysSer: 1.059 ± 0.55
LysThr: 3.178 ± 0.006
LysVal: 1.059 ± 0.55
LysTrp: 0.0 ± 0.0
LysTyr: 3.178 ± 0.006
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.415 ± 4.428
LeuCys: 1.059 ± 0.55
LeuAsp: 5.297 ± 0.562
LeuGlu: 2.119 ± 1.099
LeuPhe: 4.237 ± 2.199
LeuGly: 6.356 ± 3.298
LeuHis: 2.119 ± 1.099
LeuIle: 5.297 ± 0.562
LeuLys: 2.119 ± 0.556
LeuLeu: 18.008 ± 2.724
LeuMet: 1.059 ± 0.55
LeuAsn: 5.297 ± 1.093
LeuPro: 4.237 ± 1.112
LeuGln: 3.178 ± 0.006
LeuArg: 8.475 ± 2.223
LeuSer: 4.237 ± 2.199
LeuThr: 5.297 ± 1.093
LeuVal: 6.356 ± 0.012
LeuTrp: 0.0 ± 0.0
LeuTyr: 2.119 ± 0.556
LeuXaa: 0.0 ± 0.0
Met
MetAla: 7.415 ± 2.773
MetCys: 0.0 ± 0.0
MetAsp: 2.119 ± 0.556
MetGlu: 1.059 ± 1.105
MetPhe: 0.0 ± 0.0
MetGly: 1.059 ± 1.105
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 1.059 ± 0.55
MetMet: 1.059 ± 0.55
MetAsn: 1.059 ± 0.55
MetPro: 1.059 ± 0.55
MetGln: 3.178 ± 1.661
MetArg: 0.0 ± 0.0
MetSer: 3.178 ± 1.649
MetThr: 2.119 ± 0.556
MetVal: 2.119 ± 1.099
MetTrp: 1.059 ± 0.55
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.119 ± 1.099
AsnCys: 0.0 ± 0.0
AsnAsp: 1.059 ± 1.105
AsnGlu: 0.0 ± 0.0
AsnPhe: 3.178 ± 0.006
AsnGly: 1.059 ± 0.55
AsnHis: 0.0 ± 0.0
AsnIle: 4.237 ± 1.112
AsnLys: 1.059 ± 0.55
AsnLeu: 1.059 ± 0.55
AsnMet: 1.059 ± 0.55
AsnAsn: 1.059 ± 0.55
AsnPro: 5.297 ± 1.093
AsnGln: 1.059 ± 0.55
AsnArg: 3.178 ± 3.316
AsnSer: 2.119 ± 2.211
AsnThr: 4.237 ± 0.543
AsnVal: 1.059 ± 0.55
AsnTrp: 0.0 ± 0.0
AsnTyr: 5.297 ± 0.562
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.356 ± 0.012
ProCys: 0.0 ± 0.0
ProAsp: 3.178 ± 1.649
ProGlu: 8.475 ± 2.742
ProPhe: 1.059 ± 0.55
ProGly: 2.119 ± 1.099
ProHis: 1.059 ± 0.55
ProIle: 5.297 ± 1.093
ProLys: 3.178 ± 1.661
ProLeu: 4.237 ± 1.112
ProMet: 1.059 ± 1.105
ProAsn: 3.178 ± 1.661
ProPro: 5.297 ± 0.562
ProGln: 4.237 ± 1.112
ProArg: 2.119 ± 2.211
ProSer: 3.178 ± 0.006
ProThr: 2.119 ± 1.099
ProVal: 4.237 ± 1.112
ProTrp: 0.0 ± 0.0
ProTyr: 3.178 ± 0.006
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.119 ± 1.099
GlnCys: 0.0 ± 0.0
GlnAsp: 0.0 ± 0.0
GlnGlu: 2.119 ± 2.211
GlnPhe: 5.297 ± 1.093
GlnGly: 5.297 ± 2.217
GlnHis: 1.059 ± 0.55
GlnIle: 0.0 ± 0.0
GlnLys: 1.059 ± 0.55
GlnLeu: 1.059 ± 1.105
GlnMet: 2.119 ± 1.383
GlnAsn: 0.0 ± 0.0
GlnPro: 2.119 ± 2.211
GlnGln: 1.059 ± 1.105
GlnArg: 3.178 ± 1.661
GlnSer: 3.178 ± 0.006
GlnThr: 3.178 ± 1.661
GlnVal: 1.059 ± 0.55
GlnTrp: 1.059 ± 0.55
GlnTyr: 1.059 ± 0.55
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.356 ± 3.322
ArgCys: 0.0 ± 0.0
ArgAsp: 7.415 ± 1.118
ArgGlu: 5.297 ± 2.217
ArgPhe: 2.119 ± 1.099
ArgGly: 3.178 ± 0.006
ArgHis: 1.059 ± 0.55
ArgIle: 6.356 ± 1.643
ArgLys: 3.178 ± 0.006
ArgLeu: 8.475 ± 2.223
ArgMet: 3.178 ± 0.006
ArgAsn: 2.119 ± 1.099
ArgPro: 2.119 ± 0.556
ArgGln: 0.0 ± 0.0
ArgArg: 7.415 ± 0.537
ArgSer: 5.297 ± 3.872
ArgThr: 4.237 ± 1.112
ArgVal: 5.297 ± 1.093
ArgTrp: 1.059 ± 0.55
ArgTyr: 1.059 ± 0.55
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.237 ± 0.543
SerCys: 0.0 ± 0.0
SerAsp: 1.059 ± 0.55
SerGlu: 0.0 ± 0.0
SerPhe: 3.178 ± 1.661
SerGly: 3.178 ± 1.649
SerHis: 6.356 ± 1.643
SerIle: 3.178 ± 1.649
SerLys: 3.178 ± 0.006
SerLeu: 6.356 ± 1.643
SerMet: 0.0 ± 0.0
SerAsn: 1.059 ± 1.105
SerPro: 2.119 ± 2.211
SerGln: 1.059 ± 0.55
SerArg: 3.178 ± 0.006
SerSer: 3.178 ± 1.649
SerThr: 2.119 ± 0.556
SerVal: 4.237 ± 1.112
SerTrp: 1.059 ± 0.55
SerTyr: 0.0 ± 0.0
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.356 ± 1.667
ThrCys: 1.059 ± 0.55
ThrAsp: 4.237 ± 0.543
ThrGlu: 5.297 ± 2.748
ThrPhe: 2.119 ± 1.099
ThrGly: 4.237 ± 2.767
ThrHis: 0.0 ± 0.0
ThrIle: 1.059 ± 1.105
ThrLys: 2.119 ± 0.556
ThrLeu: 4.237 ± 2.199
ThrMet: 1.059 ± 0.55
ThrAsn: 3.178 ± 3.316
ThrPro: 5.297 ± 1.093
ThrGln: 1.059 ± 1.105
ThrArg: 4.237 ± 2.199
ThrSer: 3.178 ± 1.649
ThrThr: 3.178 ± 1.649
ThrVal: 2.119 ± 0.556
ThrTrp: 3.178 ± 1.649
ThrTyr: 3.178 ± 0.006
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.356 ± 3.322
ValCys: 0.0 ± 0.0
ValAsp: 4.237 ± 2.199
ValGlu: 9.534 ± 0.018
ValPhe: 0.0 ± 0.0
ValGly: 6.356 ± 0.012
ValHis: 0.0 ± 0.0
ValIle: 0.0 ± 0.0
ValLys: 5.297 ± 1.093
ValLeu: 2.119 ± 0.556
ValMet: 1.059 ± 0.55
ValAsn: 2.119 ± 0.556
ValPro: 2.119 ± 1.099
ValGln: 0.0 ± 0.0
ValArg: 8.475 ± 1.087
ValSer: 1.059 ± 0.55
ValThr: 3.178 ± 0.006
ValVal: 4.237 ± 0.543
ValTrp: 0.0 ± 0.0
ValTyr: 1.059 ± 0.55
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.059 ± 0.55
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.059 ± 1.105
TrpPhe: 2.119 ± 1.099
TrpGly: 1.059 ± 1.105
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 3.178 ± 1.649
TrpMet: 1.059 ± 0.55
TrpAsn: 0.0 ± 0.0
TrpPro: 1.059 ± 0.55
TrpGln: 0.0 ± 0.0
TrpArg: 1.059 ± 0.55
TrpSer: 0.0 ± 0.0
TrpThr: 1.059 ± 0.55
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.059 ± 0.55
TyrCys: 0.0 ± 0.0
TyrAsp: 2.119 ± 1.099
TyrGlu: 1.059 ± 0.55
TyrPhe: 3.178 ± 0.006
TyrGly: 1.059 ± 0.55
TyrHis: 0.0 ± 0.0
TyrIle: 1.059 ± 0.55
TyrLys: 5.297 ± 2.217
TyrLeu: 4.237 ± 2.199
TyrMet: 0.0 ± 0.0
TyrAsn: 1.059 ± 0.55
TyrPro: 4.237 ± 0.543
TyrGln: 0.0 ± 0.0
TyrArg: 3.178 ± 0.006
TyrSer: 3.178 ± 1.661
TyrThr: 5.297 ± 2.748
TyrVal: 1.059 ± 0.55
TyrTrp: 1.059 ± 0.55
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (945 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
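
One way such a protein-level bootstrap could look is sketched below. The page does not spell out the exact procedure, so this assumes that whole proteins are resampled with replacement 100 times and that the error is the standard deviation of the resampled frequencies; the function names, the fixed seed and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency of one dipeptide in per mille, counting overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)


# Hypothetical toy proteome, not the two proteins behind the table above.
toy = ["MAALLA", "ALLK"]
error_ll = bootstrap_error(toy, "LL")
error_ww = bootstrap_error(toy, "WW")  # never observed, so the error is zero
```

With only two proteins each resample is one of very few distinct combinations, so an error estimated this way is coarse.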

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski