Amino acid dipeptide frequency for Torque teno virus (isolate Human/Ghana/GH1/1996) (TTV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and all underrepresented ones in blue, in the table.
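The calculation described above can be sketched in Python. This is only an illustration under stated assumptions, not the code used by Proteome-pI: the function names are hypothetical, sequences are given in one-letter codes (the table uses three-letter codes), any non-standard letter is mapped to X (Xaa), and dipeptides are counted as overlapping windows within each protein, never across protein boundaries. Results are expressed in per mille, so the uniform expectation of 0.25% becomes 2.5.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"          # X (Xaa) collects non-standard or unknown residues

# Uniform expectation: 1 in 400 standard dipeptides, i.e. 0.25% or 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X.
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping windows of length 2; pairs never span two proteins here.
        for first, second in zip(residues, residues[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(permille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if permille > EXPECTED_PERMILLE:
        return "over"
    if permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"


if __name__ == "__main__":
    toy = ["MARRRTTW", "MLLGG"]  # made-up sequences, not the TTV proteins
    freqs = dipeptide_permille(toy)
    for pair in ("RR", "TT", "CC"):
        print(pair, round(freqs[pair], 1), classify(freqs[pair]))
```

Under this classification, a value such as ArgArg (31.762) would count as overrepresented and any 0.0 entry as underrepresented; whether the database applies exactly this threshold for its red and blue marking is an assumption.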

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions; for example, 9.221‰ corresponds to 0.009221 (0.9221%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.221 ± 8.735
AlaCys: 1.025 ± 4.047
AlaAsp: 4.098 ± 3.343
AlaGlu: 3.074 ± 3.73
AlaPhe: 0.0 ± 0.0
AlaGly: 5.123 ± 5.396
AlaHis: 1.025 ± 2.073
AlaIle: 2.049 ± 1.041
AlaLys: 4.098 ± 2.083
AlaLeu: 4.098 ± 3.343
AlaMet: 1.025 ± 2.073
AlaAsn: 1.025 ± 0.521
AlaPro: 8.197 ± 7.987
AlaGln: 2.049 ± 1.041
AlaArg: 4.098 ± 1.19
AlaSer: 3.074 ± 1.355
AlaThr: 3.074 ± 1.562
AlaVal: 4.098 ± 5.798
AlaTrp: 0.0 ± 0.0
AlaTyr: 4.098 ± 2.083
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.025 ± 0.521
CysCys: 0.0 ± 0.0
CysAsp: 1.025 ± 0.521
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 2.049 ± 1.672
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 2.049 ± 3.737
CysLeu: 1.025 ± 0.521
CysMet: 0.0 ± 0.0
CysAsn: 1.025 ± 0.521
CysPro: 1.025 ± 2.073
CysGln: 0.0 ± 0.0
CysArg: 1.025 ± 0.521
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.123 ± 7.869
AspCys: 0.0 ± 0.0
AspAsp: 1.025 ± 0.521
AspGlu: 2.049 ± 1.041
AspPhe: 2.049 ± 1.041
AspGly: 0.0 ± 0.0
AspHis: 1.025 ± 2.073
AspIle: 4.098 ± 2.083
AspLys: 4.098 ± 2.083
AspLeu: 6.148 ± 1.486
AspMet: 2.049 ± 1.041
AspAsn: 0.0 ± 0.0
AspPro: 4.098 ± 3.282
AspGln: 2.049 ± 1.672
AspArg: 3.074 ± 4.228
AspSer: 3.074 ± 1.562
AspThr: 4.098 ± 3.282
AspVal: 2.049 ± 1.041
AspTrp: 2.049 ± 3.737
AspTyr: 1.025 ± 0.521
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.074 ± 4.228
GluCys: 0.0 ± 0.0
GluAsp: 4.098 ± 1.19
GluGlu: 5.123 ± 3.337
GluPhe: 1.025 ± 0.521
GluGly: 4.098 ± 3.343
GluHis: 0.0 ± 0.0
GluIle: 0.0 ± 0.0
GluLys: 2.049 ± 1.041
GluLeu: 3.074 ± 1.562
GluMet: 0.0 ± 0.0
GluAsn: 2.049 ± 3.737
GluPro: 0.0 ± 0.0
GluGln: 1.025 ± 0.521
GluArg: 0.0 ± 0.0
GluSer: 2.049 ± 1.041
GluThr: 4.098 ± 1.19
GluVal: 2.049 ± 1.041
GluTrp: 0.0 ± 0.0
GluTyr: 2.049 ± 3.737
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.025 ± 0.521
PheCys: 1.025 ± 0.521
PheAsp: 0.0 ± 0.0
PheGlu: 1.025 ± 0.521
PhePhe: 1.025 ± 0.521
PheGly: 5.123 ± 3.16
PheHis: 1.025 ± 0.521
PheIle: 3.074 ± 1.355
PheLys: 3.074 ± 3.478
PheLeu: 1.025 ± 0.521
PheMet: 1.025 ± 0.521
PheAsn: 2.049 ± 1.672
PhePro: 1.025 ± 0.521
PheGln: 3.074 ± 3.478
PheArg: 1.025 ± 0.521
PheSer: 1.025 ± 0.521
PheThr: 6.148 ± 3.124
PheVal: 2.049 ± 1.041
PheTrp: 0.0 ± 0.0
PheTyr: 1.025 ± 0.521
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.123 ± 5.396
GlyCys: 1.025 ± 2.073
GlyAsp: 5.123 ± 4.997
GlyGlu: 5.123 ± 1.242
GlyPhe: 2.049 ± 3.737
GlyGly: 12.295 ± 7.658
GlyHis: 0.0 ± 0.0
GlyIle: 0.0 ± 0.0
GlyLys: 2.049 ± 1.041
GlyLeu: 2.049 ± 1.041
GlyMet: 3.074 ± 1.562
GlyAsn: 7.172 ± 3.17
GlyPro: 5.123 ± 2.998
GlyGln: 0.0 ± 0.0
GlyArg: 6.148 ± 1.486
GlySer: 4.098 ± 2.083
GlyThr: 3.074 ± 1.562
GlyVal: 2.049 ± 1.041
GlyTrp: 1.025 ± 0.521
GlyTyr: 3.074 ± 1.562
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.049 ± 1.672
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 1.025 ± 2.073
HisGly: 2.049 ± 1.672
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 1.025 ± 2.073
HisMet: 1.025 ± 0.521
HisAsn: 0.0 ± 0.0
HisPro: 2.049 ± 1.041
HisGln: 1.025 ± 0.521
HisArg: 0.0 ± 0.0
HisSer: 3.074 ± 1.562
HisThr: 2.049 ± 1.041
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 2.049 ± 1.672
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.049 ± 1.041
IleCys: 1.025 ± 0.521
IleAsp: 1.025 ± 0.521
IleGlu: 0.0 ± 0.0
IlePhe: 1.025 ± 0.521
IleGly: 1.025 ± 2.073
IleHis: 2.049 ± 1.041
IleIle: 3.074 ± 1.562
IleLys: 5.123 ± 2.603
IleLeu: 3.074 ± 1.562
IleMet: 0.0 ± 0.0
IleAsn: 2.049 ± 1.041
IlePro: 3.074 ± 1.562
IleGln: 3.074 ± 1.562
IleArg: 4.098 ± 2.083
IleSer: 4.098 ± 2.083
IleThr: 1.025 ± 0.521
IleVal: 3.074 ± 1.355
IleTrp: 2.049 ± 1.041
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.148 ± 1.486
LysCys: 1.025 ± 0.521
LysAsp: 1.025 ± 0.521
LysGlu: 4.098 ± 7.474
LysPhe: 2.049 ± 1.041
LysGly: 2.049 ± 1.041
LysHis: 1.025 ± 0.521
LysIle: 6.148 ± 3.124
LysLys: 6.148 ± 1.486
LysLeu: 4.098 ± 3.773
LysMet: 2.049 ± 1.041
LysAsn: 1.025 ± 0.521
LysPro: 2.049 ± 3.737
LysGln: 1.025 ± 0.521
LysArg: 6.148 ± 1.486
LysSer: 2.049 ± 1.041
LysThr: 0.0 ± 0.0
LysVal: 2.049 ± 1.041
LysTrp: 3.074 ± 1.562
LysTyr: 4.098 ± 3.282
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.123 ± 2.998
LeuCys: 2.049 ± 1.041
LeuAsp: 4.098 ± 1.19
LeuGlu: 1.025 ± 0.521
LeuPhe: 4.098 ± 2.083
LeuGly: 6.148 ± 3.122
LeuHis: 1.025 ± 0.521
LeuIle: 4.098 ± 1.19
LeuLys: 3.074 ± 1.562
LeuLeu: 13.32 ± 3.547
LeuMet: 0.0 ± 0.0
LeuAsn: 3.074 ± 1.355
LeuPro: 6.148 ± 5.015
LeuGln: 3.074 ± 1.562
LeuArg: 6.148 ± 4.483
LeuSer: 4.098 ± 3.343
LeuThr: 5.123 ± 2.603
LeuVal: 1.025 ± 0.521
LeuTrp: 2.049 ± 1.041
LeuTyr: 4.098 ± 2.083
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.049 ± 1.672
MetCys: 1.025 ± 2.073
MetAsp: 0.0 ± 0.0
MetGlu: 0.0 ± 0.0
MetPhe: 3.074 ± 4.228
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 1.025 ± 0.521
MetLeu: 2.049 ± 1.041
MetMet: 0.0 ± 0.0
MetAsn: 3.074 ± 1.562
MetPro: 2.049 ± 1.041
MetGln: 2.049 ± 1.041
MetArg: 1.025 ± 0.521
MetSer: 0.0 ± 0.0
MetThr: 1.025 ± 0.521
MetVal: 1.025 ± 0.521
MetTrp: 1.025 ± 2.073
MetTyr: 1.025 ± 0.521
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.049 ± 1.041
AsnCys: 0.0 ± 0.0
AsnAsp: 3.074 ± 4.228
AsnGlu: 1.025 ± 0.521
AsnPhe: 3.074 ± 3.478
AsnGly: 1.025 ± 0.521
AsnHis: 1.025 ± 2.073
AsnIle: 3.074 ± 1.562
AsnLys: 0.0 ± 0.0
AsnLeu: 4.098 ± 3.773
AsnMet: 2.049 ± 1.041
AsnAsn: 0.0 ± 0.0
AsnPro: 6.148 ± 2.929
AsnGln: 4.098 ± 3.282
AsnArg: 1.025 ± 0.521
AsnSer: 1.025 ± 0.521
AsnThr: 6.148 ± 3.124
AsnVal: 2.049 ± 1.041
AsnTrp: 1.025 ± 2.073
AsnTyr: 6.148 ± 3.124
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.074 ± 3.73
ProCys: 1.025 ± 0.521
ProAsp: 3.074 ± 1.355
ProGlu: 3.074 ± 3.73
ProPhe: 8.197 ± 4.165
ProGly: 9.221 ± 2.376
ProHis: 0.0 ± 0.0
ProIle: 1.025 ± 0.521
ProLys: 5.123 ± 7.201
ProLeu: 3.074 ± 1.355
ProMet: 2.049 ± 1.812
ProAsn: 1.025 ± 0.521
ProPro: 8.197 ± 7.987
ProGln: 4.098 ± 3.343
ProArg: 5.123 ± 3.337
ProSer: 3.074 ± 3.478
ProThr: 4.098 ± 2.083
ProVal: 3.074 ± 1.562
ProTrp: 4.098 ± 3.773
ProTyr: 3.074 ± 7.772
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.0 ± 0.0
GlnCys: 0.0 ± 0.0
GlnAsp: 1.025 ± 0.521
GlnGlu: 3.074 ± 1.562
GlnPhe: 0.0 ± 0.0
GlnGly: 0.0 ± 0.0
GlnHis: 0.0 ± 0.0
GlnIle: 2.049 ± 1.041
GlnLys: 3.074 ± 1.355
GlnLeu: 4.098 ± 2.083
GlnMet: 0.0 ± 2.797
GlnAsn: 5.123 ± 1.242
GlnPro: 2.049 ± 1.041
GlnGln: 11.27 ± 3.698
GlnArg: 4.098 ± 1.19
GlnSer: 4.098 ± 2.083
GlnThr: 3.074 ± 1.562
GlnVal: 3.074 ± 1.562
GlnTrp: 2.049 ± 1.672
GlnTyr: 3.074 ± 4.228
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.123 ± 5.396
ArgCys: 1.025 ± 0.521
ArgAsp: 2.049 ± 3.737
ArgGlu: 0.0 ± 0.0
ArgPhe: 2.049 ± 1.041
ArgGly: 7.172 ± 3.17
ArgHis: 2.049 ± 1.672
ArgIle: 2.049 ± 1.041
ArgLys: 5.123 ± 2.998
ArgLeu: 3.074 ± 1.355
ArgMet: 2.049 ± 1.041
ArgAsn: 5.123 ± 2.998
ArgPro: 5.123 ± 3.16
ArgGln: 3.074 ± 1.562
ArgArg: 31.762 ± 13.968
ArgSer: 3.074 ± 1.562
ArgThr: 2.049 ± 1.672
ArgVal: 4.098 ± 3.282
ArgTrp: 6.148 ± 3.124
ArgTyr: 2.049 ± 1.041
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 1.025 ± 0.521
SerCys: 0.0 ± 0.0
SerAsp: 2.049 ± 1.041
SerGlu: 1.025 ± 0.521
SerPhe: 0.0 ± 0.0
SerGly: 4.098 ± 2.083
SerHis: 1.025 ± 2.073
SerIle: 4.098 ± 2.083
SerLys: 3.074 ± 1.562
SerLeu: 8.197 ± 2.273
SerMet: 1.025 ± 0.521
SerAsn: 4.098 ± 3.282
SerPro: 4.098 ± 2.083
SerGln: 4.098 ± 2.083
SerArg: 2.049 ± 1.041
SerSer: 8.197 ± 3.461
SerThr: 1.025 ± 0.521
SerVal: 3.074 ± 3.73
SerTrp: 1.025 ± 0.521
SerTyr: 1.025 ± 0.521
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.098 ± 2.083
ThrCys: 0.0 ± 0.0
ThrAsp: 9.221 ± 4.686
ThrGlu: 3.074 ± 1.562
ThrPhe: 3.074 ± 1.562
ThrGly: 1.025 ± 0.521
ThrHis: 2.049 ± 1.041
ThrIle: 1.025 ± 0.521
ThrLys: 3.074 ± 1.562
ThrLeu: 4.098 ± 2.083
ThrMet: 2.049 ± 1.304
ThrAsn: 2.049 ± 1.041
ThrPro: 3.074 ± 4.228
ThrGln: 3.074 ± 1.355
ThrArg: 1.025 ± 0.521
ThrSer: 1.025 ± 0.521
ThrThr: 19.467 ± 7.76
ThrVal: 5.123 ± 2.603
ThrTrp: 2.049 ± 1.041
ThrTyr: 2.049 ± 1.041
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.074 ± 3.73
ValCys: 0.0 ± 0.0
ValAsp: 3.074 ± 1.562
ValGlu: 1.025 ± 0.521
ValPhe: 0.0 ± 0.0
ValGly: 2.049 ± 1.041
ValHis: 1.025 ± 0.521
ValIle: 0.0 ± 0.0
ValLys: 2.049 ± 1.041
ValLeu: 7.172 ± 2.496
ValMet: 1.025 ± 2.073
ValAsn: 3.074 ± 3.478
ValPro: 4.098 ± 2.083
ValGln: 2.049 ± 1.041
ValArg: 4.098 ± 1.19
ValSer: 3.074 ± 1.562
ValThr: 1.025 ± 0.521
ValVal: 3.074 ± 1.562
ValTrp: 0.0 ± 0.0
ValTyr: 3.074 ± 1.562
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.049 ± 3.737
TrpCys: 0.0 ± 0.0
TrpAsp: 2.049 ± 3.737
TrpGlu: 1.025 ± 0.521
TrpPhe: 0.0 ± 0.0
TrpGly: 2.049 ± 1.041
TrpHis: 0.0 ± 0.0
TrpIle: 3.074 ± 1.562
TrpLys: 2.049 ± 3.737
TrpLeu: 2.049 ± 1.041
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 1.025 ± 2.073
TrpGln: 2.049 ± 1.672
TrpArg: 6.148 ± 3.124
TrpSer: 0.0 ± 0.0
TrpThr: 2.049 ± 1.672
TrpVal: 0.0 ± 0.0
TrpTrp: 1.025 ± 0.521
TrpTyr: 4.098 ± 1.19
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.049 ± 1.041
TyrCys: 0.0 ± 0.0
TyrAsp: 2.049 ± 1.041
TyrGlu: 1.025 ± 4.047
TyrPhe: 2.049 ± 1.041
TyrGly: 3.074 ± 1.562
TyrHis: 3.074 ± 1.562
TyrIle: 3.074 ± 1.562
TyrLys: 1.025 ± 0.521
TyrLeu: 2.049 ± 1.672
TyrMet: 0.0 ± 0.0
TyrAsn: 5.123 ± 3.16
TyrPro: 6.148 ± 3.122
TyrGln: 0.0 ± 0.0
TyrArg: 6.148 ± 2.929
TyrSer: 4.098 ± 1.19
TyrThr: 3.074 ± 1.562
TyrVal: 1.025 ± 0.521
TyrTrp: 2.049 ± 3.737
TyrTyr: 1.025 ± 4.047
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (977 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
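A protein-level bootstrap of this kind can be sketched as follows. This is a hedged illustration, not the database's actual procedure: the function names are hypothetical, sequences are assumed to be in one-letter codes, "×100" is read as 100 resamples, and the error is taken to be the sample standard deviation across resamples, which the page does not specify.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over overlapping windows of length 2."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency across protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the proteome size fixed.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(resample, pair))
    # Assumption: the reported error is the sample standard deviation.
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MARRRTTW", "MLLGG", "MRRKK"]  # made-up sequences, not the TTV proteins
    print(pair_permille(toy, "RR"), bootstrap_error(toy, "RR"))
```

Because whole proteins are resampled, a proteome of only 3 proteins offers few distinct resamples, which is one plausible reason why many errors in the table are comparable in size to the values themselves.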

You can download the dipeptide statistics above (among other statistics for this proteome) from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski