Amino acid dipeptide frequency for Torque teno virus 26

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
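The counting behind such a table can be sketched as follows. This is a minimal illustration, not the code used by the database: it assumes overlapping pairs are counted within each protein, that any non-standard residue is mapped to X (Xaa), and that the colour threshold is the uniform expectation of 1/400. The names `dipeptide_permille` and `classify` and the toy sequences are hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
EXPECTED = 1000.0 / 400            # 2.5 per mille under the uniform assumption


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Pairs never span two proteins: each sequence is scanned separately.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_permille, expected=EXPECTED):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"    # would be marked red
    if freq_permille < expected:
        return "under"   # would be marked blue
    return "expected"


# Toy sequences for illustration only, not Torque teno virus 26 proteins.
freqs = dipeptide_permille(["MARRRGP", "MPPRR"])
print(freqs["RR"], classify(freqs["RR"]))
```

With the toy input, RR accounts for 3 of the 10 pairs, so it is classified as overrepresented; a dipeptide that never occurs has a frequency of 0 and falls below the expectation.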

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
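As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and into a percentage. The helper names are hypothetical; the ArgArg value is taken from the table.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


arg_arg = 35.454  # ArgArg frequency from the table, in per mille
print(permille_to_fraction(arg_arg))  # fraction of all dipeptides
print(permille_to_percent(arg_arg))   # the same value as a percentage
```

In the same units, the uniform expectation of 0.25% corresponds to 2.5‰.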

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.257AlaAla: 6.257 ± 11.365
0.0AlaCys: 0.0 ± 0.0
2.086AlaAsp: 2.086 ± 1.037
5.214AlaGlu: 5.214 ± 5.552
2.086AlaPhe: 2.086 ± 1.705
2.086AlaGly: 2.086 ± 1.037
1.043AlaHis: 1.043 ± 0.519
1.043AlaIle: 1.043 ± 0.519
2.086AlaLys: 2.086 ± 1.037
6.257AlaLeu: 6.257 ± 1.252
0.0AlaMet: 0.0 ± 0.0
0.0AlaAsn: 0.0 ± 0.0
3.128AlaPro: 3.128 ± 3.85
1.043AlaGln: 1.043 ± 2.154
4.171AlaArg: 4.171 ± 2.22
2.086AlaSer: 2.086 ± 1.037
4.171AlaThr: 4.171 ± 1.583
3.128AlaVal: 3.128 ± 1.308
3.128AlaTrp: 3.128 ± 1.308
1.043AlaTyr: 1.043 ± 0.519
0.0AlaXaa: 0.0 ± 0.0
Cys
1.043CysAla: 1.043 ± 0.519
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
1.043CysPhe: 1.043 ± 1.99
4.171CysGly: 4.171 ± 3.41
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.043CysLys: 1.043 ± 0.519
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
1.043CysAsn: 1.043 ± 0.519
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
2.086CysSer: 2.086 ± 1.713
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.128AspAla: 3.128 ± 3.85
0.0AspCys: 0.0 ± 0.0
3.128AspAsp: 3.128 ± 1.556
5.214AspGlu: 5.214 ± 2.593
2.086AspPhe: 2.086 ± 1.037
3.128AspGly: 3.128 ± 1.308
1.043AspHis: 1.043 ± 0.519
1.043AspIle: 1.043 ± 0.519
0.0AspLys: 0.0 ± 0.0
6.257AspLeu: 6.257 ± 3.112
0.0AspMet: 0.0 ± 0.0
0.0AspAsn: 0.0 ± 0.0
8.342AspPro: 8.342 ± 6.82
0.0AspGln: 0.0 ± 0.0
5.214AspArg: 5.214 ± 1.759
4.171AspSer: 4.171 ± 3.427
2.086AspThr: 2.086 ± 1.037
1.043AspVal: 1.043 ± 2.154
2.086AspTrp: 2.086 ± 1.705
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
5.214GluAla: 5.214 ± 1.726
0.0GluCys: 0.0 ± 0.0
3.128GluAsp: 3.128 ± 3.85
5.214GluGlu: 5.214 ± 5.552
1.043GluPhe: 1.043 ± 0.519
4.171GluGly: 4.171 ± 1.027
0.0GluHis: 0.0 ± 0.0
0.0GluIle: 0.0 ± 0.0
0.0GluLys: 0.0 ± 0.0
6.257GluLeu: 6.257 ± 3.568
1.043GluMet: 1.043 ± 0.519
5.214GluAsn: 5.214 ± 1.759
4.171GluPro: 4.171 ± 1.027
1.043GluGln: 1.043 ± 0.519
0.0GluArg: 0.0 ± 0.0
3.128GluSer: 3.128 ± 1.566
4.171GluThr: 4.171 ± 2.074
1.043GluVal: 1.043 ± 0.519
1.043GluTrp: 1.043 ± 0.519
1.043GluTyr: 1.043 ± 0.519
0.0GluXaa: 0.0 ± 0.0
Phe
1.043PheAla: 1.043 ± 0.519
3.128PheCys: 3.128 ± 1.308
1.043PheAsp: 1.043 ± 0.519
1.043PheGlu: 1.043 ± 0.519
0.0PhePhe: 0.0 ± 0.0
3.128PheGly: 3.128 ± 1.308
2.086PheHis: 2.086 ± 1.705
1.043PheIle: 1.043 ± 0.519
2.086PheLys: 2.086 ± 1.037
1.043PheLeu: 1.043 ± 2.154
1.043PheMet: 1.043 ± 1.468
2.086PheAsn: 2.086 ± 1.037
3.128PhePro: 3.128 ± 1.556
3.128PheGln: 3.128 ± 1.566
3.128PheArg: 3.128 ± 1.556
2.086PheSer: 2.086 ± 1.037
0.0PheThr: 0.0 ± 0.0
0.0PheVal: 0.0 ± 0.0
0.0PheTrp: 0.0 ± 0.0
2.086PheTyr: 2.086 ± 1.037
0.0PheXaa: 0.0 ± 0.0
Gly
3.128GlyAla: 3.128 ± 1.308
1.043GlyCys: 1.043 ± 2.154
4.171GlyAsp: 4.171 ± 6.002
1.043GlyGlu: 1.043 ± 0.519
1.043GlyPhe: 1.043 ± 0.519
13.556GlyGly: 13.556 ± 14.953
2.086GlyHis: 2.086 ± 1.037
2.086GlyIle: 2.086 ± 1.037
1.043GlyLys: 1.043 ± 0.519
3.128GlyLeu: 3.128 ± 3.85
2.086GlyMet: 2.086 ± 0.919
2.086GlyAsn: 2.086 ± 1.037
9.385GlyPro: 9.385 ± 11.551
3.128GlyGln: 3.128 ± 1.308
6.257GlyArg: 6.257 ± 2.055
4.171GlySer: 4.171 ± 1.583
5.214GlyThr: 5.214 ± 0.967
4.171GlyVal: 4.171 ± 1.027
2.086GlyTrp: 2.086 ± 1.037
3.128GlyTyr: 3.128 ± 1.556
0.0GlyXaa: 0.0 ± 0.0
His
1.043HisAla: 1.043 ± 2.154
0.0HisCys: 0.0 ± 0.0
1.043HisAsp: 1.043 ± 0.519
1.043HisGlu: 1.043 ± 0.519
1.043HisPhe: 1.043 ± 0.519
1.043HisGly: 1.043 ± 0.519
2.086HisHis: 2.086 ± 1.705
1.043HisIle: 1.043 ± 0.519
0.0HisLys: 0.0 ± 0.0
4.171HisLeu: 4.171 ± 1.027
0.0HisMet: 0.0 ± 0.0
1.043HisAsn: 1.043 ± 2.154
2.086HisPro: 2.086 ± 1.037
0.0HisGln: 0.0 ± 0.0
2.086HisArg: 2.086 ± 1.037
0.0HisSer: 0.0 ± 0.0
0.0HisThr: 0.0 ± 0.0
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.086IleAla: 2.086 ± 1.037
1.043IleCys: 1.043 ± 0.519
3.128IleAsp: 3.128 ± 1.556
0.0IleGlu: 0.0 ± 0.0
1.043IlePhe: 1.043 ± 1.99
0.0IleGly: 0.0 ± 0.0
0.0IleHis: 0.0 ± 0.0
0.0IleIle: 0.0 ± 0.0
4.171IleLys: 4.171 ± 2.074
2.086IleLeu: 2.086 ± 1.037
1.043IleMet: 1.043 ± 0.519
1.043IleAsn: 1.043 ± 0.519
2.086IlePro: 2.086 ± 1.037
1.043IleGln: 1.043 ± 0.519
0.0IleArg: 0.0 ± 0.0
3.128IleSer: 3.128 ± 1.556
3.128IleThr: 3.128 ± 1.556
2.086IleVal: 2.086 ± 1.037
0.0IleTrp: 0.0 ± 0.0
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
4.171LysAla: 4.171 ± 2.074
1.043LysCys: 1.043 ± 1.99
3.128LysAsp: 3.128 ± 1.556
1.043LysGlu: 1.043 ± 1.99
4.171LysPhe: 4.171 ± 2.074
0.0LysGly: 0.0 ± 0.0
0.0LysHis: 0.0 ± 0.0
3.128LysIle: 3.128 ± 1.566
6.257LysLys: 6.257 ± 3.112
5.214LysLeu: 5.214 ± 2.593
1.043LysMet: 1.043 ± 0.519
0.0LysAsn: 0.0 ± 0.0
5.214LysPro: 5.214 ± 3.241
2.086LysGln: 2.086 ± 1.037
6.257LysArg: 6.257 ± 3.131
2.086LysSer: 2.086 ± 1.713
3.128LysThr: 3.128 ± 1.556
1.043LysVal: 1.043 ± 0.519
1.043LysTrp: 1.043 ± 0.519
1.043LysTyr: 1.043 ± 0.519
0.0LysXaa: 0.0 ± 0.0
Leu
4.171LeuAla: 4.171 ± 4.601
0.0LeuCys: 0.0 ± 0.0
5.214LeuAsp: 5.214 ± 1.726
3.128LeuGlu: 3.128 ± 1.556
2.086LeuPhe: 2.086 ± 1.037
5.214LeuGly: 5.214 ± 2.593
0.0LeuHis: 0.0 ± 0.0
4.171LeuIle: 4.171 ± 2.074
5.214LeuLys: 5.214 ± 2.593
10.428LeuLeu: 10.428 ± 1.934
3.128LeuMet: 3.128 ± 1.442
1.043LeuAsn: 1.043 ± 0.519
8.342LeuPro: 8.342 ± 6.82
11.47LeuGln: 11.47 ± 4.235
7.299LeuArg: 7.299 ± 3.63
7.299LeuSer: 7.299 ± 0.833
4.171LeuThr: 4.171 ± 2.074
3.128LeuVal: 3.128 ± 1.556
4.171LeuTrp: 4.171 ± 2.074
3.128LeuTyr: 3.128 ± 1.566
0.0LeuXaa: 0.0 ± 0.0
Met
2.086MetAla: 2.086 ± 1.705
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
2.086MetPhe: 2.086 ± 1.037
1.043MetGly: 1.043 ± 0.519
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
2.086MetLys: 2.086 ± 1.713
4.171MetLeu: 4.171 ± 2.074
1.043MetMet: 1.043 ± 1.99
2.086MetAsn: 2.086 ± 1.037
1.043MetPro: 1.043 ± 0.519
0.0MetGln: 0.0 ± 0.0
2.086MetArg: 2.086 ± 1.037
1.043MetSer: 1.043 ± 0.519
3.128MetThr: 3.128 ± 1.556
1.043MetVal: 1.043 ± 0.519
0.0MetTrp: 0.0 ± 0.0
1.043MetTyr: 1.043 ± 0.519
0.0MetXaa: 0.0 ± 0.0
Asn
0.0AsnAla: 0.0 ± 0.0
1.043AsnCys: 1.043 ± 0.519
3.128AsnAsp: 3.128 ± 1.308
1.043AsnGlu: 1.043 ± 1.99
3.128AsnPhe: 3.128 ± 1.308
0.0AsnGly: 0.0 ± 0.0
1.043AsnHis: 1.043 ± 2.154
2.086AsnIle: 2.086 ± 1.037
3.128AsnLys: 3.128 ± 1.556
4.171AsnLeu: 4.171 ± 2.074
1.043AsnMet: 1.043 ± 0.519
3.128AsnAsn: 3.128 ± 1.556
3.128AsnPro: 3.128 ± 1.556
3.128AsnGln: 3.128 ± 1.556
4.171AsnArg: 4.171 ± 1.583
3.128AsnSer: 3.128 ± 1.566
2.086AsnThr: 2.086 ± 1.037
3.128AsnVal: 3.128 ± 1.556
1.043AsnTrp: 1.043 ± 0.519
1.043AsnTyr: 1.043 ± 0.519
0.0AsnXaa: 0.0 ± 0.0
Pro
5.214ProAla: 5.214 ± 2.995
2.086ProCys: 2.086 ± 1.713
4.171ProAsp: 4.171 ± 1.027
5.214ProGlu: 5.214 ± 0.967
1.043ProPhe: 1.043 ± 0.519
10.428ProGly: 10.428 ± 8.525
0.0ProHis: 0.0 ± 0.0
1.043ProIle: 1.043 ± 0.519
3.128ProLys: 3.128 ± 1.556
6.257ProLeu: 6.257 ± 1.252
3.128ProMet: 3.128 ± 1.556
1.043ProAsn: 1.043 ± 2.154
18.77ProPro: 18.77 ± 25.705
5.214ProGln: 5.214 ± 2.995
9.385ProArg: 9.385 ± 6.211
5.214ProSer: 5.214 ± 1.726
5.214ProThr: 5.214 ± 0.967
2.086ProVal: 2.086 ± 1.705
2.086ProTrp: 2.086 ± 1.037
2.086ProTyr: 2.086 ± 1.037
0.0ProXaa: 0.0 ± 0.0
Gln
3.128GlnAla: 3.128 ± 1.556
0.0GlnCys: 0.0 ± 0.0
0.0GlnAsp: 0.0 ± 0.0
2.086GlnGlu: 2.086 ± 1.037
1.043GlnPhe: 1.043 ± 0.519
2.086GlnGly: 2.086 ± 1.705
0.0GlnHis: 0.0 ± 0.0
1.043GlnIle: 1.043 ± 0.519
5.214GlnLys: 5.214 ± 3.241
4.171GlnLeu: 4.171 ± 2.074
1.043GlnMet: 1.043 ± 0.519
2.086GlnAsn: 2.086 ± 1.713
6.257GlnPro: 6.257 ± 1.163
6.257GlnGln: 6.257 ± 1.252
5.214GlnArg: 5.214 ± 0.967
3.128GlnSer: 3.128 ± 1.556
3.128GlnThr: 3.128 ± 1.566
0.0GlnVal: 0.0 ± 0.0
3.128GlnTrp: 3.128 ± 1.308
2.086GlnTyr: 2.086 ± 1.037
0.0GlnXaa: 0.0 ± 0.0
Arg
0.0ArgAla: 0.0 ± 0.0
0.0ArgCys: 0.0 ± 0.0
4.171ArgAsp: 4.171 ± 2.074
5.214ArgGlu: 5.214 ± 1.726
2.086ArgPhe: 2.086 ± 1.705
11.47ArgGly: 11.47 ± 3.651
2.086ArgHis: 2.086 ± 1.037
3.128ArgIle: 3.128 ± 1.556
5.214ArgLys: 5.214 ± 5.381
5.214ArgLeu: 5.214 ± 1.759
1.043ArgMet: 1.043 ± 0.519
5.214ArgAsn: 5.214 ± 2.593
2.086ArgPro: 2.086 ± 3.231
6.257ArgGln: 6.257 ± 3.131
35.454ArgArg: 35.454 ± 14.442
4.171ArgSer: 4.171 ± 1.583
3.128ArgThr: 3.128 ± 1.566
5.214ArgVal: 5.214 ± 1.759
6.257ArgTrp: 6.257 ± 3.112
4.171ArgTyr: 4.171 ± 2.074
0.0ArgXaa: 0.0 ± 0.0
Ser
2.086SerAla: 2.086 ± 1.713
0.0SerCys: 0.0 ± 0.0
2.086SerAsp: 2.086 ± 3.979
2.086SerGlu: 2.086 ± 1.713
1.043SerPhe: 1.043 ± 2.154
4.171SerGly: 4.171 ± 1.583
3.128SerHis: 3.128 ± 1.308
2.086SerIle: 2.086 ± 1.037
2.086SerLys: 2.086 ± 3.979
7.299SerLeu: 7.299 ± 3.63
3.128SerMet: 3.128 ± 1.556
5.214SerAsn: 5.214 ± 1.759
5.214SerPro: 5.214 ± 2.593
1.043SerGln: 1.043 ± 0.519
6.257SerArg: 6.257 ± 5.14
12.513SerSer: 12.513 ± 19.264
0.0SerThr: 0.0 ± 0.0
1.043SerVal: 1.043 ± 0.519
4.171SerTrp: 4.171 ± 2.074
1.043SerTyr: 1.043 ± 0.519
0.0SerXaa: 0.0 ± 0.0
Thr
1.043ThrAla: 1.043 ± 0.519
0.0ThrCys: 0.0 ± 0.0
3.128ThrAsp: 3.128 ± 1.556
4.171ThrGlu: 4.171 ± 2.074
0.0ThrPhe: 0.0 ± 0.0
2.086ThrGly: 2.086 ± 1.705
1.043ThrHis: 1.043 ± 0.519
1.043ThrIle: 1.043 ± 0.519
4.171ThrLys: 4.171 ± 2.074
8.342ThrLeu: 8.342 ± 2.845
2.086ThrMet: 2.086 ± 1.037
5.214ThrAsn: 5.214 ± 2.593
4.171ThrPro: 4.171 ± 1.583
1.043ThrGln: 1.043 ± 0.519
1.043ThrArg: 1.043 ± 1.99
1.043ThrSer: 1.043 ± 0.519
6.257ThrThr: 6.257 ± 2.055
3.128ThrVal: 3.128 ± 1.556
2.086ThrTrp: 2.086 ± 1.705
3.128ThrTyr: 3.128 ± 1.556
0.0ThrXaa: 0.0 ± 0.0
Val
2.086ValAla: 2.086 ± 1.705
1.043ValCys: 1.043 ± 0.519
1.043ValAsp: 1.043 ± 0.519
1.043ValGlu: 1.043 ± 2.154
2.086ValPhe: 2.086 ± 1.037
5.214ValGly: 5.214 ± 2.995
1.043ValHis: 1.043 ± 0.519
0.0ValIle: 0.0 ± 0.0
1.043ValLys: 1.043 ± 0.519
4.171ValLeu: 4.171 ± 2.074
0.0ValMet: 0.0 ± 0.0
2.086ValAsn: 2.086 ± 1.037
2.086ValPro: 2.086 ± 1.037
1.043ValGln: 1.043 ± 0.519
4.171ValArg: 4.171 ± 2.074
2.086ValSer: 2.086 ± 1.713
3.128ValThr: 3.128 ± 1.556
2.086ValVal: 2.086 ± 1.037
3.128ValTrp: 3.128 ± 1.556
0.0ValTyr: 0.0 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
2.086TrpAla: 2.086 ± 1.037
0.0TrpCys: 0.0 ± 0.0
3.128TrpAsp: 3.128 ± 1.308
3.128TrpGlu: 3.128 ± 1.308
1.043TrpPhe: 1.043 ± 0.519
1.043TrpGly: 1.043 ± 0.519
1.043TrpHis: 1.043 ± 0.519
2.086TrpIle: 2.086 ± 1.037
2.086TrpLys: 2.086 ± 1.037
3.128TrpLeu: 3.128 ± 1.556
1.043TrpMet: 1.043 ± 0.519
2.086TrpAsn: 2.086 ± 1.705
1.043TrpPro: 1.043 ± 0.519
2.086TrpGln: 2.086 ± 1.037
6.257TrpArg: 6.257 ± 1.163
1.043TrpSer: 1.043 ± 0.519
1.043TrpThr: 1.043 ± 0.519
0.0TrpVal: 0.0 ± 0.0
4.171TrpTrp: 4.171 ± 3.41
2.086TrpTyr: 2.086 ± 1.037
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.043TyrAla: 1.043 ± 0.519
0.0TyrCys: 0.0 ± 0.0
1.043TyrAsp: 1.043 ± 0.519
0.0TyrGlu: 0.0 ± 0.0
3.128TyrPhe: 3.128 ± 1.556
1.043TyrGly: 1.043 ± 0.519
0.0TyrHis: 0.0 ± 0.0
1.043TyrIle: 1.043 ± 0.519
1.043TyrLys: 1.043 ± 0.519
1.043TyrLeu: 1.043 ± 0.519
0.0TyrMet: 0.0 ± 0.0
2.086TyrAsn: 2.086 ± 1.037
3.128TyrPro: 3.128 ± 1.566
2.086TyrGln: 2.086 ± 1.037
3.128TyrArg: 3.128 ± 1.556
2.086TyrSer: 2.086 ± 1.037
1.043TyrThr: 1.043 ± 0.519
5.214TyrVal: 5.214 ± 2.593
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (960 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
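A protein-level bootstrap of this kind can be sketched as below. This is an illustration under stated assumptions, not the database's implementation: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 resampled sets, and the error is taken as the standard deviation of those estimates. The function names, the fixed seed and the toy sequences are hypothetical.

```python
import random
from collections import Counter
from statistics import pstdev


def frequency_permille(sequences, dipeptide):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency across bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(frequency_permille(sample, dipeptide))
    # Assumption: the reported error is a standard deviation.
    return pstdev(estimates)


# Toy sequences for illustration only, not Torque teno virus 26 proteins.
proteins = ["MARRRGP", "MPPRR", "MKLLQ"]
print(frequency_permille(proteins, "RR"), bootstrap_error(proteins, "RR"))
```

With only three proteins, each resample can differ strongly from the original set, which is consistent with errors in the table that are as large as, or larger than, the values themselves (for example ProPro: 18.77 ± 25.705).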

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski