Amino acid dipeptide frequency for Torque teno virus 28

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
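As a rough illustration of the calculation described above, the following Python sketch counts overlapping residue pairs and compares one frequency with the uniform expectation. The function names, the toy sequences, and the choice to count pairs within each protein only (never across protein boundaries) are assumptions made for this example; the database's actual procedure is not described here.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Assumption: pairs are counted within each protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_per_mille, n_dipeptides=400):
    """Compare a frequency with the uniform expectation (1000 / n per mille)."""
    expected = 1000.0 / n_dipeptides  # 2.5 per mille for 400 dipeptides
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Hypothetical toy sequences, not the real Torque teno virus 28 proteome.
toy = ["MRRRPA", "AAGRR"]
freqs = dipeptide_per_mille(toy)
print(round(freqs["RR"], 1), classify(freqs["RR"]))
```

The returned dictionary always has 441 keys (21 x 21), so dipeptides that never occur appear with a frequency of 0.0, as in the table.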

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
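The unit conversion can be checked with a few lines of Python. This is only a sketch: the helper name is made up for the example, and the ArgArg value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


arg_arg = per_mille_to_fraction(26.125)  # ArgArg frequency from the table
uniform = 1 / 400                        # uniform expectation for 400 dipeptides
ratio = arg_arg / uniform                # how many times above expectation

print(round(arg_arg, 6), round(ratio, 2))
```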

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency (‰) ± error, grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 10.16 ± 7.695
AlaCys: 0.0 ± 0.0
AlaAsp: 0.726 ± 0.458
AlaGlu: 2.903 ± 2.263
AlaPhe: 0.0 ± 0.0
AlaGly: 6.531 ± 2.312
AlaHis: 0.726 ± 0.867
AlaIle: 2.177 ± 1.763
AlaLys: 3.628 ± 1.911
AlaLeu: 2.903 ± 1.35
AlaMet: 0.726 ± 0.458
AlaAsn: 0.726 ± 0.458
AlaPro: 4.354 ± 3.526
AlaGln: 5.08 ± 1.1
AlaArg: 2.177 ± 0.675
AlaSer: 6.531 ± 1.903
AlaThr: 6.531 ± 1.63
AlaVal: 2.903 ± 1.094
AlaTrp: 2.177 ± 1.375
AlaTyr: 3.628 ± 1.911
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.726 ± 0.458
CysGly: 3.628 ± 0.977
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 1.451 ± 0.773
CysLeu: 0.726 ± 0.867
CysMet: 0.726 ± 0.458
CysAsn: 0.0 ± 0.0
CysPro: 0.726 ± 0.458
CysGln: 1.451 ± 0.693
CysArg: 0.0 ± 0.0
CysSer: 3.628 ± 1.911
CysThr: 0.726 ± 0.458
CysVal: 2.903 ± 1.35
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.903 ± 1.35
AspCys: 0.726 ± 0.867
AspAsp: 4.354 ± 0.714
AspGlu: 0.726 ± 0.458
AspPhe: 0.0 ± 0.0
AspGly: 0.0 ± 0.0
AspHis: 0.0 ± 0.0
AspIle: 4.354 ± 0.921
AspLys: 1.451 ± 0.916
AspLeu: 4.354 ± 1.447
AspMet: 3.628 ± 1.071
AspAsn: 0.0 ± 0.0
AspPro: 8.708 ± 1.992
AspGln: 0.726 ± 0.458
AspArg: 0.0 ± 0.0
AspSer: 4.354 ± 1.852
AspThr: 2.177 ± 1.763
AspVal: 4.354 ± 1.518
AspTrp: 0.726 ± 0.908
AspTyr: 2.177 ± 1.375
AspXaa: 0.0 ± 0.0

Glu
GluAla: 2.177 ± 0.889
GluCys: 0.726 ± 0.867
GluAsp: 3.628 ± 0.977
GluGlu: 10.885 ± 2.932
GluPhe: 0.0 ± 0.0
GluGly: 3.628 ± 0.977
GluHis: 2.903 ± 1.35
GluIle: 2.177 ± 0.889
GluLys: 0.726 ± 0.458
GluLeu: 4.354 ± 0.714
GluMet: 0.0 ± 0.84
GluAsn: 2.177 ± 1.375
GluPro: 1.451 ± 0.693
GluGln: 4.354 ± 1.447
GluArg: 1.451 ± 1.079
GluSer: 5.08 ± 2.27
GluThr: 3.628 ± 0.545
GluVal: 0.0 ± 0.0
GluTrp: 1.451 ± 0.693
GluTyr: 3.628 ± 1.478
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.903 ± 1.35
PheCys: 4.354 ± 3.526
PheAsp: 2.177 ± 0.889
PheGlu: 0.726 ± 0.458
PhePhe: 0.0 ± 0.0
PheGly: 0.726 ± 0.458
PheHis: 0.726 ± 0.458
PheIle: 1.451 ± 0.916
PheLys: 3.628 ± 1.56
PheLeu: 0.726 ± 0.458
PheMet: 1.451 ± 0.916
PheAsn: 2.177 ± 1.375
PhePro: 1.451 ± 0.773
PheGln: 1.451 ± 0.693
PheArg: 0.726 ± 0.458
PheSer: 2.177 ± 0.889
PheThr: 2.903 ± 1.546
PheVal: 1.451 ± 0.916
PheTrp: 0.0 ± 0.0
PheTyr: 0.726 ± 0.908
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 5.08 ± 3.106
GlyCys: 2.177 ± 1.763
GlyAsp: 9.434 ± 5.339
GlyGlu: 0.0 ± 0.0
GlyPhe: 1.451 ± 0.916
GlyGly: 5.08 ± 0.695
GlyHis: 0.726 ± 0.458
GlyIle: 1.451 ± 0.916
GlyLys: 3.628 ± 0.545
GlyLeu: 4.354 ± 2.749
GlyMet: 0.0 ± 0.0
GlyAsn: 3.628 ± 1.56
GlyPro: 2.903 ± 1.184
GlyGln: 0.0 ± 0.0
GlyArg: 5.806 ± 1.887
GlySer: 3.628 ± 0.545
GlyThr: 2.177 ± 0.675
GlyVal: 1.451 ± 0.916
GlyTrp: 1.451 ± 0.916
GlyTyr: 2.177 ± 1.375
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 0.726 ± 0.458
HisGlu: 2.177 ± 1.375
HisPhe: 2.177 ± 1.763
HisGly: 0.726 ± 0.458
HisHis: 0.726 ± 0.458
HisIle: 0.726 ± 0.458
HisLys: 0.726 ± 0.867
HisLeu: 0.726 ± 0.458
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 1.451 ± 0.916
HisGln: 0.726 ± 0.458
HisArg: 2.177 ± 0.889
HisSer: 2.903 ± 2.328
HisThr: 1.451 ± 0.693
HisVal: 0.0 ± 0.0
HisTrp: 2.903 ± 1.35
HisTyr: 0.726 ± 0.908
HisXaa: 0.0 ± 0.0

Ile
IleAla: 0.726 ± 0.458
IleCys: 2.177 ± 0.889
IleAsp: 0.726 ± 0.458
IleGlu: 0.0 ± 0.0
IlePhe: 0.726 ± 0.458
IleGly: 2.903 ± 1.184
IleHis: 0.0 ± 0.0
IleIle: 2.903 ± 1.546
IleLys: 1.451 ± 0.916
IleLeu: 2.903 ± 1.35
IleMet: 0.0 ± 0.0
IleAsn: 2.903 ± 0.408
IlePro: 5.806 ± 2.325
IleGln: 2.177 ± 0.889
IleArg: 1.451 ± 0.693
IleSer: 0.0 ± 0.0
IleThr: 1.451 ± 0.916
IleVal: 2.903 ± 1.833
IleTrp: 0.0 ± 0.0
IleTyr: 3.628 ± 1.478
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.628 ± 0.977
LysCys: 2.177 ± 1.375
LysAsp: 4.354 ± 1.518
LysGlu: 2.177 ± 1.501
LysPhe: 1.451 ± 1.817
LysGly: 2.177 ± 1.375
LysHis: 1.451 ± 0.693
LysIle: 2.903 ± 1.546
LysLys: 5.08 ± 3.863
LysLeu: 4.354 ± 2.749
LysMet: 0.0 ± 0.0
LysAsn: 3.628 ± 1.417
LysPro: 2.177 ± 0.675
LysGln: 4.354 ± 1.896
LysArg: 7.257 ± 1.678
LysSer: 7.257 ± 2.635
LysThr: 5.08 ± 1.017
LysVal: 1.451 ± 0.773
LysTrp: 2.903 ± 1.833
LysTyr: 0.726 ± 0.458
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 10.885 ± 4.678
LeuCys: 0.0 ± 0.0
LeuAsp: 1.451 ± 0.916
LeuGlu: 5.08 ± 3.106
LeuPhe: 5.806 ± 2.325
LeuGly: 2.903 ± 1.184
LeuHis: 1.451 ± 0.916
LeuIle: 1.451 ± 0.916
LeuLys: 5.08 ± 2.328
LeuLeu: 5.806 ± 1.301
LeuMet: 3.628 ± 1.399
LeuAsn: 1.451 ± 0.916
LeuPro: 5.806 ± 0.863
LeuGln: 2.903 ± 1.094
LeuArg: 4.354 ± 1.971
LeuSer: 2.903 ± 1.833
LeuThr: 5.08 ± 3.208
LeuVal: 0.726 ± 0.458
LeuTrp: 0.726 ± 0.458
LeuTyr: 2.903 ± 1.833
LeuXaa: 0.0 ± 0.0

Met
MetAla: 0.726 ± 0.458
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 0.726 ± 0.458
MetPhe: 0.726 ± 0.458
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 0.726 ± 0.458
MetLys: 0.726 ± 0.458
MetLeu: 2.903 ± 1.833
MetMet: 1.451 ± 0.924
MetAsn: 0.726 ± 0.458
MetPro: 2.903 ± 1.833
MetGln: 0.0 ± 0.0
MetArg: 2.177 ± 1.623
MetSer: 2.903 ± 1.35
MetThr: 0.726 ± 0.867
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 0.726 ± 0.458
AsnCys: 0.0 ± 0.0
AsnAsp: 0.0 ± 0.0
AsnGlu: 1.451 ± 0.916
AsnPhe: 4.354 ± 1.777
AsnGly: 1.451 ± 0.773
AsnHis: 0.0 ± 0.0
AsnIle: 0.726 ± 0.458
AsnLys: 2.903 ± 1.094
AsnLeu: 5.806 ± 0.931
AsnMet: 0.0 ± 0.0
AsnAsn: 2.903 ± 1.094
AsnPro: 5.08 ± 1.74
AsnGln: 1.451 ± 0.916
AsnArg: 2.177 ± 1.375
AsnSer: 4.354 ± 2.079
AsnThr: 3.628 ± 2.172
AsnVal: 0.0 ± 0.0
AsnTrp: 1.451 ± 0.916
AsnTyr: 3.628 ± 1.417
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 8.708 ± 6.625
ProCys: 0.726 ± 0.458
ProAsp: 3.628 ± 1.56
ProGlu: 5.806 ± 1.29
ProPhe: 2.177 ± 1.375
ProGly: 7.257 ± 5.688
ProHis: 1.451 ± 1.079
ProIle: 3.628 ± 1.164
ProLys: 5.806 ± 2.739
ProLeu: 6.531 ± 3.281
ProMet: 1.451 ± 0.773
ProAsn: 0.726 ± 0.908
ProPro: 13.788 ± 4.115
ProGln: 2.903 ± 0.408
ProArg: 7.983 ± 1.454
ProSer: 7.257 ± 3.311
ProThr: 3.628 ± 1.023
ProVal: 1.451 ± 1.817
ProTrp: 3.628 ± 1.56
ProTyr: 2.177 ± 1.375
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.903 ± 0.408
GlnCys: 0.0 ± 0.0
GlnAsp: 0.726 ± 0.458
GlnGlu: 3.628 ± 1.478
GlnPhe: 0.726 ± 0.458
GlnGly: 0.726 ± 0.908
GlnHis: 0.0 ± 0.0
GlnIle: 2.177 ± 0.793
GlnLys: 5.806 ± 1.12
GlnLeu: 1.451 ± 0.916
GlnMet: 0.0 ± 0.0
GlnAsn: 0.726 ± 0.867
GlnPro: 6.531 ± 1.898
GlnGln: 5.08 ± 1.173
GlnArg: 5.08 ± 1.1
GlnSer: 2.903 ± 0.408
GlnThr: 2.177 ± 0.889
GlnVal: 2.903 ± 1.094
GlnTrp: 1.451 ± 0.916
GlnTyr: 2.177 ± 0.793
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 1.451 ± 1.079
ArgCys: 0.726 ± 0.458
ArgAsp: 2.903 ± 1.443
ArgGlu: 5.08 ± 1.74
ArgPhe: 0.726 ± 0.908
ArgGly: 5.806 ± 2.472
ArgHis: 3.628 ± 0.977
ArgIle: 1.451 ± 0.916
ArgLys: 2.903 ± 1.546
ArgLeu: 4.354 ± 1.447
ArgMet: 0.726 ± 0.458
ArgAsn: 2.177 ± 0.793
ArgPro: 10.16 ± 2.652
ArgGln: 3.628 ± 1.417
ArgArg: 26.125 ± 8.433
ArgSer: 4.354 ± 0.834
ArgThr: 1.451 ± 0.916
ArgVal: 2.903 ± 1.184
ArgTrp: 1.451 ± 0.773
ArgTyr: 7.257 ± 1.71
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 2.177 ± 0.793
SerCys: 0.726 ± 0.458
SerAsp: 6.531 ± 3.068
SerGlu: 5.08 ± 1.855
SerPhe: 2.177 ± 1.375
SerGly: 4.354 ± 1.586
SerHis: 3.628 ± 1.843
SerIle: 0.0 ± 0.0
SerLys: 7.983 ± 2.93
SerLeu: 1.451 ± 0.916
SerMet: 0.0 ± 0.0
SerAsn: 4.354 ± 2.079
SerPro: 5.08 ± 3.137
SerGln: 2.903 ± 2.352
SerArg: 5.08 ± 2.27
SerSer: 9.434 ± 5.84
SerThr: 10.16 ± 5.135
SerVal: 3.628 ± 0.977
SerTrp: 3.628 ± 1.911
SerTyr: 1.451 ± 0.916
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.354 ± 1.777
ThrCys: 0.726 ± 0.867
ThrAsp: 0.726 ± 0.458
ThrGlu: 5.08 ± 2.209
ThrPhe: 4.354 ± 0.714
ThrGly: 5.806 ± 1.791
ThrHis: 2.177 ± 1.375
ThrIle: 3.628 ± 1.478
ThrLys: 3.628 ± 1.478
ThrLeu: 3.628 ± 2.172
ThrMet: 0.0 ± 0.0
ThrAsn: 6.531 ± 1.66
ThrPro: 7.257 ± 2.586
ThrGln: 5.08 ± 2.352
ThrArg: 2.177 ± 0.793
ThrSer: 2.903 ± 2.521
ThrThr: 2.903 ± 0.408
ThrVal: 0.726 ± 0.458
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.903 ± 1.833
ThrXaa: 0.0 ± 0.0

Val
ValAla: 0.0 ± 0.0
ValCys: 0.726 ± 0.458
ValAsp: 1.451 ± 0.693
ValGlu: 0.726 ± 0.458
ValPhe: 1.451 ± 0.916
ValGly: 0.726 ± 0.458
ValHis: 0.726 ± 0.458
ValIle: 2.177 ± 1.763
ValLys: 2.177 ± 1.375
ValLeu: 7.257 ± 1.054
ValMet: 0.726 ± 0.458
ValAsn: 0.726 ± 0.908
ValPro: 0.726 ± 0.458
ValGln: 0.726 ± 0.908
ValArg: 5.806 ± 1.887
ValSer: 2.903 ± 1.094
ValThr: 1.451 ± 0.916
ValVal: 3.628 ± 0.977
ValTrp: 0.0 ± 0.0
ValTyr: 2.177 ± 1.375
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.726 ± 0.458
TrpCys: 0.726 ± 0.458
TrpAsp: 2.177 ± 1.375
TrpGlu: 2.177 ± 0.675
TrpPhe: 1.451 ± 0.916
TrpGly: 0.726 ± 0.458
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.451 ± 0.693
TrpLeu: 2.903 ± 1.546
TrpMet: 0.726 ± 0.692
TrpAsn: 0.0 ± 0.0
TrpPro: 0.726 ± 0.458
TrpGln: 1.451 ± 0.916
TrpArg: 4.354 ± 0.714
TrpSer: 0.726 ± 0.458
TrpThr: 0.726 ± 0.458
TrpVal: 0.0 ± 0.0
TrpTrp: 0.726 ± 0.458
TrpTyr: 3.628 ± 0.977
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 4.354 ± 0.714
TyrCys: 0.0 ± 0.0
TyrAsp: 0.0 ± 0.0
TyrGlu: 0.726 ± 0.458
TyrPhe: 1.451 ± 0.916
TyrGly: 1.451 ± 0.916
TyrHis: 0.726 ± 0.458
TyrIle: 0.726 ± 0.458
TyrLys: 4.354 ± 1.518
TyrLeu: 2.903 ± 1.833
TyrMet: 1.451 ± 0.916
TyrAsn: 6.531 ± 0.883
TyrPro: 3.628 ± 1.164
TyrGln: 0.726 ± 0.458
TyrArg: 2.903 ± 1.833
TyrSer: 3.628 ± 1.417
TyrThr: 6.531 ± 4.124
TyrVal: 2.903 ± 1.833
TyrTrp: 0.726 ± 0.867
TyrTyr: 0.726 ± 0.458
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 4 proteins (1379 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
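A protein-level bootstrap can be sketched as follows. This is a minimal illustration, not the database's actual code: it assumes that whole proteins are resampled with replacement 100 times and that the reported error is the standard deviation of the per-replicate frequencies. The function names and toy sequences are hypothetical.

```python
import random
import statistics


def frequency_per_mille(proteins, dipeptide):
    """Per mille frequency of one dipeptide over overlapping residue pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(frequency_per_mille(sample, dipeptide))
    # Assumption: the error is the population standard deviation.
    return statistics.pstdev(estimates)


# Hypothetical toy sequences, not the real proteome.
toy = ["MRRRPA", "AAGRR", "MKTAY", "PPGRR"]
print(frequency_per_mille(toy, "RR"), bootstrap_error(toy, "RR"))
```

Under this scheme a dipeptide that occurs in none of the proteins has the same frequency (0) in every replicate, so its error is also 0, which matches the many "0.0 ± 0.0" entries in the table.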

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski