Amino acid dipeptide frequency for Simian torque teno virus 31

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
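As an illustration of how such frequencies can be obtained, here is a minimal Python sketch. The function name `dipeptide_frequencies` and the one-letter input format are assumptions made for this example, and the exact counting conventions used by the database are not stated on this page.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille, pooled over all proteins.

    `sequences` is an iterable of protein sequences in one-letter code
    (an assumption for this sketch).
    """
    counts = Counter()
    for seq in sequences:
        # Slide an overlapping window of length 2 along each protein;
        # pairs never span the boundary between two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Scale each count to per mille of all observed dipeptides.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}
```

Calling `dipeptide_frequencies(["AAG", "GA"])` yields three dipeptides (AA, AG, GA), each at one third of the total.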

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
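The expected value and the over/under comparison described above can be sketched as follows. The function names are hypothetical, and comparing against the uniform expectation with a strict threshold is an assumption about how the colouring is decided.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for one dipeptide, in per mille.

    With 20 residue types there are 20 * 20 = 400 dipeptides,
    so each is expected at 1000 / 400 = 2.5 per mille.
    """
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_per_mille, n_residue_types=20):
    """Label a dipeptide relative to the uniform expectation.

    The strict comparison is an assumption; the page does not say
    whether the database applies any tolerance.
    """
    expected = expected_per_mille(n_residue_types)
    if observed_per_mille > expected:
        return "over"       # would be marked red
    if observed_per_mille < expected:
        return "under"      # would be marked blue
    return "as expected"
```

Under this rule, the ArgArg value of 47.36‰ is classified as overrepresented, while AlaCys at 0.776‰ is underrepresented.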

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions. For example, the ArgArg value of 47.36‰ corresponds to a fraction of 0.04736.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
8.54AlaAla: 8.54 ± 6.777
0.776AlaCys: 0.776 ± 0.427
1.553AlaAsp: 1.553 ± 0.79
1.553AlaGlu: 1.553 ± 0.79
1.553AlaPhe: 1.553 ± 0.907
9.317AlaGly: 9.317 ± 8.145
0.776AlaHis: 0.776 ± 0.427
2.329AlaIle: 2.329 ± 1.007
0.776AlaLys: 0.776 ± 1.151
4.658AlaLeu: 4.658 ± 2.014
0.776AlaMet: 0.776 ± 1.084
1.553AlaAsn: 1.553 ± 0.79
6.211AlaPro: 6.211 ± 5.396
2.329AlaGln: 2.329 ± 2.442
1.553AlaArg: 1.553 ± 1.64
1.553AlaSer: 1.553 ± 0.854
3.106AlaThr: 3.106 ± 2.081
6.211AlaVal: 6.211 ± 1.537
3.106AlaTrp: 3.106 ± 1.91
0.776AlaTyr: 0.776 ± 0.427
0.0AlaXaa: 0.0 ± 0.0
Cys
0.776CysAla: 0.776 ± 0.427
0.0CysCys: 0.0 ± 0.0
0.776CysAsp: 0.776 ± 0.427
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
1.553CysHis: 1.553 ± 0.79
0.776CysIle: 0.776 ± 1.151
2.329CysLys: 2.329 ± 1.282
1.553CysLeu: 1.553 ± 0.79
0.776CysMet: 0.776 ± 0.427
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
2.329CysSer: 2.329 ± 1.007
1.553CysThr: 1.553 ± 0.854
3.106CysVal: 3.106 ± 1.709
0.0CysTrp: 0.0 ± 0.0
0.776CysTyr: 0.776 ± 0.427
0.0CysXaa: 0.0 ± 0.0
Asp
6.211AspAla: 6.211 ± 2.951
0.0AspCys: 0.0 ± 0.0
3.106AspAsp: 3.106 ± 1.034
3.106AspGlu: 3.106 ± 0.846
2.329AspPhe: 2.329 ± 0.828
3.106AspGly: 3.106 ± 1.476
0.0AspHis: 0.0 ± 0.0
3.106AspIle: 3.106 ± 1.709
2.329AspLys: 2.329 ± 0.825
4.658AspLeu: 4.658 ± 1.651
0.0AspMet: 0.0 ± 0.0
0.776AspAsn: 0.776 ± 1.084
8.54AspPro: 8.54 ± 3.568
1.553AspGln: 1.553 ± 0.854
0.776AspArg: 0.776 ± 0.427
2.329AspSer: 2.329 ± 1.998
3.882AspThr: 3.882 ± 0.705
0.776AspVal: 0.776 ± 0.965
0.776AspTrp: 0.776 ± 1.084
0.776AspTyr: 0.776 ± 0.965
0.0AspXaa: 0.0 ± 0.0
Glu
3.106GluAla: 3.106 ± 1.915
0.0GluCys: 0.0 ± 0.0
3.106GluAsp: 3.106 ± 0.985
3.106GluGlu: 3.106 ± 1.58
0.776GluPhe: 0.776 ± 0.427
4.658GluGly: 4.658 ± 1.745
0.0GluHis: 0.0 ± 0.0
3.882GluIle: 3.882 ± 1.227
0.776GluLys: 0.776 ± 0.427
4.658GluLeu: 4.658 ± 0.954
0.0GluMet: 0.0 ± 0.0
0.776GluAsn: 0.776 ± 0.427
3.106GluPro: 3.106 ± 0.846
0.776GluGln: 0.776 ± 0.427
4.658GluArg: 4.658 ± 1.47
1.553GluSer: 1.553 ± 0.955
6.211GluThr: 6.211 ± 2.626
0.776GluVal: 0.776 ± 1.084
0.0GluTrp: 0.0 ± 0.0
1.553GluTyr: 1.553 ± 0.955
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
1.553PheCys: 1.553 ± 0.79
0.776PheAsp: 0.776 ± 0.427
0.776PheGlu: 0.776 ± 1.151
0.0PhePhe: 0.0 ± 0.0
2.329PheGly: 2.329 ± 0.828
1.553PheHis: 1.553 ± 0.854
2.329PheIle: 2.329 ± 1.282
0.776PheLys: 0.776 ± 0.427
5.435PheLeu: 5.435 ± 2.991
1.553PheMet: 1.553 ± 0.854
0.776PheAsn: 0.776 ± 0.427
3.882PhePro: 3.882 ± 2.628
1.553PheGln: 1.553 ± 1.499
3.106PheArg: 3.106 ± 1.034
3.106PheSer: 3.106 ± 0.955
0.776PheThr: 0.776 ± 0.427
0.0PheVal: 0.0 ± 0.0
0.776PheTrp: 0.776 ± 0.965
0.776PheTyr: 0.776 ± 0.427
0.0PheXaa: 0.0 ± 0.0
Gly
5.435GlyAla: 5.435 ± 3.593
1.553GlyCys: 1.553 ± 0.907
4.658GlyAsp: 4.658 ± 4.164
3.882GlyGlu: 3.882 ± 2.218
0.776GlyPhe: 0.776 ± 1.084
10.87GlyGly: 10.87 ± 3.241
3.106GlyHis: 3.106 ± 1.051
1.553GlyIle: 1.553 ± 1.453
1.553GlyLys: 1.553 ± 0.854
4.658GlyLeu: 4.658 ± 2.563
0.0GlyMet: 0.0 ± 0.0
3.106GlyAsn: 3.106 ± 1.709
5.435GlyPro: 5.435 ± 2.857
4.658GlyGln: 4.658 ± 4.348
10.87GlyArg: 10.87 ± 0.46
5.435GlySer: 5.435 ± 1.469
1.553GlyThr: 1.553 ± 0.79
3.106GlyVal: 3.106 ± 1.813
6.211GlyTrp: 6.211 ± 2.646
4.658GlyTyr: 4.658 ± 1.57
0.0GlyXaa: 0.0 ± 0.0
His
1.553HisAla: 1.553 ± 0.854
1.553HisCys: 1.553 ± 0.79
0.776HisAsp: 0.776 ± 0.427
0.0HisGlu: 0.0 ± 0.0
1.553HisPhe: 1.553 ± 0.79
3.882HisGly: 3.882 ± 1.558
1.553HisHis: 1.553 ± 0.854
1.553HisIle: 1.553 ± 0.854
0.776HisLys: 0.776 ± 0.427
0.776HisLeu: 0.776 ± 0.965
2.329HisMet: 2.329 ± 0.825
0.776HisAsn: 0.776 ± 0.427
4.658HisPro: 4.658 ± 2.97
1.553HisGln: 1.553 ± 0.854
2.329HisArg: 2.329 ± 1.282
2.329HisSer: 2.329 ± 0.825
0.0HisThr: 0.0 ± 0.0
0.0HisVal: 0.0 ± 0.0
0.776HisTrp: 0.776 ± 0.427
2.329HisTyr: 2.329 ± 1.303
0.0HisXaa: 0.0 ± 0.0
Ile
0.0IleAla: 0.0 ± 0.0
0.776IleCys: 0.776 ± 0.427
3.882IleAsp: 3.882 ± 0.973
0.0IleGlu: 0.0 ± 0.0
3.106IlePhe: 3.106 ± 1.217
0.0IleGly: 0.0 ± 0.0
0.776IleHis: 0.776 ± 1.151
1.553IleIle: 1.553 ± 0.854
1.553IleLys: 1.553 ± 0.854
2.329IleLeu: 2.329 ± 1.282
0.0IleMet: 0.0 ± 0.0
2.329IleAsn: 2.329 ± 0.828
6.988IlePro: 6.988 ± 1.622
0.0IleGln: 0.0 ± 0.0
1.553IleArg: 1.553 ± 0.854
3.882IleSer: 3.882 ± 1.899
3.882IleThr: 3.882 ± 1.683
0.776IleVal: 0.776 ± 0.427
0.0IleTrp: 0.0 ± 0.0
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
2.329LysAla: 2.329 ± 1.165
0.776LysCys: 0.776 ± 0.427
2.329LysAsp: 2.329 ± 1.282
1.553LysGlu: 1.553 ± 2.168
2.329LysPhe: 2.329 ± 1.282
4.658LysGly: 4.658 ± 0.629
1.553LysHis: 1.553 ± 0.854
1.553LysIle: 1.553 ± 0.854
0.776LysLys: 0.776 ± 0.427
0.776LysLeu: 0.776 ± 0.427
1.553LysMet: 1.553 ± 0.805
0.0LysAsn: 0.0 ± 0.0
0.776LysPro: 0.776 ± 0.427
0.776LysGln: 0.776 ± 0.427
3.882LysArg: 3.882 ± 1.751
2.329LysSer: 2.329 ± 1.007
4.658LysThr: 4.658 ± 0.803
1.553LysVal: 1.553 ± 0.854
2.329LysTrp: 2.329 ± 1.282
0.776LysTyr: 0.776 ± 0.427
0.0LysXaa: 0.0 ± 0.0
Leu
5.435LeuAla: 5.435 ± 1.606
1.553LeuCys: 1.553 ± 0.854
5.435LeuAsp: 5.435 ± 1.174
4.658LeuGlu: 4.658 ± 1.745
1.553LeuPhe: 1.553 ± 0.854
3.882LeuGly: 3.882 ± 0.899
4.658LeuHis: 4.658 ± 3.423
1.553LeuIle: 1.553 ± 0.854
3.106LeuLys: 3.106 ± 1.709
9.317LeuLeu: 9.317 ± 3.985
4.658LeuMet: 4.658 ± 2.046
1.553LeuAsn: 1.553 ± 0.854
3.882LeuPro: 3.882 ± 1.227
3.882LeuGln: 3.882 ± 2.136
5.435LeuArg: 5.435 ± 1.174
3.882LeuSer: 3.882 ± 2.238
4.658LeuThr: 4.658 ± 2.563
0.776LeuVal: 0.776 ± 0.427
0.776LeuTrp: 0.776 ± 0.427
4.658LeuTyr: 4.658 ± 1.745
0.0LeuXaa: 0.0 ± 0.0
Met
3.106MetAla: 3.106 ± 0.955
0.0MetCys: 0.0 ± 0.0
0.776MetAsp: 0.776 ± 0.427
0.776MetGlu: 0.776 ± 0.427
2.329MetPhe: 2.329 ± 1.303
0.0MetGly: 0.0 ± 0.0
0.776MetHis: 0.776 ± 0.427
0.0MetIle: 0.0 ± 0.0
0.776MetLys: 0.776 ± 0.427
0.776MetLeu: 0.776 ± 0.427
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
3.106MetPro: 3.106 ± 1.217
0.0MetGln: 0.0 ± 0.0
0.776MetArg: 0.776 ± 0.427
3.106MetSer: 3.106 ± 1.58
2.329MetThr: 2.329 ± 1.007
0.776MetVal: 0.776 ± 0.965
0.0MetTrp: 0.0 ± 0.0
1.553MetTyr: 1.553 ± 0.854
0.0MetXaa: 0.0 ± 0.0
Asn
2.329AsnAla: 2.329 ± 1.139
0.0AsnCys: 0.0 ± 0.0
1.553AsnAsp: 1.553 ± 0.854
0.776AsnGlu: 0.776 ± 0.427
0.0AsnPhe: 0.0 ± 0.0
0.0AsnGly: 0.0 ± 0.0
0.0AsnHis: 0.0 ± 0.0
1.553AsnIle: 1.553 ± 0.854
3.106AsnLys: 3.106 ± 1.709
2.329AsnLeu: 2.329 ± 0.828
1.553AsnMet: 1.553 ± 0.777
0.776AsnAsn: 0.776 ± 0.427
3.882AsnPro: 3.882 ± 1.227
0.776AsnGln: 0.776 ± 0.427
0.776AsnArg: 0.776 ± 1.151
3.106AsnSer: 3.106 ± 1.051
0.776AsnThr: 0.776 ± 0.965
0.776AsnVal: 0.776 ± 0.427
0.0AsnTrp: 0.0 ± 0.0
1.553AsnTyr: 1.553 ± 0.854
0.0AsnXaa: 0.0 ± 0.0
Pro
5.435ProAla: 5.435 ± 4.531
3.882ProCys: 3.882 ± 2.136
2.329ProAsp: 2.329 ± 1.282
4.658ProGlu: 4.658 ± 1.57
4.658ProPhe: 4.658 ± 0.803
9.317ProGly: 9.317 ± 4.393
2.329ProHis: 2.329 ± 1.303
1.553ProIle: 1.553 ± 2.168
3.882ProLys: 3.882 ± 3.789
6.988ProLeu: 6.988 ± 1.846
0.776ProMet: 0.776 ± 0.427
3.882ProAsn: 3.882 ± 0.899
13.199ProPro: 13.199 ± 4.329
5.435ProGln: 5.435 ± 2.115
13.199ProArg: 13.199 ± 4.866
3.882ProSer: 3.882 ± 1.051
3.106ProThr: 3.106 ± 1.709
3.882ProVal: 3.882 ± 3.382
0.776ProTrp: 0.776 ± 0.427
1.553ProTyr: 1.553 ± 0.907
0.0ProXaa: 0.0 ± 0.0
Gln
3.106GlnAla: 3.106 ± 2.031
0.776GlnCys: 0.776 ± 0.427
0.776GlnAsp: 0.776 ± 0.427
2.329GlnGlu: 2.329 ± 1.165
0.0GlnPhe: 0.0 ± 0.0
2.329GlnGly: 2.329 ± 1.901
2.329GlnHis: 2.329 ± 1.303
0.776GlnIle: 0.776 ± 1.084
0.776GlnLys: 0.776 ± 1.084
4.658GlnLeu: 4.658 ± 1.651
0.0GlnMet: 0.0 ± 0.0
0.776GlnAsn: 0.776 ± 0.427
2.329GlnPro: 2.329 ± 0.828
2.329GlnGln: 2.329 ± 1.007
4.658GlnArg: 4.658 ± 2.014
2.329GlnSer: 2.329 ± 1.303
2.329GlnThr: 2.329 ± 1.711
5.435GlnVal: 5.435 ± 1.736
0.776GlnTrp: 0.776 ± 0.427
2.329GlnTyr: 2.329 ± 1.139
0.0GlnXaa: 0.0 ± 0.0
Arg
3.882ArgAla: 3.882 ± 1.683
0.776ArgCys: 0.776 ± 0.427
3.882ArgAsp: 3.882 ± 2.376
4.658ArgGlu: 4.658 ± 1.655
2.329ArgPhe: 2.329 ± 0.828
10.87ArgGly: 10.87 ± 1.507
2.329ArgHis: 2.329 ± 1.282
0.776ArgIle: 0.776 ± 1.151
3.106ArgLys: 3.106 ± 0.846
8.54ArgLeu: 8.54 ± 2.785
3.106ArgMet: 3.106 ± 1.475
0.776ArgAsn: 0.776 ± 1.151
8.54ArgPro: 8.54 ± 3.849
3.106ArgGln: 3.106 ± 1.709
47.36ArgArg: 47.36 ± 12.689
5.435ArgSer: 5.435 ± 2.592
6.988ArgThr: 6.988 ± 2.825
3.106ArgVal: 3.106 ± 0.846
4.658ArgTrp: 4.658 ± 1.745
2.329ArgTyr: 2.329 ± 1.282
0.0ArgXaa: 0.0 ± 0.0
Ser
1.553SerAla: 1.553 ± 0.955
0.0SerCys: 0.0 ± 0.0
4.658SerAsp: 4.658 ± 2.932
4.658SerGlu: 4.658 ± 1.868
2.329SerPhe: 2.329 ± 1.282
1.553SerGly: 1.553 ± 0.79
1.553SerHis: 1.553 ± 0.79
2.329SerIle: 2.329 ± 0.828
3.106SerLys: 3.106 ± 1.217
5.435SerLeu: 5.435 ± 2.136
1.553SerMet: 1.553 ± 0.907
2.329SerAsn: 2.329 ± 1.282
5.435SerPro: 5.435 ± 3.11
3.106SerGln: 3.106 ± 1.915
5.435SerArg: 5.435 ± 2.592
13.199SerSer: 13.199 ± 10.991
6.211SerThr: 6.211 ± 3.757
3.106SerVal: 3.106 ± 1.034
4.658SerTrp: 4.658 ± 2.932
2.329SerTyr: 2.329 ± 0.828
0.0SerXaa: 0.0 ± 0.0
Thr
3.882ThrAla: 3.882 ± 2.923
1.553ThrCys: 1.553 ± 0.955
1.553ThrAsp: 1.553 ± 0.854
3.882ThrGlu: 3.882 ± 1.227
3.106ThrPhe: 3.106 ± 1.034
6.988ThrGly: 6.988 ± 2.566
3.106ThrHis: 3.106 ± 1.217
1.553ThrIle: 1.553 ± 0.854
3.882ThrLys: 3.882 ± 1.521
3.106ThrLeu: 3.106 ± 1.034
0.0ThrMet: 0.0 ± 0.0
1.553ThrAsn: 1.553 ± 0.854
6.988ThrPro: 6.988 ± 3.065
5.435ThrGln: 5.435 ± 1.96
6.211ThrArg: 6.211 ± 1.243
3.106ThrSer: 3.106 ± 0.985
3.106ThrThr: 3.106 ± 1.051
0.776ThrVal: 0.776 ± 0.427
0.776ThrTrp: 0.776 ± 0.427
0.776ThrTyr: 0.776 ± 0.427
0.0ThrXaa: 0.0 ± 0.0
Val
1.553ValAla: 1.553 ± 0.907
0.0ValCys: 0.0 ± 0.0
4.658ValAsp: 4.658 ± 0.954
0.776ValGlu: 0.776 ± 0.427
0.776ValPhe: 0.776 ± 1.151
3.882ValGly: 3.882 ± 1.521
2.329ValHis: 2.329 ± 0.825
1.553ValIle: 1.553 ± 0.907
0.776ValLys: 0.776 ± 0.427
2.329ValLeu: 2.329 ± 0.825
0.776ValMet: 0.776 ± 0.427
0.0ValAsn: 0.0 ± 0.0
1.553ValPro: 1.553 ± 1.64
1.553ValGln: 1.553 ± 0.907
6.211ValArg: 6.211 ± 1.537
5.435ValSer: 5.435 ± 1.96
0.776ValThr: 0.776 ± 0.965
0.776ValVal: 0.776 ± 0.427
0.0ValTrp: 0.0 ± 0.0
0.776ValTyr: 0.776 ± 0.427
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.553TrpAsp: 1.553 ± 0.955
0.776TrpGlu: 0.776 ± 0.427
0.776TrpPhe: 0.776 ± 0.427
3.106TrpGly: 3.106 ± 1.709
0.0TrpHis: 0.0 ± 0.0
1.553TrpIle: 1.553 ± 0.955
0.776TrpLys: 0.776 ± 0.965
2.329TrpLeu: 2.329 ± 0.825
0.0TrpMet: 0.0 ± 0.0
0.776TrpAsn: 0.776 ± 0.427
3.106TrpPro: 3.106 ± 1.051
1.553TrpGln: 1.553 ± 0.955
3.106TrpArg: 3.106 ± 1.709
3.882TrpSer: 3.882 ± 2.938
0.776TrpThr: 0.776 ± 1.084
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
2.329TrpTyr: 2.329 ± 1.282
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.776TyrAla: 0.776 ± 0.965
0.776TyrCys: 0.776 ± 0.427
0.0TyrAsp: 0.0 ± 0.0
1.553TyrGlu: 1.553 ± 0.854
1.553TyrPhe: 1.553 ± 0.854
2.329TyrGly: 2.329 ± 1.282
0.776TyrHis: 0.776 ± 0.427
1.553TyrIle: 1.553 ± 0.79
2.329TyrLys: 2.329 ± 1.007
0.776TyrLeu: 0.776 ± 1.151
0.776TyrMet: 0.776 ± 0.427
3.106TyrAsn: 3.106 ± 1.813
3.106TyrPro: 3.106 ± 0.955
0.776TyrGln: 0.776 ± 1.151
4.658TyrArg: 4.658 ± 1.873
2.329TyrSer: 2.329 ± 0.825
4.658TyrThr: 4.658 ± 2.563
0.776TyrVal: 0.776 ± 0.427
0.0TyrTrp: 0.0 ± 0.0
0.776TyrTyr: 0.776 ± 0.427
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1289 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
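A protein-level bootstrap of this kind could look roughly like the sketch below. Whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. Using the sample standard deviation as the reported error, as well as the function names and the fixed seed, are assumptions made for this example and are not confirmed by the page.

```python
import random
import statistics


def per_mille(sequences, pair):
    """Frequency of one dipeptide (one-letter code) in per mille."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins."""
    rng = random.Random(seed)  # fixed seed only for reproducibility
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(per_mille(sample, pair))
    # Assumption: the error is the standard deviation over replicates.
    return statistics.stdev(estimates)
```

With only 4 proteins in this proteome, each replicate is drawn from a very small pool, which would be consistent with the large errors relative to the values in many table entries.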

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski