Amino acid dipeptide frequency for Capybara microvirus Cap3_SP_465

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
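The calculation described above can be sketched in Python as follows. This is a minimal illustration, not the database's own code: the function names `dipeptide_permille` and `classify` are hypothetical, sequences are assumed to be given in one-letter code, and the sketch assumes that overlapping adjacent pairs are counted within each protein and never across protein boundaries (how boundaries are actually handled is not stated here).

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids
ALPHABET = STANDARD + "X"           # X stands in for Xaa (assumption)

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille = 0.25 %
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return a dict mapping all 441 dipeptides to their frequency in per mille."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping adjacent pairs, never spanning two proteins (assumption).
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("need at least one dipeptide")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"
```

With this sketch, a table value such as 11.628‰ would be labelled "over" and 0.684‰ "under"; multiplying a per mille value by 10^-3 gives the plain fraction.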

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.628 ± 8.736
AlaCys: 0.0 ± 0.0
AlaAsp: 3.42 ± 1.825
AlaGlu: 2.736 ± 1.457
AlaPhe: 3.42 ± 1.626
AlaGly: 3.42 ± 3.032
AlaHis: 2.052 ± 0.838
AlaIle: 3.42 ± 2.329
AlaLys: 2.736 ± 1.918
AlaLeu: 4.788 ± 1.282
AlaMet: 2.052 ± 1.215
AlaAsn: 6.84 ± 3.825
AlaPro: 0.684 ± 0.447
AlaGln: 2.736 ± 2.426
AlaArg: 1.368 ± 0.638
AlaSer: 8.892 ± 3.514
AlaThr: 3.42 ± 1.688
AlaVal: 3.42 ± 0.966
AlaTrp: 0.684 ± 0.447
AlaTyr: 4.104 ± 1.079
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.684 ± 0.447
CysAsp: 2.736 ± 1.644
CysGlu: 2.052 ± 0.796
CysPhe: 0.0 ± 0.0
CysGly: 0.684 ± 0.806
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 2.052 ± 0.884
CysMet: 0.0 ± 0.0
CysAsn: 0.684 ± 0.806
CysPro: 0.0 ± 0.0
CysGln: 0.684 ± 0.65
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.684 ± 0.65
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.368 ± 0.638
AspCys: 1.368 ± 0.968
AspAsp: 6.156 ± 2.356
AspGlu: 4.104 ± 1.91
AspPhe: 4.104 ± 1.139
AspGly: 2.052 ± 1.34
AspHis: 1.368 ± 0.578
AspIle: 3.42 ± 1.319
AspLys: 4.104 ± 2.171
AspLeu: 6.156 ± 0.912
AspMet: 2.736 ± 0.95
AspAsn: 4.104 ± 1.72
AspPro: 0.684 ± 0.806
AspGln: 2.052 ± 0.5
AspArg: 0.684 ± 0.447
AspSer: 3.42 ± 1.663
AspThr: 3.42 ± 1.414
AspVal: 4.104 ± 1.416
AspTrp: 0.684 ± 0.447
AspTyr: 4.788 ± 1.53
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.42 ± 1.046
GluCys: 0.684 ± 0.806
GluAsp: 3.42 ± 1.318
GluGlu: 2.736 ± 2.6
GluPhe: 2.736 ± 1.678
GluGly: 0.0 ± 0.0
GluHis: 1.368 ± 0.802
GluIle: 3.42 ± 1.319
GluLys: 2.736 ± 1.404
GluLeu: 5.472 ± 2.718
GluMet: 2.736 ± 1.261
GluAsn: 2.736 ± 1.212
GluPro: 0.0 ± 0.0
GluGln: 3.42 ± 1.649
GluArg: 0.0 ± 0.0
GluSer: 1.368 ± 1.121
GluThr: 4.104 ± 2.422
GluVal: 5.472 ± 1.608
GluTrp: 0.684 ± 0.447
GluTyr: 3.42 ± 1.81
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.736 ± 1.031
PheCys: 0.0 ± 0.0
PheAsp: 3.42 ± 1.055
PheGlu: 2.052 ± 1.291
PhePhe: 1.368 ± 0.638
PheGly: 4.104 ± 1.444
PheHis: 0.684 ± 0.65
PheIle: 2.052 ± 0.791
PheLys: 2.736 ± 1.238
PheLeu: 1.368 ± 0.894
PheMet: 3.42 ± 1.594
PheAsn: 6.156 ± 1.388
PhePro: 0.684 ± 0.806
PheGln: 1.368 ± 0.894
PheArg: 2.052 ± 0.804
PheSer: 3.42 ± 1.395
PheThr: 4.104 ± 1.13
PheVal: 3.42 ± 0.901
PheTrp: 1.368 ± 1.3
PheTyr: 1.368 ± 0.578
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.42 ± 2.271
GlyCys: 0.684 ± 0.806
GlyAsp: 3.42 ± 1.571
GlyGlu: 4.788 ± 1.287
GlyPhe: 2.052 ± 0.804
GlyGly: 4.104 ± 1.299
GlyHis: 1.368 ± 0.894
GlyIle: 3.42 ± 1.008
GlyLys: 2.052 ± 1.513
GlyLeu: 5.472 ± 1.78
GlyMet: 0.0 ± 0.0
GlyAsn: 5.472 ± 1.104
GlyPro: 1.368 ± 0.802
GlyGln: 0.684 ± 0.606
GlyArg: 0.684 ± 0.447
GlySer: 9.576 ± 1.391
GlyThr: 4.788 ± 1.867
GlyVal: 4.788 ± 1.251
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.052 ± 1.34
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.684 ± 0.65
HisCys: 0.684 ± 0.447
HisAsp: 1.368 ± 0.578
HisGlu: 0.0 ± 0.0
HisPhe: 1.368 ± 0.578
HisGly: 0.684 ± 0.447
HisHis: 0.0 ± 0.0
HisIle: 1.368 ± 0.802
HisLys: 0.684 ± 0.447
HisLeu: 1.368 ± 0.578
HisMet: 0.684 ± 0.447
HisAsn: 0.0 ± 0.0
HisPro: 0.0 ± 0.0
HisGln: 2.052 ± 0.838
HisArg: 1.368 ± 0.996
HisSer: 2.736 ± 1.378
HisThr: 0.684 ± 0.447
HisVal: 1.368 ± 0.894
HisTrp: 0.684 ± 0.447
HisTyr: 0.684 ± 0.783
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.736 ± 1.388
IleCys: 0.684 ± 0.783
IleAsp: 4.104 ± 1.903
IleGlu: 2.052 ± 1.692
IlePhe: 0.0 ± 0.0
IleGly: 2.736 ± 1.165
IleHis: 0.0 ± 0.0
IleIle: 3.42 ± 1.312
IleLys: 2.736 ± 1.066
IleLeu: 5.472 ± 1.813
IleMet: 0.684 ± 0.447
IleAsn: 2.736 ± 1.36
IlePro: 4.104 ± 1.994
IleGln: 1.368 ± 0.76
IleArg: 0.0 ± 0.0
IleSer: 4.104 ± 1.606
IleThr: 2.736 ± 0.95
IleVal: 2.736 ± 1.796
IleTrp: 0.684 ± 0.447
IleTyr: 4.788 ± 1.743
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.104 ± 1.542
LysCys: 0.0 ± 0.0
LysAsp: 1.368 ± 0.968
LysGlu: 5.472 ± 1.126
LysPhe: 2.052 ± 1.692
LysGly: 2.052 ± 0.804
LysHis: 1.368 ± 0.802
LysIle: 2.052 ± 1.652
LysLys: 2.736 ± 2.052
LysLeu: 7.524 ± 2.199
LysMet: 1.368 ± 0.764
LysAsn: 4.788 ± 3.413
LysPro: 0.0 ± 0.0
LysGln: 5.472 ± 3.262
LysArg: 0.684 ± 0.65
LysSer: 6.156 ± 1.686
LysThr: 3.42 ± 0.866
LysVal: 4.788 ± 2.602
LysTrp: 0.0 ± 0.0
LysTyr: 3.42 ± 0.896
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.84 ± 1.779
LeuCys: 2.052 ± 0.912
LeuAsp: 7.524 ± 2.769
LeuGlu: 5.472 ± 2.35
LeuPhe: 1.368 ± 0.968
LeuGly: 5.472 ± 1.517
LeuHis: 0.684 ± 0.447
LeuIle: 6.156 ± 1.556
LeuLys: 5.472 ± 2.508
LeuLeu: 4.788 ± 1.259
LeuMet: 2.052 ± 0.692
LeuAsn: 5.472 ± 1.29
LeuPro: 6.84 ± 1.971
LeuGln: 4.104 ± 0.997
LeuArg: 2.736 ± 1.088
LeuSer: 6.84 ± 0.87
LeuThr: 2.736 ± 1.165
LeuVal: 3.42 ± 1.132
LeuTrp: 0.684 ± 0.447
LeuTyr: 2.052 ± 0.912
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.368 ± 0.579
MetCys: 0.0 ± 0.0
MetAsp: 0.684 ± 0.447
MetGlu: 0.684 ± 0.657
MetPhe: 0.684 ± 0.447
MetGly: 0.684 ± 0.447
MetHis: 0.684 ± 0.447
MetIle: 0.0 ± 0.0
MetLys: 4.104 ± 1.798
MetLeu: 2.736 ± 1.479
MetMet: 0.684 ± 0.433
MetAsn: 0.684 ± 0.606
MetPro: 1.368 ± 0.894
MetGln: 0.684 ± 0.65
MetArg: 2.052 ± 0.912
MetSer: 4.788 ± 1.351
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 0.684 ± 0.447
MetTyr: 1.368 ± 0.802
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 7.524 ± 2.161
AsnCys: 0.0 ± 0.0
AsnAsp: 2.052 ± 1.032
AsnGlu: 2.052 ± 0.796
AsnPhe: 2.052 ± 0.796
AsnGly: 3.42 ± 1.055
AsnHis: 0.684 ± 0.447
AsnIle: 2.052 ± 0.5
AsnLys: 6.156 ± 0.968
AsnLeu: 6.156 ± 1.037
AsnMet: 0.0 ± 0.0
AsnAsn: 2.736 ± 0.95
AsnPro: 3.42 ± 1.651
AsnGln: 2.052 ± 0.804
AsnArg: 2.736 ± 1.159
AsnSer: 8.208 ± 5.033
AsnThr: 4.788 ± 1.386
AsnVal: 6.156 ± 1.334
AsnTrp: 0.684 ± 0.65
AsnTyr: 3.42 ± 0.866
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.684 ± 0.447
ProCys: 0.0 ± 0.0
ProAsp: 2.736 ± 0.82
ProGlu: 0.684 ± 0.447
ProPhe: 4.788 ± 1.786
ProGly: 2.736 ± 0.82
ProHis: 1.368 ± 0.578
ProIle: 2.736 ± 0.987
ProLys: 0.0 ± 0.0
ProLeu: 2.736 ± 1.277
ProMet: 0.684 ± 0.447
ProAsn: 4.104 ± 1.189
ProPro: 0.684 ± 0.65
ProGln: 0.684 ± 0.447
ProArg: 0.0 ± 0.0
ProSer: 2.052 ± 0.884
ProThr: 4.104 ± 0.713
ProVal: 4.104 ± 2.049
ProTrp: 0.0 ± 0.0
ProTyr: 1.368 ± 0.996
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.104 ± 2.918
GlnCys: 0.0 ± 0.0
GlnAsp: 0.0 ± 0.0
GlnGlu: 2.736 ± 1.238
GlnPhe: 3.42 ± 1.073
GlnGly: 1.368 ± 0.894
GlnHis: 0.0 ± 0.0
GlnIle: 0.684 ± 0.606
GlnLys: 2.736 ± 1.159
GlnLeu: 3.42 ± 1.134
GlnMet: 0.684 ± 0.65
GlnAsn: 2.052 ± 1.34
GlnPro: 1.368 ± 0.894
GlnGln: 2.736 ± 1.678
GlnArg: 2.052 ± 1.099
GlnSer: 6.156 ± 0.968
GlnThr: 4.104 ± 0.997
GlnVal: 2.052 ± 1.146
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.736 ± 1.005
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.684 ± 0.447
ArgCys: 0.0 ± 0.0
ArgAsp: 3.42 ± 1.222
ArgGlu: 1.368 ± 0.882
ArgPhe: 3.42 ± 1.59
ArgGly: 0.684 ± 0.783
ArgHis: 0.0 ± 0.0
ArgIle: 0.684 ± 0.65
ArgLys: 2.052 ± 0.791
ArgLeu: 3.42 ± 1.651
ArgMet: 0.0 ± 0.0
ArgAsn: 0.684 ± 0.783
ArgPro: 2.052 ± 1.146
ArgGln: 0.684 ± 0.806
ArgArg: 1.368 ± 0.968
ArgSer: 4.104 ± 1.405
ArgThr: 2.052 ± 0.838
ArgVal: 0.684 ± 0.447
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.736 ± 1.165
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.84 ± 5.283
SerCys: 1.368 ± 0.76
SerAsp: 4.788 ± 1.485
SerGlu: 2.052 ± 0.5
SerPhe: 4.788 ± 1.832
SerGly: 8.892 ± 2.2
SerHis: 2.052 ± 0.951
SerIle: 5.472 ± 1.382
SerLys: 5.472 ± 1.307
SerLeu: 5.472 ± 1.308
SerMet: 2.736 ± 1.159
SerAsn: 5.472 ± 2.049
SerPro: 4.788 ± 1.162
SerGln: 3.42 ± 1.617
SerArg: 6.84 ± 2.359
SerSer: 11.628 ± 4.349
SerThr: 9.576 ± 2.858
SerVal: 6.156 ± 2.172
SerTrp: 1.368 ± 0.578
SerTyr: 4.104 ± 1.069
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.104 ± 2.87
ThrCys: 0.684 ± 0.65
ThrAsp: 0.684 ± 0.447
ThrGlu: 2.736 ± 0.95
ThrPhe: 2.052 ± 1.553
ThrGly: 7.524 ± 1.788
ThrHis: 1.368 ± 0.578
ThrIle: 0.684 ± 0.447
ThrLys: 3.42 ± 1.003
ThrLeu: 4.104 ± 0.784
ThrMet: 0.684 ± 0.447
ThrAsn: 2.736 ± 0.97
ThrPro: 2.736 ± 0.575
ThrGln: 3.42 ± 1.623
ThrArg: 0.684 ± 0.447
ThrSer: 7.524 ± 2.145
ThrThr: 1.368 ± 0.578
ThrVal: 6.156 ± 1.775
ThrTrp: 1.368 ± 0.894
ThrTyr: 6.156 ± 1.048
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.156 ± 2.526
ValCys: 0.684 ± 0.806
ValAsp: 6.84 ± 1.731
ValGlu: 2.736 ± 1.066
ValPhe: 2.736 ± 0.82
ValGly: 6.84 ± 1.533
ValHis: 0.684 ± 0.783
ValIle: 3.42 ± 1.318
ValLys: 2.736 ± 1.066
ValLeu: 6.84 ± 1.097
ValMet: 0.684 ± 0.447
ValAsn: 5.472 ± 1.64
ValPro: 2.736 ± 1.787
ValGln: 1.368 ± 0.579
ValArg: 2.736 ± 1.121
ValSer: 6.156 ± 2.284
ValThr: 1.368 ± 0.894
ValVal: 1.368 ± 0.578
ValTrp: 0.0 ± 0.0
ValTyr: 2.736 ± 1.937
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.368 ± 0.894
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.684 ± 0.447
TrpPhe: 1.368 ± 0.578
TrpGly: 0.0 ± 0.0
TrpHis: 1.368 ± 0.578
TrpIle: 0.684 ± 0.447
TrpLys: 2.052 ± 1.146
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.684 ± 0.447
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 1.368 ± 0.894
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.684 ± 0.447
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.052 ± 0.801
TyrCys: 0.684 ± 0.447
TyrAsp: 2.736 ± 1.482
TyrGlu: 2.736 ± 1.088
TyrPhe: 4.104 ± 1.567
TyrGly: 2.736 ± 1.005
TyrHis: 1.368 ± 0.894
TyrIle: 2.736 ± 1.494
TyrLys: 4.788 ± 1.779
TyrLeu: 3.42 ± 1.81
TyrMet: 1.368 ± 0.755
TyrAsn: 3.42 ± 1.37
TyrPro: 2.052 ± 0.884
TyrGln: 3.42 ± 0.472
TyrArg: 2.052 ± 0.991
TyrSer: 4.788 ± 0.939
TyrThr: 2.736 ± 1.157
TyrVal: 4.104 ± 1.269
TyrTrp: 0.684 ± 0.447
TyrTyr: 2.052 ± 0.804
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1463 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
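Protein-level bootstrapping can be sketched as follows. The exact procedure used for this table is not described here, so this is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of the 100 replicates, and the reported error is taken to be the sample standard deviation of those replicates. The function names `pair_permille` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes, e.g. "AA") in per mille."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within each protein (assumption).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of a dipeptide frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_permille(sample, pair))
    # Assumption: the error is the sample standard deviation of the replicates.
    return statistics.stdev(replicates)
```

A dipeptide that never occurs has frequency 0 in every replicate, which matches the "0.0 ± 0.0" entries in the table. With only 6 proteins, replicates can differ widely, so a large error relative to the mean is not surprising.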

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski