Amino acid dipepetide frequency for Capybara microvirus Cap1_SP_93

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.632AlaAla: 4.632 ± 2.439
0.579AlaCys: 0.579 ± 0.452
1.737AlaAsp: 1.737 ± 1.809
1.737AlaGlu: 1.737 ± 0.985
3.474AlaPhe: 3.474 ± 0.824
4.632AlaGly: 4.632 ± 2.043
0.579AlaHis: 0.579 ± 0.39
2.895AlaIle: 2.895 ± 2.279
1.737AlaLys: 1.737 ± 1.116
1.737AlaLeu: 1.737 ± 0.876
2.316AlaMet: 2.316 ± 0.984
2.895AlaAsn: 2.895 ± 1.109
2.316AlaPro: 2.316 ± 0.542
6.369AlaGln: 6.369 ± 4.24
2.316AlaArg: 2.316 ± 0.84
4.632AlaSer: 4.632 ± 1.97
2.316AlaThr: 2.316 ± 0.984
2.316AlaVal: 2.316 ± 1.631
0.579AlaTrp: 0.579 ± 0.39
4.053AlaTyr: 4.053 ± 0.416
0.0AlaXaa: 0.0 ± 0.0
Cys
0.579CysAla: 0.579 ± 0.452
0.0CysCys: 0.0 ± 0.0
2.895CysAsp: 2.895 ± 0.961
0.579CysGlu: 0.579 ± 0.738
2.316CysPhe: 2.316 ± 1.56
1.737CysGly: 1.737 ± 0.665
0.579CysHis: 0.579 ± 0.452
0.0CysIle: 0.0 ± 0.0
1.158CysLys: 1.158 ± 0.414
2.316CysLeu: 2.316 ± 1.807
0.0CysMet: 0.0 ± 0.0
0.579CysAsn: 0.579 ± 0.39
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
1.158CysSer: 1.158 ± 0.414
0.579CysThr: 0.579 ± 0.39
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.579CysTyr: 0.579 ± 0.452
0.0CysXaa: 0.0 ± 0.0
Asp
3.474AspAla: 3.474 ± 0.51
1.158AspCys: 1.158 ± 0.904
6.369AspAsp: 6.369 ± 1.413
3.474AspGlu: 3.474 ± 2.081
4.053AspPhe: 4.053 ± 0.72
1.737AspGly: 1.737 ± 1.355
1.158AspHis: 1.158 ± 0.414
2.895AspIle: 2.895 ± 1.155
5.79AspLys: 5.79 ± 1.247
8.107AspLeu: 8.107 ± 0.914
1.158AspMet: 1.158 ± 0.78
5.79AspAsn: 5.79 ± 0.955
2.895AspPro: 2.895 ± 1.672
2.316AspGln: 2.316 ± 1.021
2.316AspArg: 2.316 ± 0.299
6.369AspSer: 6.369 ± 2.939
4.053AspThr: 4.053 ± 1.122
3.474AspVal: 3.474 ± 1.348
1.737AspTrp: 1.737 ± 0.665
6.369AspTyr: 6.369 ± 2.223
0.0AspXaa: 0.0 ± 0.0
Glu
0.579GluAla: 0.579 ± 0.603
0.0GluCys: 0.0 ± 0.0
4.053GluAsp: 4.053 ± 1.845
1.158GluGlu: 1.158 ± 0.601
2.895GluPhe: 2.895 ± 0.864
2.316GluGly: 2.316 ± 1.202
0.0GluHis: 0.0 ± 0.0
2.316GluIle: 2.316 ± 1.257
4.053GluLys: 4.053 ± 1.206
9.265GluLeu: 9.265 ± 2.384
0.579GluMet: 0.579 ± 0.603
3.474GluAsn: 3.474 ± 2.233
1.158GluPro: 1.158 ± 0.904
1.158GluGln: 1.158 ± 0.511
3.474GluArg: 3.474 ± 1.4
1.158GluSer: 1.158 ± 0.601
2.895GluThr: 2.895 ± 1.137
1.158GluVal: 1.158 ± 0.414
0.579GluTrp: 0.579 ± 0.603
2.895GluTyr: 2.895 ± 1.155
0.0GluXaa: 0.0 ± 0.0
Phe
2.316PheAla: 2.316 ± 1.009
1.158PheCys: 1.158 ± 0.414
5.211PheAsp: 5.211 ± 1.154
2.316PheGlu: 2.316 ± 1.34
3.474PhePhe: 3.474 ± 1.378
5.211PheGly: 5.211 ± 1.988
2.316PheHis: 2.316 ± 0.984
2.316PheIle: 2.316 ± 0.827
0.0PheLys: 0.0 ± 0.0
1.737PheLeu: 1.737 ± 1.17
0.0PheMet: 0.0 ± 0.0
3.474PheAsn: 3.474 ± 1.251
1.737PhePro: 1.737 ± 0.985
2.895PheGln: 2.895 ± 0.354
4.053PheArg: 4.053 ± 1.553
5.211PheSer: 5.211 ± 1.154
1.158PheThr: 1.158 ± 0.78
4.053PheVal: 4.053 ± 1.319
0.0PheTrp: 0.0 ± 0.0
1.737PheTyr: 1.737 ± 1.17
0.0PheXaa: 0.0 ± 0.0
Gly
4.053GlyAla: 4.053 ± 0.685
0.0GlyCys: 0.0 ± 0.0
2.316GlyAsp: 2.316 ± 1.009
2.316GlyGlu: 2.316 ± 0.827
4.053GlyPhe: 4.053 ± 0.728
2.895GlyGly: 2.895 ± 0.64
1.737GlyHis: 1.737 ± 0.773
2.895GlyIle: 2.895 ± 0.681
3.474GlyLys: 3.474 ± 0.824
5.211GlyLeu: 5.211 ± 1.597
1.737GlyMet: 1.737 ± 1.047
2.895GlyAsn: 2.895 ± 0.64
0.0GlyPro: 0.0 ± 0.0
2.895GlyGln: 2.895 ± 0.64
1.737GlyArg: 1.737 ± 0.665
5.211GlySer: 5.211 ± 4.133
2.316GlyThr: 2.316 ± 0.986
3.474GlyVal: 3.474 ± 0.824
0.579GlyTrp: 0.579 ± 0.39
2.316GlyTyr: 2.316 ± 1.009
0.0GlyXaa: 0.0 ± 0.0
His
0.579HisAla: 0.579 ± 0.39
0.579HisCys: 0.579 ± 0.452
1.737HisAsp: 1.737 ± 0.773
0.0HisGlu: 0.0 ± 0.0
1.158HisPhe: 1.158 ± 0.904
0.579HisGly: 0.579 ± 0.39
0.579HisHis: 0.579 ± 0.39
2.316HisIle: 2.316 ± 0.827
1.158HisLys: 1.158 ± 0.904
2.895HisLeu: 2.895 ± 1.037
0.579HisMet: 0.579 ± 0.552
1.158HisAsn: 1.158 ± 0.414
1.158HisPro: 1.158 ± 0.78
0.579HisGln: 0.579 ± 0.39
1.158HisArg: 1.158 ± 0.78
0.0HisSer: 0.0 ± 0.0
1.737HisThr: 1.737 ± 0.665
1.158HisVal: 1.158 ± 0.78
0.579HisTrp: 0.579 ± 0.452
2.316HisTyr: 2.316 ± 1.197
0.0HisXaa: 0.0 ± 0.0
Ile
4.053IleAla: 4.053 ± 1.842
1.737IleCys: 1.737 ± 1.031
3.474IleAsp: 3.474 ± 0.494
1.737IleGlu: 1.737 ± 0.271
2.895IlePhe: 2.895 ± 1.109
2.316IleGly: 2.316 ± 1.631
0.0IleHis: 0.0 ± 0.0
2.895IleIle: 2.895 ± 0.64
2.895IleLys: 2.895 ± 1.636
5.79IleLeu: 5.79 ± 1.684
1.737IleMet: 1.737 ± 0.271
4.053IleAsn: 4.053 ± 1.09
2.895IlePro: 2.895 ± 1.037
4.053IleGln: 4.053 ± 0.685
3.474IleArg: 3.474 ± 0.754
4.632IleSer: 4.632 ± 1.086
4.053IleThr: 4.053 ± 0.72
3.474IleVal: 3.474 ± 0.728
1.158IleTrp: 1.158 ± 0.414
1.158IleTyr: 1.158 ± 0.414
0.0IleXaa: 0.0 ± 0.0
Lys
1.158LysAla: 1.158 ± 1.206
0.0LysCys: 0.0 ± 0.0
5.79LysAsp: 5.79 ± 1.92
3.474LysGlu: 3.474 ± 1.592
1.737LysPhe: 1.737 ± 0.665
4.053LysGly: 4.053 ± 0.416
0.579LysHis: 0.579 ± 0.452
3.474LysIle: 3.474 ± 0.494
1.737LysLys: 1.737 ± 0.876
5.211LysLeu: 5.211 ± 2.203
1.158LysMet: 1.158 ± 1.206
6.369LysAsn: 6.369 ± 1.048
2.895LysPro: 2.895 ± 1.037
2.316LysGln: 2.316 ± 1.257
4.053LysArg: 4.053 ± 1.316
5.79LysSer: 5.79 ± 2.149
2.316LysThr: 2.316 ± 1.308
4.053LysVal: 4.053 ± 1.173
1.737LysTrp: 1.737 ± 0.271
2.895LysTyr: 2.895 ± 0.961
0.0LysXaa: 0.0 ± 0.0
Leu
3.474LeuAla: 3.474 ± 2.256
1.737LeuCys: 1.737 ± 0.773
6.948LeuAsp: 6.948 ± 1.084
4.632LeuGlu: 4.632 ± 1.341
4.053LeuPhe: 4.053 ± 1.965
6.369LeuGly: 6.369 ± 1.278
1.158LeuHis: 1.158 ± 0.414
4.053LeuIle: 4.053 ± 0.72
8.107LeuLys: 8.107 ± 1.456
7.528LeuLeu: 7.528 ± 1.617
4.053LeuMet: 4.053 ± 1.173
4.053LeuAsn: 4.053 ± 2.142
3.474LeuPro: 3.474 ± 1.437
3.474LeuGln: 3.474 ± 1.751
1.737LeuArg: 1.737 ± 0.665
6.948LeuSer: 6.948 ± 1.791
5.211LeuThr: 5.211 ± 1.61
2.895LeuVal: 2.895 ± 1.38
1.737LeuTrp: 1.737 ± 0.665
3.474LeuTyr: 3.474 ± 1.241
0.0LeuXaa: 0.0 ± 0.0
Met
2.895MetAla: 2.895 ± 1.058
0.0MetCys: 0.0 ± 0.0
1.737MetAsp: 1.737 ± 0.271
1.158MetGlu: 1.158 ± 0.601
0.579MetPhe: 0.579 ± 0.603
2.316MetGly: 2.316 ± 0.984
0.0MetHis: 0.0 ± 0.0
1.158MetIle: 1.158 ± 0.511
0.0MetLys: 0.0 ± 0.0
0.0MetLeu: 0.0 ± 0.0
0.579MetMet: 0.579 ± 0.603
1.737MetAsn: 1.737 ± 1.047
3.474MetPro: 3.474 ± 1.017
1.737MetGln: 1.737 ± 1.809
0.579MetArg: 0.579 ± 0.603
2.316MetSer: 2.316 ± 1.257
1.737MetThr: 1.737 ± 1.047
1.737MetVal: 1.737 ± 0.665
0.0MetTrp: 0.0 ± 0.0
1.158MetTyr: 1.158 ± 0.601
0.0MetXaa: 0.0 ± 0.0
Asn
2.895AsnAla: 2.895 ± 1.432
2.316AsnCys: 2.316 ± 1.009
4.632AsnAsp: 4.632 ± 0.853
2.316AsnGlu: 2.316 ± 0.84
1.737AsnPhe: 1.737 ± 1.031
2.895AsnGly: 2.895 ± 0.636
2.895AsnHis: 2.895 ± 1.636
4.053AsnIle: 4.053 ± 1.443
4.632AsnLys: 4.632 ± 1.645
6.369AsnLeu: 6.369 ± 1.088
1.737AsnMet: 1.737 ± 0.968
5.211AsnAsn: 5.211 ± 1.798
2.316AsnPro: 2.316 ± 0.837
3.474AsnGln: 3.474 ± 1.532
1.737AsnArg: 1.737 ± 0.271
3.474AsnSer: 3.474 ± 1.482
5.211AsnThr: 5.211 ± 1.154
2.895AsnVal: 2.895 ± 0.636
2.895AsnTrp: 2.895 ± 0.846
2.895AsnTyr: 2.895 ± 1.137
0.0AsnXaa: 0.0 ± 0.0
Pro
1.737ProAla: 1.737 ± 0.876
0.579ProCys: 0.579 ± 0.39
4.053ProAsp: 4.053 ± 1.79
2.316ProGlu: 2.316 ± 0.827
2.316ProPhe: 2.316 ± 1.56
0.579ProGly: 0.579 ± 0.39
1.158ProHis: 1.158 ± 0.414
4.053ProIle: 4.053 ± 2.351
1.158ProLys: 1.158 ± 0.414
1.737ProLeu: 1.737 ± 1.031
1.737ProMet: 1.737 ± 0.665
4.053ProAsn: 4.053 ± 1.823
0.579ProPro: 0.579 ± 0.452
1.737ProGln: 1.737 ± 0.68
2.316ProArg: 2.316 ± 1.009
4.632ProSer: 4.632 ± 0.475
3.474ProThr: 3.474 ± 1.045
0.0ProVal: 0.0 ± 0.0
0.0ProTrp: 0.0 ± 0.0
1.737ProTyr: 1.737 ± 0.773
0.0ProXaa: 0.0 ± 0.0
Gln
4.053GlnAla: 4.053 ± 0.925
0.0GlnCys: 0.0 ± 0.0
4.053GlnAsp: 4.053 ± 0.685
2.316GlnGlu: 2.316 ± 1.631
1.737GlnPhe: 1.737 ± 0.271
1.158GlnGly: 1.158 ± 0.511
1.158GlnHis: 1.158 ± 0.904
2.895GlnIle: 2.895 ± 2.279
2.895GlnLys: 2.895 ± 0.681
4.053GlnLeu: 4.053 ± 1.122
2.316GlnMet: 2.316 ± 1.69
2.895GlnAsn: 2.895 ± 0.681
1.737GlnPro: 1.737 ± 1.17
3.474GlnGln: 3.474 ± 1.366
4.632GlnArg: 4.632 ± 2.519
4.053GlnSer: 4.053 ± 1.842
1.737GlnThr: 1.737 ± 0.665
2.895GlnVal: 2.895 ± 0.681
1.737GlnTrp: 1.737 ± 0.271
3.474GlnTyr: 3.474 ± 0.541
0.0GlnXaa: 0.0 ± 0.0
Arg
2.895ArgAla: 2.895 ± 0.354
1.737ArgCys: 1.737 ± 0.271
1.737ArgAsp: 1.737 ± 0.773
4.632ArgGlu: 4.632 ± 1.084
3.474ArgPhe: 3.474 ± 1.241
1.158ArgGly: 1.158 ± 0.601
0.579ArgHis: 0.579 ± 0.39
2.895ArgIle: 2.895 ± 0.354
4.632ArgLys: 4.632 ± 1.637
2.316ArgLeu: 2.316 ± 0.542
0.579ArgMet: 0.579 ± 0.603
2.316ArgAsn: 2.316 ± 1.197
2.895ArgPro: 2.895 ± 1.155
1.737ArgGln: 1.737 ± 0.904
0.579ArgArg: 0.579 ± 0.452
2.316ArgSer: 2.316 ± 0.984
0.0ArgThr: 0.0 ± 0.0
1.737ArgVal: 1.737 ± 0.773
0.579ArgTrp: 0.579 ± 0.39
4.053ArgTyr: 4.053 ± 0.578
0.0ArgXaa: 0.0 ± 0.0
Ser
6.369SerAla: 6.369 ± 4.303
0.579SerCys: 0.579 ± 0.452
3.474SerAsp: 3.474 ± 1.045
2.316SerGlu: 2.316 ± 1.69
3.474SerPhe: 3.474 ± 1.017
4.632SerGly: 4.632 ± 1.224
1.737SerHis: 1.737 ± 1.17
5.211SerIle: 5.211 ± 1.597
5.211SerLys: 5.211 ± 1.402
8.686SerLeu: 8.686 ± 2.477
2.316SerMet: 2.316 ± 0.955
3.474SerAsn: 3.474 ± 2.081
4.632SerPro: 4.632 ± 1.772
5.79SerGln: 5.79 ± 3.182
2.316SerArg: 2.316 ± 0.827
6.369SerSer: 6.369 ± 2.63
3.474SerThr: 3.474 ± 1.017
4.632SerVal: 4.632 ± 1.717
0.0SerTrp: 0.0 ± 0.0
5.79SerTyr: 5.79 ± 2.31
0.0SerXaa: 0.0 ± 0.0
Thr
3.474ThrAla: 3.474 ± 1.532
1.158ThrCys: 1.158 ± 0.414
6.948ThrAsp: 6.948 ± 1.145
2.895ThrGlu: 2.895 ± 1.269
1.737ThrPhe: 1.737 ± 0.665
1.158ThrGly: 1.158 ± 0.78
1.158ThrHis: 1.158 ± 0.78
4.053ThrIle: 4.053 ± 0.837
2.316ThrLys: 2.316 ± 1.308
3.474ThrLeu: 3.474 ± 1.096
0.0ThrMet: 0.0 ± 0.0
4.053ThrAsn: 4.053 ± 2.137
2.316ThrPro: 2.316 ± 1.56
2.316ThrGln: 2.316 ± 1.009
1.737ThrArg: 1.737 ± 0.271
6.369ThrSer: 6.369 ± 1.392
2.316ThrThr: 2.316 ± 1.56
0.579ThrVal: 0.579 ± 0.39
1.158ThrTrp: 1.158 ± 0.78
4.053ThrTyr: 4.053 ± 0.948
0.0ThrXaa: 0.0 ± 0.0
Val
3.474ValAla: 3.474 ± 0.728
0.579ValCys: 0.579 ± 0.452
2.895ValAsp: 2.895 ± 0.681
1.737ValGlu: 1.737 ± 0.271
1.737ValPhe: 1.737 ± 1.17
4.053ValGly: 4.053 ± 1.842
1.737ValHis: 1.737 ± 0.665
2.895ValIle: 2.895 ± 1.672
3.474ValLys: 3.474 ± 2.402
2.316ValLeu: 2.316 ± 1.505
0.579ValMet: 0.579 ± 0.62
3.474ValAsn: 3.474 ± 0.51
2.895ValPro: 2.895 ± 1.432
1.737ValGln: 1.737 ± 0.68
1.737ValArg: 1.737 ± 1.355
5.211ValSer: 5.211 ± 1.835
3.474ValThr: 3.474 ± 0.728
1.158ValVal: 1.158 ± 0.601
0.0ValTrp: 0.0 ± 0.0
1.737ValTyr: 1.737 ± 0.773
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.579TrpAsp: 0.579 ± 0.39
1.158TrpGlu: 1.158 ± 0.601
1.158TrpPhe: 1.158 ± 0.414
0.579TrpGly: 0.579 ± 0.39
0.579TrpHis: 0.579 ± 0.452
0.579TrpIle: 0.579 ± 0.603
1.737TrpLys: 1.737 ± 0.876
2.316TrpLeu: 2.316 ± 1.009
0.579TrpMet: 0.579 ± 0.39
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.737TrpArg: 1.737 ± 0.665
1.158TrpSer: 1.158 ± 0.78
2.895TrpThr: 2.895 ± 0.64
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
1.158TrpTyr: 1.158 ± 0.601
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.158TyrAla: 1.158 ± 0.511
1.158TyrCys: 1.158 ± 0.414
4.053TyrAsp: 4.053 ± 0.837
2.895TyrGlu: 2.895 ± 1.533
2.316TyrPhe: 2.316 ± 1.009
1.737TyrGly: 1.737 ± 0.665
2.316TyrHis: 2.316 ± 0.827
4.632TyrIle: 4.632 ± 1.826
4.632TyrLys: 4.632 ± 1.654
4.053TyrLeu: 4.053 ± 1.079
0.579TyrMet: 0.579 ± 0.452
4.632TyrAsn: 4.632 ± 1.654
0.579TyrPro: 0.579 ± 0.39
5.211TyrGln: 5.211 ± 0.474
1.158TyrArg: 1.158 ± 0.414
3.474TyrSer: 3.474 ± 0.944
2.316TyrThr: 2.316 ± 0.984
5.79TyrVal: 5.79 ± 3.192
1.158TyrTrp: 1.158 ± 0.414
4.053TyrTyr: 4.053 ± 0.72
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1728 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski