Amino acid dipeptide frequency for Microviridae Fen2266_11

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
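As an illustration of the calculation described above, here is a minimal Python sketch that counts overlapping dipeptides and compares them with the uniform expectation of 1/400. The function name `dipeptide_permille`, the toy sequences, and the choice to pool counts over all proteins are assumptions made for this example; the page does not describe how Proteome-pI computes its values.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    Assumption: sequences use one-letter codes and counts are pooled;
    this is not necessarily the database's own procedure.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400

# Hypothetical toy sequences, not taken from Fen2266_11.
toy_proteins = ["MAALAK", "MKAAL"]
freqs = dipeptide_permille(toy_proteins)

# Dipeptides above / below the uniform expectation (red / blue in the table).
over = sorted(dp for dp, f in freqs.items() if f > EXPECTED_PERMILLE)
under = sorted(dp for dp, f in freqs.items() if f < EXPECTED_PERMILLE)
```

With only a handful of residues every observed dipeptide lands far above 2.5‰, so the over/under split only becomes informative for whole proteomes.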

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
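The unit conversion can be checked with a short worked example in Python. The AlaAla value is copied from the table; the helper name `permille_to_fraction` is made up for this sketch.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


ala_ala_permille = 8.746                      # AlaAla entry from the table
ala_ala_fraction = permille_to_fraction(ala_ala_permille)
ala_ala_percent = ala_ala_fraction * 100      # same value in percent

uniform_fraction = 1 / 400                    # 0.25% expected for 400 dipeptides
fold_over_uniform = ala_ala_fraction / uniform_fraction
```

Under these numbers AlaAla has a fraction of about 0.0087 (0.87%), roughly 3.5 times the uniform expectation of 0.0025.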

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second; each cell is frequency ± error (‰).

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 8.746 ± 4.587 | 2.915 ± 2.014 | 5.831 ± 1.212 | 8.017 ± 2.202 | 2.915 ± 0.902 | 7.289 ± 2.606 | 0.0 ± 0.0 | 4.373 ± 1.49 | 5.102 ± 4.306 | 8.746 ± 0.811 | 5.102 ± 2.223 | 8.746 ± 2.9 | 2.187 ± 1.105 | 8.746 ± 4.298 | 2.915 ± 1.51 | 5.102 ± 1.321 | 4.373 ± 0.832 | 6.56 ± 1.452 | 0.729 ± 0.747 | 2.915 ± 1.132 | 0.0 ± 0.0
Cys | 2.187 ± 0.868 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.458 ± 1.495 | 0.0 ± 0.0 | 0.729 ± 0.502 | 0.729 ± 0.747 | 2.187 ± 1.293 | 1.458 ± 1.277 | 0.0 ± 0.0 | 0.729 ± 0.747 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.729 ± 0.502 | 0.729 ± 0.502 | 0.0 ± 0.0 | 0.0 ± 0.0
Asp | 3.644 ± 1.257 | 0.0 ± 0.0 | 4.373 ± 1.905 | 2.187 ± 1.001 | 5.102 ± 2.555 | 3.644 ± 1.654 | 4.373 ± 0.925 | 1.458 ± 1.233 | 3.644 ± 1.79 | 5.831 ± 1.098 | 2.187 ± 2.235 | 4.373 ± 0.925 | 2.915 ± 1.173 | 2.187 ± 1.531 | 5.102 ± 2.358 | 2.187 ± 0.516 | 4.373 ± 1.789 | 3.644 ± 1.389 | 0.729 ± 0.502 | 3.644 ± 1.427 | 0.0 ± 0.0
Glu | 8.017 ± 2.918 | 0.729 ± 0.502 | 4.373 ± 1.388 | 2.187 ± 0.868 | 1.458 ± 0.635 | 2.187 ± 0.868 | 0.729 ± 0.502 | 2.915 ± 2.098 | 0.0 ± 0.0 | 3.644 ± 2.502 | 0.729 ± 0.502 | 0.0 ± 0.0 | 1.458 ± 0.922 | 2.915 ± 1.327 | 2.915 ± 1.132 | 0.729 ± 0.502 | 0.729 ± 0.98 | 3.644 ± 1.413 | 0.729 ± 0.502 | 2.187 ± 0.516 | 0.0 ± 0.0
Phe | 5.831 ± 1.636 | 0.729 ± 0.747 | 2.915 ± 0.993 | 0.729 ± 1.134 | 2.915 ± 1.27 | 1.458 ± 0.635 | 1.458 ± 1.005 | 2.187 ± 0.895 | 2.915 ± 1.66 | 0.729 ± 0.502 | 0.729 ± 0.98 | 0.729 ± 0.502 | 2.915 ± 1.267 | 4.373 ± 1.777 | 1.458 ± 0.635 | 0.729 ± 0.502 | 2.187 ± 0.944 | 5.102 ± 1.698 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Gly | 2.915 ± 0.561 | 0.0 ± 0.0 | 5.102 ± 1.868 | 2.187 ± 1.097 | 2.187 ± 0.516 | 4.373 ± 1.339 | 0.729 ± 0.502 | 2.187 ± 0.895 | 2.915 ± 2.465 | 4.373 ± 0.956 | 1.458 ± 0.849 | 5.102 ± 2.223 | 5.102 ± 1.76 | 1.458 ± 0.635 | 0.729 ± 0.747 | 5.831 ± 1.833 | 3.644 ± 1.796 | 5.102 ± 1.56 | 0.729 ± 0.502 | 5.102 ± 1.32 | 0.0 ± 0.0
His | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.458 ± 0.587 | 0.0 ± 0.0 | 1.458 ± 1.141 | 1.458 ± 1.005 | 0.0 ± 0.0 | 1.458 ± 1.005 | 1.458 ± 0.635 | 2.915 ± 1.503 | 0.729 ± 1.155 | 0.729 ± 0.502 | 1.458 ± 0.849 | 0.0 ± 0.0 | 0.729 ± 0.502 | 0.729 ± 0.747 | 0.0 ± 0.0 | 1.458 ± 1.005 | 2.187 ± 1.507 | 2.187 ± 0.868 | 0.0 ± 0.0
Ile | 4.373 ± 0.832 | 0.729 ± 0.502 | 2.915 ± 2.139 | 2.187 ± 1.507 | 0.729 ± 0.502 | 4.373 ± 1.031 | 0.0 ± 0.0 | 1.458 ± 0.635 | 1.458 ± 0.635 | 8.017 ± 1.632 | 2.187 ± 1.04 | 3.644 ± 0.813 | 2.187 ± 0.895 | 2.187 ± 1.879 | 1.458 ± 1.186 | 5.831 ± 1.713 | 3.644 ± 1.362 | 2.915 ± 1.311 | 0.0 ± 0.0 | 2.915 ± 1.142 | 0.0 ± 0.0
Lys | 5.102 ± 2.094 | 1.458 ± 0.635 | 1.458 ± 0.635 | 1.458 ± 0.635 | 3.644 ± 2.078 | 2.187 ± 1.001 | 0.729 ± 0.98 | 5.102 ± 1.567 | 2.915 ± 1.66 | 5.102 ± 0.915 | 2.915 ± 1.346 | 2.187 ± 1.234 | 1.458 ± 0.922 | 1.458 ± 2.268 | 3.644 ± 2.667 | 1.458 ± 0.849 | 2.915 ± 2.113 | 1.458 ± 0.587 | 0.0 ± 0.0 | 1.458 ± 0.849 | 0.0 ± 0.0
Leu | 4.373 ± 2.001 | 0.729 ± 0.747 | 5.102 ± 2.422 | 1.458 ± 1.005 | 1.458 ± 1.495 | 7.289 ± 2.152 | 0.0 ± 0.0 | 5.102 ± 1.229 | 5.102 ± 1.657 | 7.289 ± 3.352 | 2.915 ± 2.014 | 5.831 ± 1.081 | 7.289 ± 2.406 | 4.373 ± 0.956 | 1.458 ± 0.922 | 8.746 ± 1.729 | 7.289 ± 1.562 | 5.102 ± 1.8 | 1.458 ± 0.635 | 0.0 ± 0.0 | 0.0 ± 0.0
Met | 3.644 ± 1.312 | 1.458 ± 1.125 | 2.187 ± 1.292 | 2.187 ± 0.895 | 2.187 ± 1.293 | 0.729 ± 0.747 | 2.187 ± 1.507 | 1.458 ± 0.922 | 0.0 ± 0.0 | 2.187 ± 1.472 | 2.187 ± 1.292 | 2.187 ± 0.516 | 1.458 ± 0.587 | 3.644 ± 1.812 | 2.187 ± 0.516 | 2.187 ± 2.296 | 2.187 ± 1.234 | 0.0 ± 0.0 | 0.729 ± 0.626 | 0.0 ± 0.0 | 0.0 ± 0.0
Asn | 8.017 ± 2.994 | 0.0 ± 0.0 | 2.187 ± 1.672 | 1.458 ± 0.587 | 1.458 ± 0.849 | 2.915 ± 1.27 | 0.0 ± 0.0 | 0.729 ± 1.134 | 3.644 ± 1.217 | 5.831 ± 1.484 | 2.915 ± 1.709 | 5.831 ± 2.647 | 1.458 ± 1.252 | 2.915 ± 1.311 | 2.187 ± 0.895 | 2.915 ± 0.955 | 4.373 ± 1.76 | 2.915 ± 1.173 | 1.458 ± 0.635 | 3.644 ± 1.435 | 0.0 ± 0.0
Pro | 7.289 ± 3.314 | 0.729 ± 0.747 | 2.915 ± 2.311 | 2.187 ± 1.001 | 1.458 ± 0.635 | 2.915 ± 1.142 | 2.915 ± 0.955 | 5.102 ± 1.293 | 2.915 ± 0.902 | 0.729 ± 0.747 | 1.458 ± 0.587 | 2.915 ± 0.561 | 1.458 ± 1.005 | 2.915 ± 0.993 | 2.187 ± 0.868 | 4.373 ± 1.345 | 1.458 ± 0.587 | 2.915 ± 1.267 | 1.458 ± 1.005 | 1.458 ± 0.587 | 0.0 ± 0.0
Gln | 8.017 ± 4.037 | 0.0 ± 0.0 | 3.644 ± 1.401 | 2.915 ± 1.483 | 1.458 ± 1.125 | 2.915 ± 2.009 | 0.729 ± 0.502 | 1.458 ± 0.849 | 2.187 ± 1.324 | 3.644 ± 2.431 | 0.0 ± 0.0 | 3.644 ± 2.034 | 3.644 ± 1.259 | 2.915 ± 2.505 | 4.373 ± 1.045 | 2.915 ± 2.083 | 2.915 ± 2.505 | 2.915 ± 1.173 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Arg | 3.644 ± 1.23 | 0.729 ± 0.747 | 2.915 ± 2.014 | 0.729 ± 1.134 | 0.729 ± 0.502 | 0.729 ± 0.502 | 1.458 ± 1.233 | 4.373 ± 1.642 | 3.644 ± 2.476 | 5.102 ± 1.868 | 1.458 ± 1.005 | 2.187 ± 0.516 | 3.644 ± 1.895 | 0.729 ± 0.626 | 1.458 ± 1.277 | 0.729 ± 0.502 | 2.187 ± 0.868 | 2.915 ± 2.139 | 0.0 ± 0.0 | 5.102 ± 1.589 | 0.0 ± 0.0
Ser | 10.204 ± 3.763 | 0.0 ± 0.0 | 4.373 ± 1.388 | 4.373 ± 1.735 | 1.458 ± 1.252 | 2.915 ± 2.009 | 1.458 ± 1.005 | 2.187 ± 1.105 | 2.915 ± 1.142 | 4.373 ± 2.003 | 2.187 ± 1.105 | 2.187 ± 1.105 | 5.102 ± 0.829 | 1.458 ± 1.141 | 3.644 ± 1.974 | 8.017 ± 3.159 | 5.102 ± 1.293 | 4.373 ± 0.806 | 0.729 ± 0.747 | 0.729 ± 0.747 | 0.0 ± 0.0
Thr | 5.102 ± 1.872 | 0.0 ± 0.0 | 5.102 ± 2.207 | 2.915 ± 1.845 | 2.915 ± 1.267 | 4.373 ± 0.806 | 0.729 ± 0.502 | 2.915 ± 1.173 | 2.187 ± 0.516 | 2.915 ± 1.27 | 0.729 ± 0.747 | 2.915 ± 1.844 | 2.915 ± 0.995 | 1.458 ± 0.587 | 2.187 ± 1.507 | 5.831 ± 2.346 | 6.56 ± 2.146 | 5.102 ± 1.956 | 0.729 ± 0.626 | 1.458 ± 1.495 | 0.0 ± 0.0
Val | 5.831 ± 1.935 | 0.729 ± 0.502 | 5.831 ± 2.304 | 2.915 ± 1.128 | 2.187 ± 0.868 | 4.373 ± 0.832 | 0.0 ± 0.0 | 3.644 ± 0.708 | 3.644 ± 1.827 | 5.102 ± 1.76 | 2.187 ± 0.868 | 2.187 ± 1.097 | 3.644 ± 1.389 | 4.373 ± 1.33 | 2.915 ± 0.561 | 6.56 ± 1.401 | 2.187 ± 1.116 | 1.458 ± 1.277 | 0.0 ± 0.0 | 0.729 ± 0.502 | 0.0 ± 0.0
Trp | 1.458 ± 1.005 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.729 ± 0.502 | 1.458 ± 1.005 | 1.458 ± 1.005 | 1.458 ± 0.635 | 1.458 ± 0.849 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.729 ± 0.502 | 0.729 ± 0.626 | 1.458 ± 0.635 | 0.0 ± 0.0 | 0.729 ± 0.502 | 0.729 ± 0.747 | 0.0 ± 0.0 | 0.729 ± 0.502 | 0.0 ± 0.0
Tyr | 2.915 ± 1.267 | 0.0 ± 0.0 | 2.915 ± 1.66 | 1.458 ± 0.587 | 2.915 ± 1.267 | 2.187 ± 1.292 | 2.187 ± 2.242 | 2.915 ± 1.267 | 0.729 ± 0.502 | 3.644 ± 1.722 | 0.729 ± 0.747 | 1.458 ± 1.252 | 0.0 ± 0.0 | 1.458 ± 0.635 | 1.458 ± 0.635 | 2.915 ± 0.892 | 2.187 ± 0.868 | 1.458 ± 0.849 | 0.729 ± 0.502 | 1.458 ± 0.635 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 5 proteins (1373 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
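A protein-level bootstrap of this kind can be sketched in Python as follows. The source states only that bootstrapping was repeated 100 times at the protein level; resampling whole proteins with replacement and reporting the standard deviation of the resampled frequencies are assumptions of this sketch, as are the function names and the toy sequences.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille, pooled over the proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: each round draws len(proteins) whole proteins with
    replacement; the database's exact procedure is not documented here.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy sequences (one-letter codes), not from Fen2266_11.
toy_proteins = ["MAALAK", "MKAAL", "MGGSAA"]
err_aa = bootstrap_error(toy_proteins, "AA")
err_ww = bootstrap_error(toy_proteins, "WW")  # absent dipeptide
```

A dipeptide that never occurs gets a frequency of 0 in every resample, which would explain the 0.0 ± 0.0 entries in the table.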

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski