Amino acid dipepetide frequency for Mirim virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.549AlaAla: 2.549 ± 3.063
3.059AlaCys: 3.059 ± 1.769
0.765AlaAsp: 0.765 ± 0.355
2.294AlaGlu: 2.294 ± 1.374
2.294AlaPhe: 2.294 ± 0.514
1.784AlaGly: 1.784 ± 1.448
1.784AlaHis: 1.784 ± 0.387
4.588AlaIle: 4.588 ± 0.16
3.824AlaLys: 3.824 ± 1.956
6.118AlaLeu: 6.118 ± 0.395
0.765AlaMet: 0.765 ± 0.172
2.804AlaAsn: 2.804 ± 0.596
1.02AlaPro: 1.02 ± 0.956
1.275AlaGln: 1.275 ± 0.791
2.549AlaArg: 2.549 ± 2.169
4.079AlaSer: 4.079 ± 0.822
3.059AlaThr: 3.059 ± 1.634
2.294AlaVal: 2.294 ± 0.823
0.255AlaTrp: 0.255 ± 0.158
1.784AlaTyr: 1.784 ± 0.569
0.0AlaXaa: 0.0 ± 0.0
Cys
1.02CysAla: 1.02 ± 0.302
0.255CysCys: 0.255 ± 0.158
1.275CysAsp: 1.275 ± 0.827
1.02CysGlu: 1.02 ± 0.956
2.294CysPhe: 2.294 ± 1.416
3.059CysGly: 3.059 ± 1.769
0.51CysHis: 0.51 ± 0.136
2.549CysIle: 2.549 ± 0.678
2.294CysLys: 2.294 ± 1.416
1.275CysLeu: 1.275 ± 0.481
0.51CysMet: 0.51 ± 0.136
2.294CysAsn: 2.294 ± 0.517
1.02CysPro: 1.02 ± 0.271
1.529CysGln: 1.529 ± 0.71
1.275CysArg: 1.275 ± 1.195
1.529CysSer: 1.529 ± 0.407
1.784CysThr: 1.784 ± 0.611
1.784CysVal: 1.784 ± 1.303
0.0CysTrp: 0.0 ± 0.0
1.02CysTyr: 1.02 ± 0.59
0.0CysXaa: 0.0 ± 0.0
Asp
2.294AspAla: 2.294 ± 0.525
0.51AspCys: 0.51 ± 0.136
2.549AspAsp: 2.549 ± 2.117
4.588AspGlu: 4.588 ± 1.028
3.314AspPhe: 3.314 ± 1.361
1.529AspGly: 1.529 ± 0.67
0.765AspHis: 0.765 ± 0.355
7.647AspIle: 7.647 ± 1.427
3.314AspLys: 3.314 ± 0.692
6.882AspLeu: 6.882 ± 0.744
2.294AspMet: 2.294 ± 0.75
3.569AspAsn: 3.569 ± 1.18
2.039AspPro: 2.039 ± 0.604
1.02AspGln: 1.02 ± 0.271
1.529AspArg: 1.529 ± 0.603
2.804AspSer: 2.804 ± 0.299
1.529AspThr: 1.529 ± 0.407
2.294AspVal: 2.294 ± 0.75
0.51AspTrp: 0.51 ± 0.316
2.804AspTyr: 2.804 ± 0.596
0.0AspXaa: 0.0 ± 0.0
Glu
3.314GluAla: 3.314 ± 0.539
1.784GluCys: 1.784 ± 1.303
3.314GluAsp: 3.314 ± 1.361
3.824GluGlu: 3.824 ± 0.195
6.118GluPhe: 6.118 ± 1.604
1.529GluGly: 1.529 ± 0.407
2.294GluHis: 2.294 ± 1.065
6.628GluIle: 6.628 ± 0.978
4.079GluLys: 4.079 ± 0.93
7.647GluLeu: 7.647 ± 2.132
3.314GluMet: 3.314 ± 0.669
3.314GluAsn: 3.314 ± 1.315
1.275GluPro: 1.275 ± 0.791
2.294GluGln: 2.294 ± 0.588
1.784GluArg: 1.784 ± 0.758
4.333GluSer: 4.333 ± 0.474
2.804GluThr: 2.804 ± 0.877
4.079GluVal: 4.079 ± 1.585
0.255GluTrp: 0.255 ± 0.158
2.294GluTyr: 2.294 ± 0.525
0.0GluXaa: 0.0 ± 0.0
Phe
1.275PheAla: 1.275 ± 0.267
2.549PheCys: 2.549 ± 0.9
3.059PheAsp: 3.059 ± 1.063
2.804PheGlu: 2.804 ± 0.454
3.569PhePhe: 3.569 ± 0.431
2.549PheGly: 2.549 ± 1.277
0.765PheHis: 0.765 ± 0.355
3.824PheIle: 3.824 ± 1.066
5.608PheLys: 5.608 ± 0.298
6.118PheLeu: 6.118 ± 2.253
1.529PheMet: 1.529 ± 0.345
3.314PheAsn: 3.314 ± 0.804
0.765PhePro: 0.765 ± 0.781
1.275PheGln: 1.275 ± 0.45
2.804PheArg: 2.804 ± 1.518
5.353PheSer: 5.353 ± 1.207
2.804PheThr: 2.804 ± 0.765
2.549PheVal: 2.549 ± 0.674
0.51PheTrp: 0.51 ± 0.316
2.549PheTyr: 2.549 ± 0.397
0.0PheXaa: 0.0 ± 0.0
Gly
1.529GlyAla: 1.529 ± 0.407
1.784GlyCys: 1.784 ± 0.943
3.059GlyAsp: 3.059 ± 1.339
5.098GlyGlu: 5.098 ± 0.634
1.02GlyPhe: 1.02 ± 0.271
0.51GlyGly: 0.51 ± 0.316
1.02GlyHis: 1.02 ± 0.302
4.588GlyIle: 4.588 ± 1.745
2.039GlyLys: 2.039 ± 0.468
4.079GlyLeu: 4.079 ± 0.44
0.255GlyMet: 0.255 ± 0.239
2.804GlyAsn: 2.804 ± 1.052
1.784GlyPro: 1.784 ± 0.943
1.529GlyGln: 1.529 ± 0.345
1.529GlyArg: 1.529 ± 0.407
4.079GlySer: 4.079 ± 1.332
1.529GlyThr: 1.529 ± 1.294
1.529GlyVal: 1.529 ± 0.71
0.51GlyTrp: 0.51 ± 0.316
2.039GlyTyr: 2.039 ± 1.68
0.0GlyXaa: 0.0 ± 0.0
His
1.529HisAla: 1.529 ± 1.065
0.765HisCys: 0.765 ± 0.355
0.765HisAsp: 0.765 ± 0.355
1.275HisGlu: 1.275 ± 0.481
1.529HisPhe: 1.529 ± 0.603
1.529HisGly: 1.529 ± 0.603
1.02HisHis: 1.02 ± 0.741
0.765HisIle: 0.765 ± 0.717
2.294HisLys: 2.294 ± 0.75
1.529HisLeu: 1.529 ± 0.933
0.765HisMet: 0.765 ± 0.172
2.039HisAsn: 2.039 ± 0.915
0.51HisPro: 0.51 ± 0.81
0.51HisGln: 0.51 ± 0.316
0.51HisArg: 0.51 ± 0.81
1.784HisSer: 1.784 ± 0.569
1.275HisThr: 1.275 ± 0.481
0.765HisVal: 0.765 ± 0.172
0.51HisTrp: 0.51 ± 0.316
1.275HisTyr: 1.275 ± 0.481
0.0HisXaa: 0.0 ± 0.0
Ile
3.824IleAla: 3.824 ± 1.049
0.765IleCys: 0.765 ± 0.717
3.059IleAsp: 3.059 ± 1.206
7.392IleGlu: 7.392 ± 0.285
5.098IlePhe: 5.098 ± 0.794
4.333IleGly: 4.333 ± 0.672
1.784IleHis: 1.784 ± 0.526
6.882IleIle: 6.882 ± 1.451
8.157IleLys: 8.157 ± 1.414
8.922IleLeu: 8.922 ± 2.12
1.275IleMet: 1.275 ± 0.299
3.824IleAsn: 3.824 ± 0.223
4.333IlePro: 4.333 ± 1.189
2.804IleGln: 2.804 ± 0.596
4.079IleArg: 4.079 ± 1.089
7.647IleSer: 7.647 ± 0.724
5.353IleThr: 5.353 ± 0.247
3.569IleVal: 3.569 ± 1.137
0.765IleTrp: 0.765 ± 0.172
2.039IleTyr: 2.039 ± 0.535
0.0IleXaa: 0.0 ± 0.0
Lys
4.588LysAla: 4.588 ± 1.289
2.549LysCys: 2.549 ± 1.654
5.353LysAsp: 5.353 ± 0.841
7.137LysGlu: 7.137 ± 0.634
3.059LysPhe: 3.059 ± 0.908
4.588LysGly: 4.588 ± 1.028
1.529LysHis: 1.529 ± 0.67
5.863LysIle: 5.863 ± 0.623
6.882LysLys: 6.882 ± 0.988
8.667LysLeu: 8.667 ± 0.935
4.079LysMet: 4.079 ± 1.916
3.824LysAsn: 3.824 ± 0.564
2.549LysPro: 2.549 ± 0.4
3.569LysGln: 3.569 ± 0.931
1.784LysArg: 1.784 ± 0.758
6.118LysSer: 6.118 ± 0.557
6.118LysThr: 6.118 ± 1.3
3.824LysVal: 3.824 ± 0.594
0.765LysTrp: 0.765 ± 0.172
2.804LysTyr: 2.804 ± 0.765
0.0LysXaa: 0.0 ± 0.0
Leu
5.353LeuAla: 5.353 ± 1.868
2.039LeuCys: 2.039 ± 0.835
6.118LeuAsp: 6.118 ± 2.099
8.157LeuGlu: 8.157 ± 1.765
4.843LeuPhe: 4.843 ± 0.782
3.059LeuGly: 3.059 ± 0.268
2.039LeuHis: 2.039 ± 0.535
6.882LeuIle: 6.882 ± 1.451
8.922LeuLys: 8.922 ± 1.241
8.667LeuLeu: 8.667 ± 1.927
1.529LeuMet: 1.529 ± 0.67
5.608LeuAsn: 5.608 ± 1.252
4.333LeuPro: 4.333 ± 0.862
2.294LeuGln: 2.294 ± 0.406
3.059LeuArg: 3.059 ± 2.91
8.412LeuSer: 8.412 ± 0.348
6.882LeuThr: 6.882 ± 0.705
4.843LeuVal: 4.843 ± 0.782
0.255LeuTrp: 0.255 ± 0.158
2.039LeuTyr: 2.039 ± 0.604
0.0LeuXaa: 0.0 ± 0.0
Met
1.529MetAla: 1.529 ± 0.67
1.275MetCys: 1.275 ± 0.481
1.529MetAsp: 1.529 ± 0.345
2.039MetGlu: 2.039 ± 0.542
1.02MetPhe: 1.02 ± 0.805
0.765MetGly: 0.765 ± 0.355
0.255MetHis: 0.255 ± 0.158
3.314MetIle: 3.314 ± 1.256
2.804MetLys: 2.804 ± 2.062
3.314MetLeu: 3.314 ± 0.322
1.02MetMet: 1.02 ± 0.302
1.275MetAsn: 1.275 ± 0.267
0.51MetPro: 0.51 ± 0.136
0.765MetGln: 0.765 ± 0.172
1.529MetArg: 1.529 ± 0.603
2.039MetSer: 2.039 ± 0.915
2.294MetThr: 2.294 ± 0.525
1.529MetVal: 1.529 ± 0.407
0.0MetTrp: 0.0 ± 0.0
0.255MetTyr: 0.255 ± 0.158
0.0MetXaa: 0.0 ± 0.0
Asn
2.039AsnAla: 2.039 ± 0.803
1.784AsnCys: 1.784 ± 0.611
2.804AsnAsp: 2.804 ± 1.052
3.569AsnGlu: 3.569 ± 1.204
3.569AsnPhe: 3.569 ± 0.802
1.784AsnGly: 1.784 ± 1.303
1.784AsnHis: 1.784 ± 0.722
3.824AsnIle: 3.824 ± 0.8
5.863AsnLys: 5.863 ± 1.966
3.824AsnLeu: 3.824 ± 0.223
2.549AsnMet: 2.549 ± 0.855
5.098AsnAsn: 5.098 ± 0.24
2.549AsnPro: 2.549 ± 0.4
2.294AsnGln: 2.294 ± 1.423
2.294AsnArg: 2.294 ± 0.75
2.804AsnSer: 2.804 ± 1.189
2.804AsnThr: 2.804 ± 1.146
2.039AsnVal: 2.039 ± 0.428
1.275AsnTrp: 1.275 ± 0.481
1.784AsnTyr: 1.784 ± 0.465
0.0AsnXaa: 0.0 ± 0.0
Pro
2.039ProAla: 2.039 ± 0.803
0.0ProCys: 0.0 ± 0.0
2.039ProAsp: 2.039 ± 1.433
2.039ProGlu: 2.039 ± 1.362
2.039ProPhe: 2.039 ± 0.542
2.039ProGly: 2.039 ± 0.468
1.02ProHis: 1.02 ± 0.271
3.569ProIle: 3.569 ± 1.127
2.039ProLys: 2.039 ± 0.835
2.039ProLeu: 2.039 ± 0.666
0.255ProMet: 0.255 ± 0.239
1.529ProAsn: 1.529 ± 0.345
0.51ProPro: 0.51 ± 0.316
0.51ProGln: 0.51 ± 0.79
0.51ProArg: 0.51 ± 0.136
2.804ProSer: 2.804 ± 1.052
3.569ProThr: 3.569 ± 0.931
1.529ProVal: 1.529 ± 0.345
0.765ProTrp: 0.765 ± 0.781
1.02ProTyr: 1.02 ± 0.302
0.0ProXaa: 0.0 ± 0.0
Gln
1.784GlnAla: 1.784 ± 0.611
0.51GlnCys: 0.51 ± 0.316
3.824GlnAsp: 3.824 ± 1.324
1.275GlnGlu: 1.275 ± 0.45
1.275GlnPhe: 1.275 ± 0.45
0.765GlnGly: 0.765 ± 0.355
1.02GlnHis: 1.02 ± 0.59
3.314GlnIle: 3.314 ± 1.384
2.549GlnLys: 2.549 ± 0.9
2.039GlnLeu: 2.039 ± 0.604
0.765GlnMet: 0.765 ± 0.781
1.529GlnAsn: 1.529 ± 0.71
0.765GlnPro: 0.765 ± 0.86
1.275GlnGln: 1.275 ± 1.563
1.784GlnArg: 1.784 ± 0.758
2.549GlnSer: 2.549 ± 1.02
2.549GlnThr: 2.549 ± 0.534
1.275GlnVal: 1.275 ± 0.267
0.255GlnTrp: 0.255 ± 0.829
1.02GlnTyr: 1.02 ± 0.672
0.0GlnXaa: 0.0 ± 0.0
Arg
1.784ArgAla: 1.784 ± 1.392
1.529ArgCys: 1.529 ± 0.407
2.039ArgAsp: 2.039 ± 0.915
2.294ArgGlu: 2.294 ± 0.75
2.039ArgPhe: 2.039 ± 0.428
0.255ArgGly: 0.255 ± 0.239
1.275ArgHis: 1.275 ± 0.791
3.824ArgIle: 3.824 ± 1.248
3.569ArgLys: 3.569 ± 1.223
4.079ArgLeu: 4.079 ± 1.47
1.529ArgMet: 1.529 ± 0.345
2.039ArgAsn: 2.039 ± 0.915
0.51ArgPro: 0.51 ± 0.136
1.02ArgGln: 1.02 ± 0.805
1.02ArgArg: 1.02 ± 0.59
2.294ArgSer: 2.294 ± 0.904
0.765ArgThr: 0.765 ± 0.727
2.549ArgVal: 2.549 ± 1.277
0.0ArgTrp: 0.0 ± 0.0
1.529ArgTyr: 1.529 ± 0.67
0.0ArgXaa: 0.0 ± 0.0
Ser
3.824SerAla: 3.824 ± 0.594
1.784SerCys: 1.784 ± 0.611
5.608SerAsp: 5.608 ± 1.317
4.079SerGlu: 4.079 ± 0.707
4.588SerPhe: 4.588 ± 1.327
2.294SerGly: 2.294 ± 0.406
1.529SerHis: 1.529 ± 0.345
7.137SerIle: 7.137 ± 0.514
8.412SerLys: 8.412 ± 0.798
8.412SerLeu: 8.412 ± 1.896
2.294SerMet: 2.294 ± 1.374
2.804SerAsn: 2.804 ± 0.877
2.039SerPro: 2.039 ± 1.265
1.529SerGln: 1.529 ± 2.37
3.569SerArg: 3.569 ± 1.199
4.333SerSer: 4.333 ± 1.237
5.608SerThr: 5.608 ± 1.29
3.824SerVal: 3.824 ± 0.943
0.255SerTrp: 0.255 ± 0.158
2.549SerTyr: 2.549 ± 1.298
0.0SerXaa: 0.0 ± 0.0
Thr
3.569ThrAla: 3.569 ± 0.75
2.549ThrCys: 2.549 ± 2.019
3.824ThrAsp: 3.824 ± 0.974
3.314ThrGlu: 3.314 ± 0.804
3.059ThrPhe: 3.059 ± 0.905
2.804ThrGly: 2.804 ± 0.855
0.765ThrHis: 0.765 ± 0.355
5.608ThrIle: 5.608 ± 1.192
5.863ThrLys: 5.863 ± 1.437
3.824ThrLeu: 3.824 ± 0.966
0.765ThrMet: 0.765 ± 0.172
3.059ThrAsn: 3.059 ± 2.25
2.294ThrPro: 2.294 ± 0.517
2.039ThrGln: 2.039 ± 0.604
2.294ThrArg: 2.294 ± 0.588
5.098ThrSer: 5.098 ± 1.267
4.079ThrThr: 4.079 ± 2.34
2.294ThrVal: 2.294 ± 0.514
1.529ThrTrp: 1.529 ± 0.67
3.314ThrTyr: 3.314 ± 0.692
0.0ThrXaa: 0.0 ± 0.0
Val
2.804ValAla: 2.804 ± 1.194
1.529ValCys: 1.529 ± 0.71
1.02ValAsp: 1.02 ± 0.271
1.529ValGlu: 1.529 ± 0.345
2.039ValPhe: 2.039 ± 1.138
3.569ValGly: 3.569 ± 0.431
0.765ValHis: 0.765 ± 0.474
2.294ValIle: 2.294 ± 0.514
2.804ValLys: 2.804 ± 0.877
3.314ValLeu: 3.314 ± 1.315
1.02ValMet: 1.02 ± 0.302
3.059ValAsn: 3.059 ± 1.177
1.784ValPro: 1.784 ± 0.526
2.549ValGln: 2.549 ± 1.75
1.275ValArg: 1.275 ± 0.267
5.353ValSer: 5.353 ± 0.665
4.588ValThr: 4.588 ± 1.028
1.784ValVal: 1.784 ± 0.465
0.0ValTrp: 0.0 ± 0.0
2.804ValTyr: 2.804 ± 0.877
0.0ValXaa: 0.0 ± 0.0
Trp
0.255TrpAla: 0.255 ± 0.239
0.0TrpCys: 0.0 ± 0.0
0.255TrpAsp: 0.255 ± 0.158
0.255TrpGlu: 0.255 ± 0.158
0.765TrpPhe: 0.765 ± 0.172
1.275TrpGly: 1.275 ± 0.652
0.0TrpHis: 0.0 ± 0.0
0.255TrpIle: 0.255 ± 0.158
0.255TrpLys: 0.255 ± 0.158
1.02TrpLeu: 1.02 ± 0.271
0.255TrpMet: 0.255 ± 0.829
0.765TrpAsn: 0.765 ± 0.474
0.0TrpPro: 0.0 ± 0.0
1.02TrpGln: 1.02 ± 0.633
0.0TrpArg: 0.0 ± 0.0
0.765TrpSer: 0.765 ± 0.474
0.255TrpThr: 0.255 ± 0.829
0.51TrpVal: 0.51 ± 0.316
0.0TrpTrp: 0.0 ± 0.0
0.51TrpTyr: 0.51 ± 0.136
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.039TyrAla: 2.039 ± 0.98
1.275TyrCys: 1.275 ± 0.827
1.529TyrAsp: 1.529 ± 0.71
2.294TyrGlu: 2.294 ± 0.514
2.039TyrPhe: 2.039 ± 0.535
2.294TyrGly: 2.294 ± 0.823
1.02TyrHis: 1.02 ± 0.59
2.804TyrIle: 2.804 ± 1.052
4.079TyrLys: 4.079 ± 1.355
3.569TyrLeu: 3.569 ± 2.815
2.039TyrMet: 2.039 ± 0.428
1.784TyrAsn: 1.784 ± 0.465
1.275TyrPro: 1.275 ± 0.45
1.02TyrGln: 1.02 ± 0.302
0.765TyrArg: 0.765 ± 0.172
2.294TyrSer: 2.294 ± 0.588
2.294TyrThr: 2.294 ± 0.75
1.02TyrVal: 1.02 ± 0.271
0.0TyrTrp: 0.0 ± 0.0
1.529TyrTyr: 1.529 ± 1.065
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3924 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski