Amino acid dipepetide frequency for Meiothermus phage MMP17

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.304AlaAla: 14.304 ± 1.565
1.022AlaCys: 1.022 ± 0.318
5.187AlaAsp: 5.187 ± 0.671
5.737AlaGlu: 5.737 ± 0.828
2.986AlaPhe: 2.986 ± 0.557
11.081AlaGly: 11.081 ± 1.33
2.358AlaHis: 2.358 ± 0.461
5.03AlaIle: 5.03 ± 0.783
3.065AlaLys: 3.065 ± 0.565
15.561AlaLeu: 15.561 ± 1.432
1.65AlaMet: 1.65 ± 0.347
2.279AlaAsn: 2.279 ± 0.347
6.366AlaPro: 6.366 ± 0.676
4.48AlaGln: 4.48 ± 0.742
10.688AlaArg: 10.688 ± 1.017
5.816AlaSer: 5.816 ± 0.669
6.13AlaThr: 6.13 ± 0.618
8.252AlaVal: 8.252 ± 0.907
2.986AlaTrp: 2.986 ± 0.586
3.301AlaTyr: 3.301 ± 0.516
0.0AlaXaa: 0.0 ± 0.0
Cys
0.707CysAla: 0.707 ± 0.295
0.393CysCys: 0.393 ± 0.172
0.55CysAsp: 0.55 ± 0.206
0.943CysGlu: 0.943 ± 0.308
0.157CysPhe: 0.157 ± 0.108
1.493CysGly: 1.493 ± 0.413
0.393CysHis: 0.393 ± 0.179
0.0CysIle: 0.0 ± 0.0
0.079CysLys: 0.079 ± 0.078
0.707CysLeu: 0.707 ± 0.251
0.0CysMet: 0.0 ± 0.0
0.157CysAsn: 0.157 ± 0.167
1.022CysPro: 1.022 ± 0.437
0.629CysGln: 0.629 ± 0.244
1.179CysArg: 1.179 ± 0.327
0.865CysSer: 0.865 ± 0.32
0.55CysThr: 0.55 ± 0.198
0.393CysVal: 0.393 ± 0.183
0.236CysTrp: 0.236 ± 0.156
0.157CysTyr: 0.157 ± 0.095
0.0CysXaa: 0.0 ± 0.0
Asp
5.423AspAla: 5.423 ± 0.57
0.55AspCys: 0.55 ± 0.2
2.358AspAsp: 2.358 ± 0.442
3.537AspGlu: 3.537 ± 0.556
1.179AspPhe: 1.179 ± 0.276
4.165AspGly: 4.165 ± 0.674
0.629AspHis: 0.629 ± 0.22
1.808AspIle: 1.808 ± 0.455
0.707AspLys: 0.707 ± 0.212
4.951AspLeu: 4.951 ± 0.586
0.55AspMet: 0.55 ± 0.235
1.65AspAsn: 1.65 ± 0.378
3.851AspPro: 3.851 ± 0.542
0.865AspGln: 0.865 ± 0.246
2.672AspArg: 2.672 ± 0.49
1.729AspSer: 1.729 ± 0.351
2.043AspThr: 2.043 ± 0.348
2.201AspVal: 2.201 ± 0.438
1.965AspTrp: 1.965 ± 0.404
1.179AspTyr: 1.179 ± 0.298
0.0AspXaa: 0.0 ± 0.0
Glu
9.038GluAla: 9.038 ± 1.09
0.786GluCys: 0.786 ± 0.312
1.965GluAsp: 1.965 ± 0.375
4.087GluGlu: 4.087 ± 0.718
1.493GluPhe: 1.493 ± 0.384
6.13GluGly: 6.13 ± 0.87
1.493GluHis: 1.493 ± 0.347
2.672GluIle: 2.672 ± 0.409
1.493GluLys: 1.493 ± 0.374
5.894GluLeu: 5.894 ± 0.931
1.022GluMet: 1.022 ± 0.272
1.415GluAsn: 1.415 ± 0.41
4.401GluPro: 4.401 ± 0.499
2.751GluGln: 2.751 ± 0.442
7.545GluArg: 7.545 ± 0.999
2.043GluSer: 2.043 ± 0.416
2.594GluThr: 2.594 ± 0.352
4.323GluVal: 4.323 ± 0.787
1.65GluTrp: 1.65 ± 0.418
1.336GluTyr: 1.336 ± 0.29
0.0GluXaa: 0.0 ± 0.0
Phe
2.751PheAla: 2.751 ± 0.535
0.314PheCys: 0.314 ± 0.153
1.336PheAsp: 1.336 ± 0.306
1.65PheGlu: 1.65 ± 0.326
0.707PhePhe: 0.707 ± 0.276
2.201PheGly: 2.201 ± 0.504
0.472PheHis: 0.472 ± 0.165
0.786PheIle: 0.786 ± 0.297
0.629PheLys: 0.629 ± 0.221
2.201PheLeu: 2.201 ± 0.438
0.079PheMet: 0.079 ± 0.076
1.1PheAsn: 1.1 ± 0.337
1.65PhePro: 1.65 ± 0.345
0.55PheGln: 0.55 ± 0.291
1.965PheArg: 1.965 ± 0.41
2.043PheSer: 2.043 ± 0.349
1.415PheThr: 1.415 ± 0.325
1.493PheVal: 1.493 ± 0.334
0.472PheTrp: 0.472 ± 0.178
0.707PheTyr: 0.707 ± 0.228
0.0PheXaa: 0.0 ± 0.0
Gly
9.667GlyAla: 9.667 ± 1.26
0.943GlyCys: 0.943 ± 0.324
5.344GlyAsp: 5.344 ± 0.79
4.794GlyGlu: 4.794 ± 0.635
3.065GlyPhe: 3.065 ± 0.468
8.959GlyGly: 8.959 ± 1.246
2.515GlyHis: 2.515 ± 0.62
3.458GlyIle: 3.458 ± 0.484
2.829GlyLys: 2.829 ± 0.549
10.374GlyLeu: 10.374 ± 1.004
2.358GlyMet: 2.358 ± 0.419
2.515GlyAsn: 2.515 ± 0.594
4.48GlyPro: 4.48 ± 0.671
4.008GlyGln: 4.008 ± 0.723
8.016GlyArg: 8.016 ± 0.694
5.737GlySer: 5.737 ± 0.701
4.401GlyThr: 4.401 ± 0.689
6.209GlyVal: 6.209 ± 0.687
2.751GlyTrp: 2.751 ± 0.503
3.065GlyTyr: 3.065 ± 0.553
0.0GlyXaa: 0.0 ± 0.0
His
1.965HisAla: 1.965 ± 0.396
0.157HisCys: 0.157 ± 0.137
0.707HisAsp: 0.707 ± 0.218
0.786HisGlu: 0.786 ± 0.227
0.472HisPhe: 0.472 ± 0.176
2.043HisGly: 2.043 ± 0.482
0.393HisHis: 0.393 ± 0.164
0.472HisIle: 0.472 ± 0.199
0.314HisLys: 0.314 ± 0.171
2.436HisLeu: 2.436 ± 0.491
0.236HisMet: 0.236 ± 0.13
0.157HisAsn: 0.157 ± 0.115
1.415HisPro: 1.415 ± 0.355
1.336HisGln: 1.336 ± 0.35
2.043HisArg: 2.043 ± 0.575
1.572HisSer: 1.572 ± 0.354
0.865HisThr: 0.865 ± 0.272
1.022HisVal: 1.022 ± 0.259
0.157HisTrp: 0.157 ± 0.12
0.629HisTyr: 0.629 ± 0.216
0.0HisXaa: 0.0 ± 0.0
Ile
4.48IleAla: 4.48 ± 0.789
0.236IleCys: 0.236 ± 0.124
1.336IleAsp: 1.336 ± 0.336
2.751IleGlu: 2.751 ± 0.391
0.629IlePhe: 0.629 ± 0.212
3.537IleGly: 3.537 ± 0.77
0.629IleHis: 0.629 ± 0.206
1.886IleIle: 1.886 ± 0.36
1.257IleLys: 1.257 ± 0.252
3.065IleLeu: 3.065 ± 0.519
0.707IleMet: 0.707 ± 0.18
1.493IleAsn: 1.493 ± 0.307
2.672IlePro: 2.672 ± 0.462
1.493IleGln: 1.493 ± 0.286
2.122IleArg: 2.122 ± 0.455
2.829IleSer: 2.829 ± 0.497
2.594IleThr: 2.594 ± 0.544
2.358IleVal: 2.358 ± 0.542
0.629IleTrp: 0.629 ± 0.231
1.179IleTyr: 1.179 ± 0.343
0.0IleXaa: 0.0 ± 0.0
Lys
2.358LysAla: 2.358 ± 0.483
0.079LysCys: 0.079 ± 0.075
0.629LysAsp: 0.629 ± 0.212
1.022LysGlu: 1.022 ± 0.368
0.472LysPhe: 0.472 ± 0.184
1.65LysGly: 1.65 ± 0.326
0.393LysHis: 0.393 ± 0.175
1.022LysIle: 1.022 ± 0.273
0.943LysLys: 0.943 ± 0.436
2.908LysLeu: 2.908 ± 0.695
0.314LysMet: 0.314 ± 0.16
0.393LysAsn: 0.393 ± 0.167
1.415LysPro: 1.415 ± 0.331
1.415LysGln: 1.415 ± 0.325
2.515LysArg: 2.515 ± 0.488
1.179LysSer: 1.179 ± 0.282
2.436LysThr: 2.436 ± 0.449
1.808LysVal: 1.808 ± 0.405
0.393LysTrp: 0.393 ± 0.169
0.314LysTyr: 0.314 ± 0.158
0.0LysXaa: 0.0 ± 0.0
Leu
13.989LeuAla: 13.989 ± 1.634
0.943LeuCys: 0.943 ± 0.241
4.951LeuAsp: 4.951 ± 0.476
11.553LeuGlu: 11.553 ± 1.224
1.808LeuPhe: 1.808 ± 0.288
9.667LeuGly: 9.667 ± 0.798
2.358LeuHis: 2.358 ± 0.45
3.301LeuIle: 3.301 ± 0.401
2.515LeuLys: 2.515 ± 0.402
9.903LeuLeu: 9.903 ± 1.179
1.179LeuMet: 1.179 ± 0.273
1.965LeuAsn: 1.965 ± 0.414
7.781LeuPro: 7.781 ± 0.769
3.379LeuGln: 3.379 ± 0.542
8.881LeuArg: 8.881 ± 0.978
5.423LeuSer: 5.423 ± 0.493
5.344LeuThr: 5.344 ± 0.796
6.209LeuVal: 6.209 ± 0.668
2.201LeuTrp: 2.201 ± 0.373
2.829LeuTyr: 2.829 ± 0.461
0.0LeuXaa: 0.0 ± 0.0
Met
2.672MetAla: 2.672 ± 0.415
0.157MetCys: 0.157 ± 0.125
0.629MetAsp: 0.629 ± 0.237
0.472MetGlu: 0.472 ± 0.21
0.236MetPhe: 0.236 ± 0.108
1.022MetGly: 1.022 ± 0.28
0.079MetHis: 0.079 ± 0.072
0.786MetIle: 0.786 ± 0.192
0.472MetLys: 0.472 ± 0.194
1.336MetLeu: 1.336 ± 0.376
0.157MetMet: 0.157 ± 0.108
0.236MetAsn: 0.236 ± 0.122
1.1MetPro: 1.1 ± 0.281
0.629MetGln: 0.629 ± 0.199
1.729MetArg: 1.729 ± 0.311
1.022MetSer: 1.022 ± 0.209
1.415MetThr: 1.415 ± 0.329
0.55MetVal: 0.55 ± 0.238
0.236MetTrp: 0.236 ± 0.145
0.314MetTyr: 0.314 ± 0.178
0.0MetXaa: 0.0 ± 0.0
Asn
1.729AsnAla: 1.729 ± 0.492
0.157AsnCys: 0.157 ± 0.1
0.943AsnAsp: 0.943 ± 0.246
1.257AsnGlu: 1.257 ± 0.34
0.314AsnPhe: 0.314 ± 0.145
2.122AsnGly: 2.122 ± 0.507
0.157AsnHis: 0.157 ± 0.111
1.022AsnIle: 1.022 ± 0.302
0.707AsnLys: 0.707 ± 0.27
3.379AsnLeu: 3.379 ± 0.478
0.629AsnMet: 0.629 ± 0.192
0.786AsnAsn: 0.786 ± 0.422
2.436AsnPro: 2.436 ± 0.54
0.707AsnGln: 0.707 ± 0.176
2.279AsnArg: 2.279 ± 0.296
1.1AsnSer: 1.1 ± 0.282
2.122AsnThr: 2.122 ± 0.567
1.572AsnVal: 1.572 ± 0.314
0.472AsnTrp: 0.472 ± 0.16
0.707AsnTyr: 0.707 ± 0.226
0.0AsnXaa: 0.0 ± 0.0
Pro
8.016ProAla: 8.016 ± 0.841
0.55ProCys: 0.55 ± 0.287
3.144ProAsp: 3.144 ± 0.528
5.266ProGlu: 5.266 ± 0.809
1.257ProPhe: 1.257 ± 0.285
8.252ProGly: 8.252 ± 1.157
0.943ProHis: 0.943 ± 0.348
1.965ProIle: 1.965 ± 0.442
1.179ProLys: 1.179 ± 0.278
6.445ProLeu: 6.445 ± 0.65
1.1ProMet: 1.1 ± 0.303
1.65ProAsn: 1.65 ± 0.396
5.187ProPro: 5.187 ± 0.854
3.222ProGln: 3.222 ± 0.45
5.187ProArg: 5.187 ± 0.853
3.301ProSer: 3.301 ± 0.553
3.537ProThr: 3.537 ± 0.469
3.537ProVal: 3.537 ± 0.487
1.729ProTrp: 1.729 ± 0.415
1.572ProTyr: 1.572 ± 0.364
0.0ProXaa: 0.0 ± 0.0
Gln
4.873GlnAla: 4.873 ± 0.715
0.55GlnCys: 0.55 ± 0.225
1.808GlnAsp: 1.808 ± 0.366
2.043GlnGlu: 2.043 ± 0.436
0.55GlnPhe: 0.55 ± 0.182
4.323GlnGly: 4.323 ± 0.777
1.179GlnHis: 1.179 ± 0.367
1.65GlnIle: 1.65 ± 0.37
0.707GlnLys: 0.707 ± 0.202
3.458GlnLeu: 3.458 ± 0.542
0.786GlnMet: 0.786 ± 0.277
0.55GlnAsn: 0.55 ± 0.183
2.908GlnPro: 2.908 ± 0.582
1.572GlnGln: 1.572 ± 0.401
3.851GlnArg: 3.851 ± 0.559
1.257GlnSer: 1.257 ± 0.239
1.729GlnThr: 1.729 ± 0.308
3.144GlnVal: 3.144 ± 0.47
0.786GlnTrp: 0.786 ± 0.182
0.865GlnTyr: 0.865 ± 0.28
0.0GlnXaa: 0.0 ± 0.0
Arg
10.296ArgAla: 10.296 ± 0.978
1.729ArgCys: 1.729 ± 0.412
3.065ArgAsp: 3.065 ± 0.565
5.737ArgGlu: 5.737 ± 0.983
2.594ArgPhe: 2.594 ± 0.449
7.702ArgGly: 7.702 ± 1.101
1.336ArgHis: 1.336 ± 0.308
3.615ArgIle: 3.615 ± 0.427
1.729ArgLys: 1.729 ± 0.403
10.61ArgLeu: 10.61 ± 1.052
1.572ArgMet: 1.572 ± 0.381
1.965ArgAsn: 1.965 ± 0.405
4.951ArgPro: 4.951 ± 0.785
2.594ArgGln: 2.594 ± 0.529
7.309ArgArg: 7.309 ± 0.849
5.187ArgSer: 5.187 ± 0.62
2.751ArgThr: 2.751 ± 0.408
6.602ArgVal: 6.602 ± 0.831
3.222ArgTrp: 3.222 ± 0.589
4.087ArgTyr: 4.087 ± 0.638
0.0ArgXaa: 0.0 ± 0.0
Ser
5.659SerAla: 5.659 ± 0.725
0.707SerCys: 0.707 ± 0.283
2.043SerAsp: 2.043 ± 0.322
2.436SerGlu: 2.436 ± 0.448
2.279SerPhe: 2.279 ± 0.459
5.816SerGly: 5.816 ± 0.745
0.707SerHis: 0.707 ± 0.22
2.122SerIle: 2.122 ± 0.342
0.943SerLys: 0.943 ± 0.299
5.344SerLeu: 5.344 ± 0.564
0.786SerMet: 0.786 ± 0.21
1.179SerAsn: 1.179 ± 0.295
4.558SerPro: 4.558 ± 0.559
2.043SerGln: 2.043 ± 0.48
4.715SerArg: 4.715 ± 0.684
3.537SerSer: 3.537 ± 0.646
2.358SerThr: 2.358 ± 0.447
3.537SerVal: 3.537 ± 0.547
1.1SerTrp: 1.1 ± 0.298
1.493SerTyr: 1.493 ± 0.387
0.0SerXaa: 0.0 ± 0.0
Thr
5.108ThrAla: 5.108 ± 0.751
0.157ThrCys: 0.157 ± 0.111
2.986ThrAsp: 2.986 ± 0.556
2.122ThrGlu: 2.122 ± 0.315
1.257ThrPhe: 1.257 ± 0.334
4.401ThrGly: 4.401 ± 0.728
0.629ThrHis: 0.629 ± 0.16
2.279ThrIle: 2.279 ± 0.459
1.493ThrLys: 1.493 ± 0.395
6.209ThrLeu: 6.209 ± 0.7
0.393ThrMet: 0.393 ± 0.192
1.965ThrAsn: 1.965 ± 0.514
3.851ThrPro: 3.851 ± 0.588
1.808ThrGln: 1.808 ± 0.342
4.165ThrArg: 4.165 ± 0.586
2.829ThrSer: 2.829 ± 0.439
3.065ThrThr: 3.065 ± 0.51
4.323ThrVal: 4.323 ± 0.652
0.865ThrTrp: 0.865 ± 0.223
1.1ThrTyr: 1.1 ± 0.241
0.0ThrXaa: 0.0 ± 0.0
Val
8.488ValAla: 8.488 ± 1.03
0.55ValCys: 0.55 ± 0.214
3.065ValAsp: 3.065 ± 0.448
5.03ValGlu: 5.03 ± 0.838
1.415ValPhe: 1.415 ± 0.343
6.209ValGly: 6.209 ± 0.821
1.336ValHis: 1.336 ± 0.32
2.279ValIle: 2.279 ± 0.402
1.965ValLys: 1.965 ± 0.363
6.052ValLeu: 6.052 ± 0.861
0.629ValMet: 0.629 ± 0.2
1.415ValAsn: 1.415 ± 0.34
2.986ValPro: 2.986 ± 0.552
2.908ValGln: 2.908 ± 0.474
5.973ValArg: 5.973 ± 0.556
3.458ValSer: 3.458 ± 0.593
3.301ValThr: 3.301 ± 0.519
4.715ValVal: 4.715 ± 0.635
1.729ValTrp: 1.729 ± 0.312
1.493ValTyr: 1.493 ± 0.325
0.0ValXaa: 0.0 ± 0.0
Trp
3.301TrpAla: 3.301 ± 0.566
0.314TrpCys: 0.314 ± 0.17
0.55TrpAsp: 0.55 ± 0.218
1.257TrpGlu: 1.257 ± 0.299
0.786TrpPhe: 0.786 ± 0.236
2.594TrpGly: 2.594 ± 0.472
0.472TrpHis: 0.472 ± 0.214
0.707TrpIle: 0.707 ± 0.268
0.393TrpLys: 0.393 ± 0.175
2.986TrpLeu: 2.986 ± 0.566
0.629TrpMet: 0.629 ± 0.22
0.786TrpAsn: 0.786 ± 0.255
2.279TrpPro: 2.279 ± 0.408
1.022TrpGln: 1.022 ± 0.323
3.144TrpArg: 3.144 ± 0.471
1.179TrpSer: 1.179 ± 0.328
0.55TrpThr: 0.55 ± 0.202
1.022TrpVal: 1.022 ± 0.318
0.393TrpTrp: 0.393 ± 0.161
0.393TrpTyr: 0.393 ± 0.161
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.458TyrAla: 3.458 ± 0.473
0.314TyrCys: 0.314 ± 0.147
1.415TyrAsp: 1.415 ± 0.369
1.65TyrGlu: 1.65 ± 0.409
0.865TyrPhe: 0.865 ± 0.297
2.122TyrGly: 2.122 ± 0.517
0.707TyrHis: 0.707 ± 0.258
1.022TyrIle: 1.022 ± 0.218
0.236TyrLys: 0.236 ± 0.147
2.436TyrLeu: 2.436 ± 0.483
0.393TyrMet: 0.393 ± 0.173
0.943TyrAsn: 0.943 ± 0.225
1.729TyrPro: 1.729 ± 0.403
1.179TyrGln: 1.179 ± 0.326
2.672TyrArg: 2.672 ± 0.352
1.257TyrSer: 1.257 ± 0.336
1.808TyrThr: 1.808 ± 0.41
1.729TyrVal: 1.729 ± 0.397
0.786TyrTrp: 0.786 ± 0.24
1.1TyrTyr: 1.1 ± 0.307
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (12725 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski