Amino acid dipeptide frequency for Pantoea phage vB_PagM_AAM22

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally likely, each dipeptide would be present at a frequency of 1/400 = 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
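As an illustration of the paragraph above, the following minimal Python sketch counts overlapping dipeptides in a set of protein sequences and compares each frequency with the uniform expectation of 2.5‰. The function names, the toy sequences, and the choice to normalize by the total number of dipeptides are assumptions made for this example; the exact procedure used by the database is not described on this page.

```python
from collections import Counter

# The 20 standard amino acids (one-letter codes).
# Any other symbol is mapped to "X" (Xaa) in this sketch.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Assumption: frequencies are normalized by the total number of
    dipeptides counted; the database may normalize differently.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Uniform expectation: 1 / (20 * 20) = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / (len(STANDARD) ** 2)


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


# Hypothetical toy sequences, not taken from the phage proteome.
toy = ["MAAGK", "MKAAL"]
freqs = dipeptide_per_mille(toy)
print(freqs["AA"], classify(freqs["AA"]))
```

With real data, values from the table can be passed straight to `classify`: for example, AlaAla (9.55‰) would be labelled "over" and AlaCys (1.202‰) "under".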

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
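A short sketch of the unit conversion, using the AlaAla value from the table as the worked example. The helper name is hypothetical, and the 2.5‰ baseline is the uniform 1/400 expectation.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


ala_ala_per_mille = 9.55           # AlaAla value from the table
uniform_per_mille = 1000.0 / 400   # 2.5 per mille if all 400 dipeptides were equally likely

fraction = per_mille_to_fraction(ala_ala_per_mille)
print(f"fraction: {fraction:.5f}")                                 # fraction: 0.00955
print(f"percent:  {fraction * 100:.3f}%")                          # percent:  0.955%
print(f"ratio:    {ala_ala_per_mille / uniform_per_mille:.2f}x")   # ratio:    3.82x
```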

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
9.55AlaAla: 9.55 ± 1.171
1.202AlaCys: 1.202 ± 0.33
4.933AlaAsp: 4.933 ± 0.659
5.313AlaGlu: 5.313 ± 0.594
2.214AlaPhe: 2.214 ± 0.442
8.475AlaGly: 8.475 ± 1.079
1.834AlaHis: 1.834 ± 0.329
3.415AlaIle: 3.415 ± 0.321
4.111AlaLys: 4.111 ± 0.628
7.02AlaLeu: 7.02 ± 0.561
2.53AlaMet: 2.53 ± 0.307
3.415AlaAsn: 3.415 ± 0.501
3.226AlaPro: 3.226 ± 0.58
4.554AlaGln: 4.554 ± 0.454
4.87AlaArg: 4.87 ± 0.614
5.313AlaSer: 5.313 ± 0.622
6.135AlaThr: 6.135 ± 0.629
6.451AlaVal: 6.451 ± 0.756
1.075AlaTrp: 1.075 ± 0.299
2.53AlaTyr: 2.53 ± 0.413
0.0AlaXaa: 0.0 ± 0.0
Cys
0.443CysAla: 0.443 ± 0.174
0.379CysCys: 0.379 ± 0.16
0.949CysAsp: 0.949 ± 0.283
1.075CysGlu: 1.075 ± 0.271
0.443CysPhe: 0.443 ± 0.143
1.328CysGly: 1.328 ± 0.371
0.443CysHis: 0.443 ± 0.157
0.822CysIle: 0.822 ± 0.254
0.759CysLys: 0.759 ± 0.256
0.949CysLeu: 0.949 ± 0.217
0.063CysMet: 0.063 ± 0.053
0.696CysAsn: 0.696 ± 0.186
1.328CysPro: 1.328 ± 0.388
0.253CysGln: 0.253 ± 0.12
0.759CysArg: 0.759 ± 0.198
0.443CysSer: 0.443 ± 0.159
0.632CysThr: 0.632 ± 0.244
0.949CysVal: 0.949 ± 0.242
0.19CysTrp: 0.19 ± 0.119
0.506CysTyr: 0.506 ± 0.174
0.0CysXaa: 0.0 ± 0.0
Asp
6.767AspAla: 6.767 ± 0.714
0.759AspCys: 0.759 ± 0.216
3.921AspAsp: 3.921 ± 0.525
4.364AspGlu: 4.364 ± 0.493
1.961AspPhe: 1.961 ± 0.377
5.755AspGly: 5.755 ± 0.587
1.075AspHis: 1.075 ± 0.26
4.554AspIle: 4.554 ± 0.608
3.099AspLys: 3.099 ± 0.451
3.036AspLeu: 3.036 ± 0.361
2.087AspMet: 2.087 ± 0.411
2.593AspAsn: 2.593 ± 0.381
2.087AspPro: 2.087 ± 0.362
1.265AspGln: 1.265 ± 0.327
2.656AspArg: 2.656 ± 0.436
3.415AspSer: 3.415 ± 0.564
4.427AspThr: 4.427 ± 0.673
4.301AspVal: 4.301 ± 0.607
1.391AspTrp: 1.391 ± 0.287
1.897AspTyr: 1.897 ± 0.342
0.0AspXaa: 0.0 ± 0.0
Glu
4.617GluAla: 4.617 ± 0.546
1.012GluCys: 1.012 ± 0.278
2.34GluAsp: 2.34 ± 0.404
3.732GluGlu: 3.732 ± 0.706
2.277GluPhe: 2.277 ± 0.423
3.415GluGly: 3.415 ± 0.507
0.822GluHis: 0.822 ± 0.25
4.238GluIle: 4.238 ± 0.49
3.605GluLys: 3.605 ± 0.52
4.87GluLeu: 4.87 ± 0.615
2.087GluMet: 2.087 ± 0.388
1.771GluAsn: 1.771 ± 0.373
1.644GluPro: 1.644 ± 0.306
3.605GluGln: 3.605 ± 0.639
4.048GluArg: 4.048 ± 0.562
3.352GluSer: 3.352 ± 0.475
3.099GluThr: 3.099 ± 0.365
3.289GluVal: 3.289 ± 0.456
2.15GluTrp: 2.15 ± 0.404
1.834GluTyr: 1.834 ± 0.346
0.0GluXaa: 0.0 ± 0.0
Phe
2.214PheAla: 2.214 ± 0.354
0.822PheCys: 0.822 ± 0.25
2.656PheAsp: 2.656 ± 0.409
1.897PheGlu: 1.897 ± 0.461
0.885PhePhe: 0.885 ± 0.278
2.783PheGly: 2.783 ± 0.386
0.696PheHis: 0.696 ± 0.187
1.771PheIle: 1.771 ± 0.399
1.834PheLys: 1.834 ± 0.338
2.53PheLeu: 2.53 ± 0.498
0.759PheMet: 0.759 ± 0.219
1.961PheAsn: 1.961 ± 0.357
1.075PhePro: 1.075 ± 0.245
0.632PheGln: 0.632 ± 0.183
2.214PheArg: 2.214 ± 0.504
2.656PheSer: 2.656 ± 0.371
2.34PheThr: 2.34 ± 0.359
2.467PheVal: 2.467 ± 0.404
0.632PheTrp: 0.632 ± 0.203
0.949PheTyr: 0.949 ± 0.24
0.0PheXaa: 0.0 ± 0.0
Gly
6.704GlyAla: 6.704 ± 0.959
1.455GlyCys: 1.455 ± 0.348
5.882GlyAsp: 5.882 ± 0.654
4.301GlyGlu: 4.301 ± 0.576
3.099GlyPhe: 3.099 ± 0.489
5.945GlyGly: 5.945 ± 0.934
1.075GlyHis: 1.075 ± 0.374
4.491GlyIle: 4.491 ± 0.532
3.415GlyLys: 3.415 ± 0.47
5.186GlyLeu: 5.186 ± 0.526
2.467GlyMet: 2.467 ± 0.407
3.858GlyAsn: 3.858 ± 0.593
4.111GlyPro: 4.111 ± 1.626
2.72GlyGln: 2.72 ± 0.36
3.858GlyArg: 3.858 ± 0.506
4.174GlySer: 4.174 ± 0.555
5.439GlyThr: 5.439 ± 0.859
6.578GlyVal: 6.578 ± 0.588
1.581GlyTrp: 1.581 ± 0.333
3.542GlyTyr: 3.542 ± 0.459
0.0GlyXaa: 0.0 ± 0.0
His
1.455HisAla: 1.455 ± 0.382
0.253HisCys: 0.253 ± 0.118
1.138HisAsp: 1.138 ± 0.278
0.759HisGlu: 0.759 ± 0.22
0.822HisPhe: 0.822 ± 0.21
1.581HisGly: 1.581 ± 0.349
0.569HisHis: 0.569 ± 0.213
0.885HisIle: 0.885 ± 0.278
0.949HisLys: 0.949 ± 0.234
1.328HisLeu: 1.328 ± 0.348
0.506HisMet: 0.506 ± 0.168
0.885HisAsn: 0.885 ± 0.241
0.759HisPro: 0.759 ± 0.203
0.379HisGln: 0.379 ± 0.149
1.138HisArg: 1.138 ± 0.249
1.518HisSer: 1.518 ± 0.35
1.328HisThr: 1.328 ± 0.318
1.455HisVal: 1.455 ± 0.305
0.506HisTrp: 0.506 ± 0.155
0.379HisTyr: 0.379 ± 0.138
0.0HisXaa: 0.0 ± 0.0
Ile
4.933IleAla: 4.933 ± 0.651
0.632IleCys: 0.632 ± 0.216
3.921IleAsp: 3.921 ± 0.522
4.87IleGlu: 4.87 ± 0.667
1.391IlePhe: 1.391 ± 0.349
3.162IleGly: 3.162 ± 0.462
1.012IleHis: 1.012 ± 0.301
3.036IleIle: 3.036 ± 0.416
2.846IleLys: 2.846 ± 0.478
2.909IleLeu: 2.909 ± 0.399
1.455IleMet: 1.455 ± 0.319
3.415IleAsn: 3.415 ± 0.575
3.099IlePro: 3.099 ± 0.451
2.214IleGln: 2.214 ± 0.402
2.783IleArg: 2.783 ± 0.428
3.415IleSer: 3.415 ± 0.576
4.807IleThr: 4.807 ± 0.497
4.174IleVal: 4.174 ± 0.62
0.379IleTrp: 0.379 ± 0.149
1.328IleTyr: 1.328 ± 0.267
0.0IleXaa: 0.0 ± 0.0
Lys
4.933LysAla: 4.933 ± 0.511
0.949LysCys: 0.949 ± 0.282
2.973LysAsp: 2.973 ± 0.457
2.214LysGlu: 2.214 ± 0.459
2.087LysPhe: 2.087 ± 0.431
4.554LysGly: 4.554 ± 0.616
1.012LysHis: 1.012 ± 0.347
3.036LysIle: 3.036 ± 0.45
2.973LysLys: 2.973 ± 0.479
4.048LysLeu: 4.048 ± 0.525
1.961LysMet: 1.961 ± 0.326
2.277LysAsn: 2.277 ± 0.422
1.897LysPro: 1.897 ± 0.374
2.214LysGln: 2.214 ± 0.439
2.846LysArg: 2.846 ± 0.369
2.909LysSer: 2.909 ± 0.391
2.467LysThr: 2.467 ± 0.33
3.732LysVal: 3.732 ± 0.521
1.138LysTrp: 1.138 ± 0.243
2.277LysTyr: 2.277 ± 0.316
0.0LysXaa: 0.0 ± 0.0
Leu
6.894LeuAla: 6.894 ± 0.68
0.949LeuCys: 0.949 ± 0.228
5.376LeuAsp: 5.376 ± 0.587
4.238LeuGlu: 4.238 ± 0.615
1.897LeuPhe: 1.897 ± 0.319
4.68LeuGly: 4.68 ± 0.549
1.455LeuHis: 1.455 ± 0.302
3.921LeuIle: 3.921 ± 0.452
4.427LeuLys: 4.427 ± 0.486
4.807LeuLeu: 4.807 ± 0.57
1.834LeuMet: 1.834 ± 0.382
3.985LeuAsn: 3.985 ± 0.576
3.099LeuPro: 3.099 ± 0.484
3.162LeuGln: 3.162 ± 0.498
4.68LeuArg: 4.68 ± 0.569
5.313LeuSer: 5.313 ± 0.668
4.68LeuThr: 4.68 ± 0.616
4.554LeuVal: 4.554 ± 0.594
1.012LeuTrp: 1.012 ± 0.232
2.72LeuTyr: 2.72 ± 0.339
0.0LeuXaa: 0.0 ± 0.0
Met
1.834MetAla: 1.834 ± 0.285
0.379MetCys: 0.379 ± 0.172
1.138MetAsp: 1.138 ± 0.279
1.202MetGlu: 1.202 ± 0.308
1.075MetPhe: 1.075 ± 0.214
1.897MetGly: 1.897 ± 0.343
0.253MetHis: 0.253 ± 0.127
1.391MetIle: 1.391 ± 0.336
1.518MetLys: 1.518 ± 0.263
1.897MetLeu: 1.897 ± 0.363
0.443MetMet: 0.443 ± 0.215
1.391MetAsn: 1.391 ± 0.27
0.949MetPro: 0.949 ± 0.268
1.834MetGln: 1.834 ± 0.308
2.909MetArg: 2.909 ± 0.414
2.656MetSer: 2.656 ± 0.342
2.593MetThr: 2.593 ± 0.431
1.328MetVal: 1.328 ± 0.348
0.443MetTrp: 0.443 ± 0.18
0.632MetTyr: 0.632 ± 0.206
0.0MetXaa: 0.0 ± 0.0
Asn
3.542AsnAla: 3.542 ± 0.586
0.822AsnCys: 0.822 ± 0.246
2.72AsnAsp: 2.72 ± 0.454
2.024AsnGlu: 2.024 ± 0.431
1.518AsnPhe: 1.518 ± 0.312
4.933AsnGly: 4.933 ± 0.549
1.455AsnHis: 1.455 ± 0.286
2.593AsnIle: 2.593 ± 0.374
2.34AsnLys: 2.34 ± 0.43
3.795AsnLeu: 3.795 ± 0.532
1.391AsnMet: 1.391 ± 0.36
3.162AsnAsn: 3.162 ± 0.485
1.897AsnPro: 1.897 ± 0.435
1.771AsnGln: 1.771 ± 0.389
2.783AsnArg: 2.783 ± 0.467
3.036AsnSer: 3.036 ± 0.432
3.036AsnThr: 3.036 ± 0.428
3.036AsnVal: 3.036 ± 0.379
0.632AsnTrp: 0.632 ± 0.184
1.644AsnTyr: 1.644 ± 0.34
0.0AsnXaa: 0.0 ± 0.0
Pro
4.174ProAla: 4.174 ± 0.563
0.443ProCys: 0.443 ± 0.134
2.403ProAsp: 2.403 ± 0.355
2.846ProGlu: 2.846 ± 0.489
1.391ProPhe: 1.391 ± 0.258
3.226ProGly: 3.226 ± 0.613
0.506ProHis: 0.506 ± 0.185
1.518ProIle: 1.518 ± 0.343
1.834ProLys: 1.834 ± 0.371
2.467ProLeu: 2.467 ± 0.453
0.759ProMet: 0.759 ± 0.216
1.644ProAsn: 1.644 ± 0.283
1.455ProPro: 1.455 ± 0.402
2.024ProGln: 2.024 ± 0.481
1.455ProArg: 1.455 ± 0.272
2.467ProSer: 2.467 ± 0.446
2.34ProThr: 2.34 ± 0.466
5.123ProVal: 5.123 ± 0.631
0.569ProTrp: 0.569 ± 0.191
1.012ProTyr: 1.012 ± 0.212
0.0ProXaa: 0.0 ± 0.0
Gln
3.605GlnAla: 3.605 ± 0.529
0.316GlnCys: 0.316 ± 0.186
2.214GlnAsp: 2.214 ± 0.377
2.656GlnGlu: 2.656 ± 0.433
2.024GlnPhe: 2.024 ± 0.394
3.036GlnGly: 3.036 ± 0.71
0.759GlnHis: 0.759 ± 0.176
2.656GlnIle: 2.656 ± 0.374
1.518GlnLys: 1.518 ± 0.322
4.427GlnLeu: 4.427 ± 0.491
1.581GlnMet: 1.581 ± 0.34
1.897GlnAsn: 1.897 ± 0.334
1.391GlnPro: 1.391 ± 0.343
2.846GlnGln: 2.846 ± 0.461
2.783GlnArg: 2.783 ± 0.472
2.403GlnSer: 2.403 ± 0.447
2.593GlnThr: 2.593 ± 0.474
2.783GlnVal: 2.783 ± 0.494
0.696GlnTrp: 0.696 ± 0.204
1.455GlnTyr: 1.455 ± 0.317
0.0GlnXaa: 0.0 ± 0.0
Arg
5.123ArgAla: 5.123 ± 0.631
0.632ArgCys: 0.632 ± 0.223
3.605ArgAsp: 3.605 ± 0.477
2.973ArgGlu: 2.973 ± 0.521
2.15ArgPhe: 2.15 ± 0.469
3.162ArgGly: 3.162 ± 0.459
1.644ArgHis: 1.644 ± 0.254
4.238ArgIle: 4.238 ± 0.555
3.985ArgLys: 3.985 ± 0.489
3.668ArgLeu: 3.668 ± 0.502
1.834ArgMet: 1.834 ± 0.423
3.036ArgAsn: 3.036 ± 0.464
1.708ArgPro: 1.708 ± 0.32
2.783ArgGln: 2.783 ± 0.376
2.783ArgArg: 2.783 ± 0.504
2.593ArgSer: 2.593 ± 0.347
2.72ArgThr: 2.72 ± 0.41
4.744ArgVal: 4.744 ± 0.56
1.012ArgTrp: 1.012 ± 0.248
1.391ArgTyr: 1.391 ± 0.281
0.0ArgXaa: 0.0 ± 0.0
Ser
5.566SerAla: 5.566 ± 0.698
0.569SerCys: 0.569 ± 0.169
4.111SerAsp: 4.111 ± 0.584
2.909SerGlu: 2.909 ± 0.458
2.34SerPhe: 2.34 ± 0.408
6.008SerGly: 6.008 ± 0.738
1.391SerHis: 1.391 ± 0.34
2.656SerIle: 2.656 ± 0.433
3.732SerLys: 3.732 ± 0.484
5.566SerLeu: 5.566 ± 0.549
1.391SerMet: 1.391 ± 0.253
2.846SerAsn: 2.846 ± 0.431
1.708SerPro: 1.708 ± 0.299
3.289SerGln: 3.289 ± 0.557
2.656SerArg: 2.656 ± 0.38
4.301SerSer: 4.301 ± 0.701
3.795SerThr: 3.795 ± 0.596
4.427SerVal: 4.427 ± 0.515
1.138SerTrp: 1.138 ± 0.191
2.214SerTyr: 2.214 ± 0.296
0.0SerXaa: 0.0 ± 0.0
Thr
5.945ThrAla: 5.945 ± 0.674
0.632ThrCys: 0.632 ± 0.19
3.858ThrAsp: 3.858 ± 0.504
3.668ThrGlu: 3.668 ± 0.464
2.214ThrPhe: 2.214 ± 0.386
7.59ThrGly: 7.59 ± 1.637
0.696ThrHis: 0.696 ± 0.235
3.099ThrIle: 3.099 ± 0.448
2.277ThrLys: 2.277 ± 0.458
5.566ThrLeu: 5.566 ± 0.607
1.328ThrMet: 1.328 ± 0.341
2.34ThrAsn: 2.34 ± 0.423
3.542ThrPro: 3.542 ± 0.66
2.53ThrGln: 2.53 ± 0.362
3.226ThrArg: 3.226 ± 0.521
4.427ThrSer: 4.427 ± 0.546
4.111ThrThr: 4.111 ± 0.453
5.186ThrVal: 5.186 ± 0.577
1.012ThrTrp: 1.012 ± 0.264
2.277ThrTyr: 2.277 ± 0.324
0.0ThrXaa: 0.0 ± 0.0
Val
6.325ValAla: 6.325 ± 0.761
0.569ValCys: 0.569 ± 0.166
4.301ValAsp: 4.301 ± 0.582
3.921ValGlu: 3.921 ± 0.596
2.34ValPhe: 2.34 ± 0.304
4.554ValGly: 4.554 ± 0.546
1.075ValHis: 1.075 ± 0.222
4.68ValIle: 4.68 ± 0.663
4.301ValLys: 4.301 ± 0.488
4.807ValLeu: 4.807 ± 0.621
1.897ValMet: 1.897 ± 0.434
4.364ValAsn: 4.364 ± 0.543
3.099ValPro: 3.099 ± 0.443
3.415ValGln: 3.415 ± 0.47
4.68ValArg: 4.68 ± 0.53
5.313ValSer: 5.313 ± 0.75
5.186ValThr: 5.186 ± 0.756
4.048ValVal: 4.048 ± 0.476
1.075ValTrp: 1.075 ± 0.253
2.467ValTyr: 2.467 ± 0.348
0.0ValXaa: 0.0 ± 0.0
Trp
1.075TrpAla: 1.075 ± 0.246
0.19TrpCys: 0.19 ± 0.114
0.696TrpAsp: 0.696 ± 0.205
0.949TrpGlu: 0.949 ± 0.298
0.696TrpPhe: 0.696 ± 0.202
1.202TrpGly: 1.202 ± 0.231
0.443TrpHis: 0.443 ± 0.206
0.885TrpIle: 0.885 ± 0.242
1.265TrpLys: 1.265 ± 0.268
2.277TrpLeu: 2.277 ± 0.458
0.443TrpMet: 0.443 ± 0.161
0.885TrpAsn: 0.885 ± 0.236
0.506TrpPro: 0.506 ± 0.19
0.569TrpGln: 0.569 ± 0.172
0.885TrpArg: 0.885 ± 0.256
1.202TrpSer: 1.202 ± 0.24
1.391TrpThr: 1.391 ± 0.279
0.949TrpVal: 0.949 ± 0.24
0.126TrpTrp: 0.126 ± 0.085
0.379TrpTyr: 0.379 ± 0.131
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.467TyrAla: 2.467 ± 0.432
0.443TyrCys: 0.443 ± 0.13
2.277TyrAsp: 2.277 ± 0.357
1.897TyrGlu: 1.897 ± 0.357
0.949TyrPhe: 0.949 ± 0.233
2.593TyrGly: 2.593 ± 0.437
0.379TyrHis: 0.379 ± 0.171
1.897TyrIle: 1.897 ± 0.269
1.708TyrLys: 1.708 ± 0.389
2.53TyrLeu: 2.53 ± 0.403
0.885TyrMet: 0.885 ± 0.24
1.708TyrAsn: 1.708 ± 0.378
1.075TyrPro: 1.075 ± 0.348
1.581TyrGln: 1.581 ± 0.305
1.771TyrArg: 1.771 ± 0.338
1.708TyrSer: 1.708 ± 0.376
2.403TyrThr: 2.403 ± 0.325
2.846TyrVal: 2.846 ± 0.389
0.253TyrTrp: 0.253 ± 0.118
1.075TyrTyr: 1.075 ± 0.25
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 96 proteins (15812 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level
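The note above can be sketched as follows: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. This is only a sketch under assumptions: taking the standard deviation as the error, the function names, the fixed seed, and the toy sequences are all choices made for this example, not details confirmed by the source.

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    count = 0
    total = 0
    for seq in proteins:
        pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
        total += len(pairs)
        count += pairs.count(dipeptide)
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error by resampling whole proteins with replacement.

    Assumption: the error is the standard deviation of the resampled
    frequency estimates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not the phage data.
toy = ["MAAGKAA", "MKAALV", "MGGSAAT", "MLLVK"]
value = dipeptide_per_mille(toy, "AA")
error = bootstrap_error(toy, "AA")
print(f"AA: {value:.3f} ± {error:.3f}")
```

Resampling at the protein level, rather than at the level of individual dipeptides, keeps each protein's composition intact, so the error reflects how much the estimate depends on which proteins are included.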

The above dipeptide statistics (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski