Amino acid dipepetide frequency for Arthrobacter phage Bumble

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
21.659AlaAla: 21.659 ± 1.998
0.239AlaCys: 0.239 ± 0.142
7.724AlaAsp: 7.724 ± 0.815
11.706AlaGlu: 11.706 ± 1.091
1.752AlaPhe: 1.752 ± 0.658
11.148AlaGly: 11.148 ± 1.136
1.752AlaHis: 1.752 ± 0.371
5.176AlaIle: 5.176 ± 0.566
3.743AlaLys: 3.743 ± 0.623
12.582AlaLeu: 12.582 ± 0.88
2.548AlaMet: 2.548 ± 0.449
3.424AlaAsn: 3.424 ± 0.493
10.83AlaPro: 10.83 ± 1.088
3.743AlaGln: 3.743 ± 0.568
10.272AlaArg: 10.272 ± 1.119
7.963AlaSer: 7.963 ± 0.812
6.609AlaThr: 6.609 ± 0.809
9.715AlaVal: 9.715 ± 0.857
1.433AlaTrp: 1.433 ± 0.425
2.469AlaTyr: 2.469 ± 0.364
0.0AlaXaa: 0.0 ± 0.0
Cys
0.319CysAla: 0.319 ± 0.173
0.0CysCys: 0.0 ± 0.0
0.398CysAsp: 0.398 ± 0.174
0.0CysGlu: 0.0 ± 0.0
0.239CysPhe: 0.239 ± 0.126
0.159CysGly: 0.159 ± 0.101
0.08CysHis: 0.08 ± 0.078
0.08CysIle: 0.08 ± 0.07
0.08CysLys: 0.08 ± 0.085
0.398CysLeu: 0.398 ± 0.195
0.239CysMet: 0.239 ± 0.135
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.08CysGln: 0.08 ± 0.071
0.159CysArg: 0.159 ± 0.103
0.08CysSer: 0.08 ± 0.082
0.319CysThr: 0.319 ± 0.137
0.717CysVal: 0.717 ± 0.228
0.159CysTrp: 0.159 ± 0.096
0.08CysTyr: 0.08 ± 0.08
0.0CysXaa: 0.0 ± 0.0
Asp
7.804AspAla: 7.804 ± 0.589
0.159AspCys: 0.159 ± 0.117
2.946AspAsp: 2.946 ± 0.499
3.663AspGlu: 3.663 ± 0.619
1.354AspPhe: 1.354 ± 0.348
6.928AspGly: 6.928 ± 0.752
1.354AspHis: 1.354 ± 0.276
1.115AspIle: 1.115 ± 0.372
1.991AspLys: 1.991 ± 0.402
4.778AspLeu: 4.778 ± 0.669
0.637AspMet: 0.637 ± 0.254
1.115AspAsn: 1.115 ± 0.325
6.132AspPro: 6.132 ± 0.703
1.035AspGln: 1.035 ± 0.32
3.982AspArg: 3.982 ± 0.63
3.185AspSer: 3.185 ± 0.465
3.743AspThr: 3.743 ± 0.526
5.256AspVal: 5.256 ± 0.453
1.593AspTrp: 1.593 ± 0.374
1.513AspTyr: 1.513 ± 0.389
0.0AspXaa: 0.0 ± 0.0
Glu
10.352GluAla: 10.352 ± 1.375
0.08GluCys: 0.08 ± 0.071
4.141GluAsp: 4.141 ± 0.607
3.982GluGlu: 3.982 ± 0.725
1.194GluPhe: 1.194 ± 0.261
5.335GluGly: 5.335 ± 0.629
1.115GluHis: 1.115 ± 0.274
4.459GluIle: 4.459 ± 0.511
1.035GluLys: 1.035 ± 0.308
7.326GluLeu: 7.326 ± 0.853
0.478GluMet: 0.478 ± 0.199
1.672GluAsn: 1.672 ± 0.424
3.504GluPro: 3.504 ± 0.466
1.115GluGln: 1.115 ± 0.29
4.22GluArg: 4.22 ± 0.563
2.548GluSer: 2.548 ± 0.409
3.424GluThr: 3.424 ± 0.48
5.256GluVal: 5.256 ± 0.873
0.717GluTrp: 0.717 ± 0.243
1.911GluTyr: 1.911 ± 0.462
0.0GluXaa: 0.0 ± 0.0
Phe
3.424PheAla: 3.424 ± 0.742
0.319PheCys: 0.319 ± 0.132
1.513PheAsp: 1.513 ± 0.301
1.194PheGlu: 1.194 ± 0.242
0.319PhePhe: 0.319 ± 0.256
1.752PheGly: 1.752 ± 0.67
0.557PheHis: 0.557 ± 0.198
1.274PheIle: 1.274 ± 0.254
1.115PheLys: 1.115 ± 0.259
1.274PheLeu: 1.274 ± 0.306
0.557PheMet: 0.557 ± 0.185
0.717PheAsn: 0.717 ± 0.269
1.433PhePro: 1.433 ± 0.341
0.398PheGln: 0.398 ± 0.146
2.469PheArg: 2.469 ± 0.437
1.672PheSer: 1.672 ± 0.404
1.035PheThr: 1.035 ± 0.268
1.274PheVal: 1.274 ± 0.362
0.557PheTrp: 0.557 ± 0.173
0.478PheTyr: 0.478 ± 0.192
0.0PheXaa: 0.0 ± 0.0
Gly
10.75GlyAla: 10.75 ± 0.996
0.0GlyCys: 0.0 ± 0.0
5.495GlyAsp: 5.495 ± 0.616
3.822GlyGlu: 3.822 ± 0.563
2.389GlyPhe: 2.389 ± 0.62
9.635GlyGly: 9.635 ± 1.72
1.593GlyHis: 1.593 ± 0.325
4.141GlyIle: 4.141 ± 0.809
2.867GlyLys: 2.867 ± 0.6
8.441GlyLeu: 8.441 ± 0.965
1.752GlyMet: 1.752 ± 0.347
1.513GlyAsn: 1.513 ± 0.372
4.778GlyPro: 4.778 ± 0.538
2.07GlyGln: 2.07 ± 0.544
7.565GlyArg: 7.565 ± 0.767
6.53GlySer: 6.53 ± 0.709
6.53GlyThr: 6.53 ± 0.965
5.495GlyVal: 5.495 ± 0.787
1.513GlyTrp: 1.513 ± 0.359
2.628GlyTyr: 2.628 ± 0.549
0.0GlyXaa: 0.0 ± 0.0
His
1.274HisAla: 1.274 ± 0.37
0.08HisCys: 0.08 ± 0.078
0.956HisAsp: 0.956 ± 0.282
0.478HisGlu: 0.478 ± 0.192
0.637HisPhe: 0.637 ± 0.202
1.035HisGly: 1.035 ± 0.269
0.239HisHis: 0.239 ± 0.147
0.319HisIle: 0.319 ± 0.128
0.159HisLys: 0.159 ± 0.112
1.752HisLeu: 1.752 ± 0.354
0.239HisMet: 0.239 ± 0.119
0.398HisAsn: 0.398 ± 0.185
1.433HisPro: 1.433 ± 0.316
0.478HisGln: 0.478 ± 0.143
0.956HisArg: 0.956 ± 0.325
1.274HisSer: 1.274 ± 0.515
1.354HisThr: 1.354 ± 0.485
1.354HisVal: 1.354 ± 0.275
0.159HisTrp: 0.159 ± 0.101
0.239HisTyr: 0.239 ± 0.128
0.0HisXaa: 0.0 ± 0.0
Ile
4.698IleAla: 4.698 ± 0.516
0.159IleCys: 0.159 ± 0.113
2.548IleAsp: 2.548 ± 0.352
2.628IleGlu: 2.628 ± 0.354
0.876IlePhe: 0.876 ± 0.267
3.504IleGly: 3.504 ± 0.89
0.637IleHis: 0.637 ± 0.238
1.593IleIle: 1.593 ± 0.373
1.752IleLys: 1.752 ± 0.354
3.982IleLeu: 3.982 ± 0.561
0.637IleMet: 0.637 ± 0.256
1.115IleAsn: 1.115 ± 0.321
2.707IlePro: 2.707 ± 0.504
0.637IleGln: 0.637 ± 0.203
2.707IleArg: 2.707 ± 0.537
3.344IleSer: 3.344 ± 0.445
2.389IleThr: 2.389 ± 0.416
3.265IleVal: 3.265 ± 0.543
0.239IleTrp: 0.239 ± 0.157
0.876IleTyr: 0.876 ± 0.208
0.0IleXaa: 0.0 ± 0.0
Lys
4.38LysAla: 4.38 ± 0.561
0.08LysCys: 0.08 ± 0.073
2.15LysAsp: 2.15 ± 0.362
1.672LysGlu: 1.672 ± 0.326
1.274LysPhe: 1.274 ± 0.336
2.628LysGly: 2.628 ± 0.466
0.319LysHis: 0.319 ± 0.188
1.593LysIle: 1.593 ± 0.536
1.115LysLys: 1.115 ± 0.36
2.548LysLeu: 2.548 ± 0.552
0.637LysMet: 0.637 ± 0.239
0.876LysAsn: 0.876 ± 0.278
1.593LysPro: 1.593 ± 0.475
0.956LysGln: 0.956 ± 0.299
2.07LysArg: 2.07 ± 0.36
1.672LysSer: 1.672 ± 0.44
2.548LysThr: 2.548 ± 0.51
2.07LysVal: 2.07 ± 0.444
0.478LysTrp: 0.478 ± 0.161
0.956LysTyr: 0.956 ± 0.319
0.0LysXaa: 0.0 ± 0.0
Leu
11.069LeuAla: 11.069 ± 0.775
0.398LeuCys: 0.398 ± 0.175
5.893LeuAsp: 5.893 ± 0.678
5.495LeuGlu: 5.495 ± 0.624
1.672LeuPhe: 1.672 ± 0.393
7.246LeuGly: 7.246 ± 0.808
1.035LeuHis: 1.035 ± 0.333
3.663LeuIle: 3.663 ± 0.649
2.787LeuLys: 2.787 ± 0.459
7.963LeuLeu: 7.963 ± 1.064
1.115LeuMet: 1.115 ± 0.277
3.185LeuAsn: 3.185 ± 0.455
5.813LeuPro: 5.813 ± 0.834
1.433LeuGln: 1.433 ± 0.337
6.928LeuArg: 6.928 ± 0.729
8.043LeuSer: 8.043 ± 1.084
6.45LeuThr: 6.45 ± 0.738
6.848LeuVal: 6.848 ± 0.682
1.194LeuTrp: 1.194 ± 0.275
1.354LeuTyr: 1.354 ± 0.376
0.0LeuXaa: 0.0 ± 0.0
Met
2.23MetAla: 2.23 ± 0.516
0.239MetCys: 0.239 ± 0.134
0.796MetAsp: 0.796 ± 0.24
0.796MetGlu: 0.796 ± 0.235
0.478MetPhe: 0.478 ± 0.196
1.354MetGly: 1.354 ± 0.559
0.0MetHis: 0.0 ± 0.0
0.717MetIle: 0.717 ± 0.218
0.319MetLys: 0.319 ± 0.15
1.354MetLeu: 1.354 ± 0.265
0.239MetMet: 0.239 ± 0.117
0.319MetAsn: 0.319 ± 0.142
1.115MetPro: 1.115 ± 0.32
0.398MetGln: 0.398 ± 0.206
0.876MetArg: 0.876 ± 0.298
1.115MetSer: 1.115 ± 0.321
1.832MetThr: 1.832 ± 0.367
1.035MetVal: 1.035 ± 0.257
0.239MetTrp: 0.239 ± 0.134
0.398MetTyr: 0.398 ± 0.183
0.0MetXaa: 0.0 ± 0.0
Asn
3.344AsnAla: 3.344 ± 0.655
0.319AsnCys: 0.319 ± 0.157
1.194AsnAsp: 1.194 ± 0.322
1.832AsnGlu: 1.832 ± 0.29
0.478AsnPhe: 0.478 ± 0.173
2.946AsnGly: 2.946 ± 0.515
0.398AsnHis: 0.398 ± 0.155
0.478AsnIle: 0.478 ± 0.19
0.956AsnLys: 0.956 ± 0.213
2.867AsnLeu: 2.867 ± 0.477
0.239AsnMet: 0.239 ± 0.177
0.717AsnAsn: 0.717 ± 0.239
2.07AsnPro: 2.07 ± 0.382
0.717AsnGln: 0.717 ± 0.265
1.274AsnArg: 1.274 ± 0.255
1.752AsnSer: 1.752 ± 0.278
1.911AsnThr: 1.911 ± 0.413
1.513AsnVal: 1.513 ± 0.336
0.398AsnTrp: 0.398 ± 0.201
0.319AsnTyr: 0.319 ± 0.143
0.0AsnXaa: 0.0 ± 0.0
Pro
10.511ProAla: 10.511 ± 1.097
0.319ProCys: 0.319 ± 0.162
5.415ProAsp: 5.415 ± 0.716
5.654ProGlu: 5.654 ± 0.838
1.194ProPhe: 1.194 ± 0.246
6.848ProGly: 6.848 ± 0.702
0.956ProHis: 0.956 ± 0.242
2.309ProIle: 2.309 ± 0.525
1.752ProLys: 1.752 ± 0.392
4.619ProLeu: 4.619 ± 0.59
0.717ProMet: 0.717 ± 0.224
1.274ProAsn: 1.274 ± 0.354
4.061ProPro: 4.061 ± 0.866
0.876ProGln: 0.876 ± 0.302
5.335ProArg: 5.335 ± 0.81
3.902ProSer: 3.902 ± 0.715
2.946ProThr: 2.946 ± 0.509
5.495ProVal: 5.495 ± 0.596
0.557ProTrp: 0.557 ± 0.182
1.115ProTyr: 1.115 ± 0.343
0.0ProXaa: 0.0 ± 0.0
Gln
2.23GlnAla: 2.23 ± 0.476
0.159GlnCys: 0.159 ± 0.107
0.637GlnAsp: 0.637 ± 0.188
1.194GlnGlu: 1.194 ± 0.378
0.557GlnPhe: 0.557 ± 0.155
1.354GlnGly: 1.354 ± 0.362
0.319GlnHis: 0.319 ± 0.126
1.672GlnIle: 1.672 ± 0.335
0.717GlnLys: 0.717 ± 0.222
1.991GlnLeu: 1.991 ± 0.443
1.035GlnMet: 1.035 ± 0.296
0.796GlnAsn: 0.796 ± 0.262
0.717GlnPro: 0.717 ± 0.199
1.035GlnGln: 1.035 ± 0.277
1.672GlnArg: 1.672 ± 0.305
1.354GlnSer: 1.354 ± 0.35
2.07GlnThr: 2.07 ± 0.406
1.433GlnVal: 1.433 ± 0.261
0.319GlnTrp: 0.319 ± 0.156
0.717GlnTyr: 0.717 ± 0.219
0.0GlnXaa: 0.0 ± 0.0
Arg
9.874ArgAla: 9.874 ± 1.053
0.398ArgCys: 0.398 ± 0.241
4.937ArgAsp: 4.937 ± 0.516
5.017ArgGlu: 5.017 ± 0.578
2.628ArgPhe: 2.628 ± 0.502
4.459ArgGly: 4.459 ± 0.715
0.796ArgHis: 0.796 ± 0.261
4.38ArgIle: 4.38 ± 0.538
2.628ArgLys: 2.628 ± 0.455
6.609ArgLeu: 6.609 ± 0.748
1.194ArgMet: 1.194 ± 0.271
1.513ArgAsn: 1.513 ± 0.36
4.459ArgPro: 4.459 ± 0.627
2.309ArgGln: 2.309 ± 0.477
4.459ArgArg: 4.459 ± 0.628
4.459ArgSer: 4.459 ± 0.689
5.096ArgThr: 5.096 ± 0.614
4.3ArgVal: 4.3 ± 0.653
1.433ArgTrp: 1.433 ± 0.36
2.469ArgTyr: 2.469 ± 0.485
0.0ArgXaa: 0.0 ± 0.0
Ser
8.441SerAla: 8.441 ± 1.052
0.159SerCys: 0.159 ± 0.1
2.867SerAsp: 2.867 ± 0.412
3.344SerGlu: 3.344 ± 0.468
1.433SerPhe: 1.433 ± 0.327
8.043SerGly: 8.043 ± 1.202
1.194SerHis: 1.194 ± 0.487
1.991SerIle: 1.991 ± 0.41
2.469SerLys: 2.469 ± 0.485
4.778SerLeu: 4.778 ± 0.553
0.956SerMet: 0.956 ± 0.28
1.513SerAsn: 1.513 ± 0.38
4.459SerPro: 4.459 ± 0.514
1.194SerGln: 1.194 ± 0.301
4.778SerArg: 4.778 ± 0.593
4.619SerSer: 4.619 ± 0.648
3.583SerThr: 3.583 ± 0.51
4.061SerVal: 4.061 ± 0.709
1.274SerTrp: 1.274 ± 0.432
2.548SerTyr: 2.548 ± 0.448
0.0SerXaa: 0.0 ± 0.0
Thr
9.396ThrAla: 9.396 ± 0.982
0.398ThrCys: 0.398 ± 0.171
3.265ThrAsp: 3.265 ± 0.515
3.424ThrGlu: 3.424 ± 0.655
2.389ThrPhe: 2.389 ± 0.522
5.335ThrGly: 5.335 ± 0.815
1.274ThrHis: 1.274 ± 0.383
1.832ThrIle: 1.832 ± 0.364
2.548ThrLys: 2.548 ± 0.511
4.619ThrLeu: 4.619 ± 0.448
0.876ThrMet: 0.876 ± 0.273
1.911ThrAsn: 1.911 ± 0.37
5.176ThrPro: 5.176 ± 0.627
1.115ThrGln: 1.115 ± 0.283
3.743ThrArg: 3.743 ± 0.493
3.026ThrSer: 3.026 ± 0.383
4.857ThrThr: 4.857 ± 0.686
6.132ThrVal: 6.132 ± 0.961
0.956ThrTrp: 0.956 ± 0.277
1.991ThrTyr: 1.991 ± 0.467
0.0ThrXaa: 0.0 ± 0.0
Val
9.476ValAla: 9.476 ± 1.076
0.08ValCys: 0.08 ± 0.085
4.778ValAsp: 4.778 ± 0.804
6.132ValGlu: 6.132 ± 0.804
1.354ValPhe: 1.354 ± 0.327
5.256ValGly: 5.256 ± 0.752
0.796ValHis: 0.796 ± 0.179
2.07ValIle: 2.07 ± 0.408
2.23ValLys: 2.23 ± 0.355
7.804ValLeu: 7.804 ± 0.728
0.796ValMet: 0.796 ± 0.287
1.991ValAsn: 1.991 ± 0.322
3.663ValPro: 3.663 ± 0.491
1.194ValGln: 1.194 ± 0.281
6.689ValArg: 6.689 ± 0.776
4.778ValSer: 4.778 ± 0.774
5.096ValThr: 5.096 ± 0.673
7.167ValVal: 7.167 ± 1.044
0.717ValTrp: 0.717 ± 0.193
3.265ValTyr: 3.265 ± 0.568
0.0ValXaa: 0.0 ± 0.0
Trp
2.15TrpAla: 2.15 ± 0.373
0.0TrpCys: 0.0 ± 0.0
1.035TrpAsp: 1.035 ± 0.351
0.557TrpGlu: 0.557 ± 0.198
0.319TrpPhe: 0.319 ± 0.136
1.194TrpGly: 1.194 ± 0.393
0.159TrpHis: 0.159 ± 0.108
0.876TrpIle: 0.876 ± 0.251
0.796TrpLys: 0.796 ± 0.213
1.513TrpLeu: 1.513 ± 0.391
0.319TrpMet: 0.319 ± 0.142
0.478TrpAsn: 0.478 ± 0.167
0.239TrpPro: 0.239 ± 0.133
0.319TrpGln: 0.319 ± 0.138
1.035TrpArg: 1.035 ± 0.243
0.717TrpSer: 0.717 ± 0.231
1.115TrpThr: 1.115 ± 0.333
1.115TrpVal: 1.115 ± 0.231
0.319TrpTrp: 0.319 ± 0.152
0.159TrpTyr: 0.159 ± 0.111
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.106TyrAla: 3.106 ± 0.45
0.0TyrCys: 0.0 ± 0.0
1.593TyrAsp: 1.593 ± 0.424
1.274TyrGlu: 1.274 ± 0.303
0.796TyrPhe: 0.796 ± 0.256
3.344TyrGly: 3.344 ± 0.612
0.239TyrHis: 0.239 ± 0.171
0.557TyrIle: 0.557 ± 0.216
0.637TyrLys: 0.637 ± 0.194
2.389TyrLeu: 2.389 ± 0.509
0.557TyrMet: 0.557 ± 0.202
1.194TyrAsn: 1.194 ± 0.304
1.672TyrPro: 1.672 ± 0.321
0.876TyrGln: 0.876 ± 0.298
2.23TyrArg: 2.23 ± 0.378
1.672TyrSer: 1.672 ± 0.327
1.354TyrThr: 1.354 ± 0.363
1.832TyrVal: 1.832 ± 0.42
0.159TyrTrp: 0.159 ± 0.114
0.637TyrTyr: 0.637 ± 0.223
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (12559 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski