Amino acid dipeptide frequency for Arthrobacter phage Makai

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
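The uniform expectation described above can be worked through in a few lines. This is a minimal sketch, not the database's own code: the function names `expected_per_mille` and `classify` are hypothetical, and the simple greater-than/less-than comparison is an assumption, since the exact rule used for the red/blue colouring is not stated here.

```python
# Sketch: uniform expectation for dipeptide frequencies, in per mille.
# Assumption: a dipeptide counts as over- or underrepresented as soon as
# its frequency differs from the uniform expectation.

def expected_per_mille(n_residue_types: int = 20) -> float:
    """Expected frequency (in per mille) if all dipeptides were equally likely."""
    return 1000.0 / (n_residue_types ** 2)


def classify(freq_per_mille: float, n_residue_types: int = 20) -> str:
    """Label a frequency relative to the uniform expectation."""
    expected = expected_per_mille(n_residue_types)
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


print(expected_per_mille())      # 20 amino acids: 400 dipeptides
print(expected_per_mille(21))    # 21 residue types including Xaa: 441 dipeptides
print(classify(7.892))           # AlaAla value from the table
print(classify(0.386))           # AlaCys value from the table
```

With 20 residue types the expectation is 2.5‰, so AlaAla (7.892‰) falls above it and AlaCys (0.386‰) below it.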

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
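As an illustration of how such per-mille values can be obtained, the sketch below counts overlapping residue pairs within each protein and scales the counts by 1000. The function name `dipeptide_per_mille` is hypothetical, sequences are assumed to be one-letter strings, and dividing by the total number of dipeptides counted is an assumption; the exact normalisation used by Proteome-pI is not stated here.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Dipeptides are overlapping pairs of adjacent residues; pairs never
    span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Toy input (not real phage proteins): "AAG" gives AA and AG, "AG" gives AG.
freqs = dipeptide_per_mille(["AAG", "AG"])
for dipeptide, value in sorted(freqs.items()):
    # value * 1e-3 converts per mille back to a plain fraction
    print(dipeptide, round(value, 3), round(value * 1e-3, 6))
```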

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency (‰) ± error, grouped by first residue. Within each group the second residue runs in the order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 7.892 ± 1.133
AlaCys: 0.386 ± 0.205
AlaAsp: 4.581 ± 0.579
AlaGlu: 5.078 ± 0.477
AlaPhe: 2.704 ± 0.396
AlaGly: 7.23 ± 1.156
AlaHis: 1.766 ± 0.322
AlaIle: 5.74 ± 0.441
AlaLys: 6.402 ± 0.826
AlaLeu: 7.009 ± 1.172
AlaMet: 3.256 ± 0.536
AlaAsn: 3.974 ± 0.383
AlaPro: 3.311 ± 0.415
AlaGln: 2.649 ± 0.391
AlaArg: 4.802 ± 0.448
AlaSer: 5.243 ± 0.62
AlaThr: 5.685 ± 0.586
AlaVal: 6.678 ± 0.714
AlaTrp: 1.159 ± 0.264
AlaTyr: 2.76 ± 0.339
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.552 ± 0.196
CysCys: 0.0 ± 0.0
CysAsp: 0.386 ± 0.141
CysGlu: 0.331 ± 0.161
CysPhe: 0.221 ± 0.121
CysGly: 0.386 ± 0.172
CysHis: 0.221 ± 0.094
CysIle: 0.331 ± 0.159
CysLys: 0.166 ± 0.099
CysLeu: 0.442 ± 0.183
CysMet: 0.11 ± 0.073
CysAsn: 0.11 ± 0.077
CysPro: 0.276 ± 0.17
CysGln: 0.166 ± 0.111
CysArg: 0.276 ± 0.113
CysSer: 0.442 ± 0.218
CysThr: 0.11 ± 0.074
CysVal: 0.386 ± 0.166
CysTrp: 0.0 ± 0.0
CysTyr: 0.166 ± 0.086
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.685 ± 0.551
AspCys: 0.221 ± 0.106
AspAsp: 3.698 ± 0.549
AspGlu: 4.746 ± 0.61
AspPhe: 2.484 ± 0.419
AspGly: 5.243 ± 0.609
AspHis: 0.497 ± 0.185
AspIle: 4.415 ± 0.676
AspLys: 3.367 ± 0.47
AspLeu: 4.912 ± 0.602
AspMet: 1.766 ± 0.278
AspAsn: 2.925 ± 0.369
AspPro: 2.428 ± 0.43
AspGln: 2.539 ± 0.411
AspArg: 2.87 ± 0.382
AspSer: 2.925 ± 0.414
AspThr: 2.925 ± 0.663
AspVal: 3.919 ± 0.539
AspTrp: 0.883 ± 0.197
AspTyr: 1.325 ± 0.226
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.298 ± 0.454
GluCys: 0.552 ± 0.169
GluAsp: 4.029 ± 0.533
GluGlu: 5.188 ± 0.813
GluPhe: 2.87 ± 0.455
GluGly: 4.526 ± 0.411
GluHis: 1.601 ± 0.298
GluIle: 3.532 ± 0.371
GluLys: 3.863 ± 0.595
GluLeu: 5.795 ± 0.58
GluMet: 2.649 ± 0.344
GluAsn: 2.373 ± 0.364
GluPro: 2.594 ± 0.455
GluGln: 2.704 ± 0.302
GluArg: 3.422 ± 0.489
GluSer: 2.704 ± 0.419
GluThr: 3.698 ± 0.364
GluVal: 4.691 ± 0.564
GluTrp: 1.325 ± 0.23
GluTyr: 1.987 ± 0.296
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.035 ± 0.349
PheCys: 0.11 ± 0.077
PheAsp: 2.87 ± 0.357
PheGlu: 2.539 ± 0.426
PhePhe: 1.435 ± 0.287
PheGly: 2.484 ± 0.331
PheHis: 0.552 ± 0.182
PheIle: 2.373 ± 0.289
PheLys: 2.373 ± 0.304
PheLeu: 2.594 ± 0.409
PheMet: 0.883 ± 0.271
PheAsn: 2.318 ± 0.327
PhePro: 1.104 ± 0.227
PheGln: 1.104 ± 0.188
PheArg: 1.932 ± 0.415
PheSer: 2.594 ± 0.416
PheThr: 2.373 ± 0.557
PheVal: 2.042 ± 0.332
PheTrp: 0.828 ± 0.255
PheTyr: 1.049 ± 0.263
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.672 ± 0.926
GlyCys: 0.386 ± 0.19
GlyAsp: 4.857 ± 0.529
GlyGlu: 4.36 ± 0.45
GlyPhe: 3.698 ± 0.422
GlyGly: 6.292 ± 1.578
GlyHis: 1.601 ± 0.323
GlyIle: 4.581 ± 0.601
GlyLys: 4.746 ± 0.525
GlyLeu: 5.961 ± 0.839
GlyMet: 2.373 ± 0.574
GlyAsn: 2.87 ± 0.522
GlyPro: 1.656 ± 0.244
GlyGln: 1.49 ± 0.378
GlyArg: 2.649 ± 0.389
GlySer: 4.967 ± 0.641
GlyThr: 5.961 ± 0.812
GlyVal: 5.574 ± 0.625
GlyTrp: 1.711 ± 0.396
GlyTyr: 2.373 ± 0.42
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.601 ± 0.429
HisCys: 0.221 ± 0.113
HisAsp: 0.938 ± 0.262
HisGlu: 1.049 ± 0.247
HisPhe: 0.662 ± 0.196
HisGly: 1.104 ± 0.298
HisHis: 0.386 ± 0.172
HisIle: 1.269 ± 0.238
HisLys: 1.159 ± 0.263
HisLeu: 1.435 ± 0.268
HisMet: 0.442 ± 0.173
HisAsn: 0.717 ± 0.228
HisPro: 0.993 ± 0.251
HisGln: 0.883 ± 0.205
HisArg: 0.993 ± 0.241
HisSer: 0.993 ± 0.284
HisThr: 0.938 ± 0.184
HisVal: 1.049 ± 0.23
HisTrp: 0.221 ± 0.12
HisTyr: 1.159 ± 0.251
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.74 ± 0.551
IleCys: 0.276 ± 0.136
IleAsp: 4.47 ± 0.494
IleGlu: 4.029 ± 0.505
IlePhe: 1.932 ± 0.356
IleGly: 3.643 ± 0.674
IleHis: 1.104 ± 0.269
IleIle: 3.532 ± 0.479
IleLys: 4.36 ± 0.546
IleLeu: 6.347 ± 0.607
IleMet: 1.269 ± 0.225
IleAsn: 3.422 ± 0.398
IlePro: 2.649 ± 0.468
IleGln: 1.766 ± 0.24
IleArg: 2.98 ± 0.343
IleSer: 3.863 ± 0.487
IleThr: 3.477 ± 0.387
IleVal: 4.746 ± 0.444
IleTrp: 0.828 ± 0.173
IleTyr: 1.711 ± 0.282
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.678 ± 0.859
LysCys: 0.331 ± 0.126
LysAsp: 3.422 ± 0.543
LysGlu: 5.298 ± 0.652
LysPhe: 2.594 ± 0.363
LysGly: 4.47 ± 0.554
LysHis: 1.711 ± 0.396
LysIle: 3.698 ± 0.499
LysLys: 5.74 ± 0.751
LysLeu: 4.967 ± 0.51
LysMet: 2.539 ± 0.391
LysAsn: 2.87 ± 0.421
LysPro: 3.367 ± 0.476
LysGln: 2.76 ± 0.327
LysArg: 3.974 ± 0.483
LysSer: 3.753 ± 0.482
LysThr: 2.925 ± 0.423
LysVal: 4.415 ± 0.449
LysTrp: 1.435 ± 0.291
LysTyr: 2.208 ± 0.392
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.12 ± 0.827
LeuCys: 0.442 ± 0.145
LeuAsp: 5.243 ± 0.675
LeuGlu: 5.409 ± 0.566
LeuPhe: 2.594 ± 0.465
LeuGly: 6.016 ± 1.028
LeuHis: 1.269 ± 0.29
LeuIle: 5.353 ± 0.589
LeuLys: 5.353 ± 0.642
LeuLeu: 5.078 ± 0.691
LeuMet: 2.208 ± 0.344
LeuAsn: 3.367 ± 0.372
LeuPro: 3.587 ± 0.433
LeuGln: 2.649 ± 0.34
LeuArg: 4.139 ± 0.546
LeuSer: 4.746 ± 0.602
LeuThr: 5.078 ± 0.589
LeuVal: 6.016 ± 0.502
LeuTrp: 0.828 ± 0.215
LeuTyr: 3.091 ± 0.387
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.201 ± 0.414
MetCys: 0.166 ± 0.115
MetAsp: 1.214 ± 0.19
MetGlu: 1.656 ± 0.309
MetPhe: 1.049 ± 0.211
MetGly: 1.932 ± 0.414
MetHis: 0.497 ± 0.18
MetIle: 1.876 ± 0.322
MetLys: 2.152 ± 0.355
MetLeu: 2.263 ± 0.304
MetMet: 0.607 ± 0.185
MetAsn: 1.049 ± 0.258
MetPro: 0.938 ± 0.208
MetGln: 0.828 ± 0.307
MetArg: 1.435 ± 0.287
MetSer: 2.925 ± 0.415
MetThr: 1.821 ± 0.324
MetVal: 1.656 ± 0.274
MetTrp: 0.276 ± 0.12
MetTyr: 0.993 ± 0.28
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.029 ± 0.474
AsnCys: 0.331 ± 0.138
AsnAsp: 2.704 ± 0.372
AsnGlu: 3.035 ± 0.434
AsnPhe: 1.49 ± 0.277
AsnGly: 4.47 ± 0.59
AsnHis: 0.497 ± 0.159
AsnIle: 2.704 ± 0.488
AsnLys: 2.484 ± 0.323
AsnLeu: 2.87 ± 0.563
AsnMet: 1.214 ± 0.228
AsnAsn: 2.097 ± 0.323
AsnPro: 2.594 ± 0.36
AsnGln: 2.152 ± 0.326
AsnArg: 2.815 ± 0.46
AsnSer: 3.367 ± 0.405
AsnThr: 2.208 ± 0.292
AsnVal: 2.208 ± 0.313
AsnTrp: 0.607 ± 0.181
AsnTyr: 1.766 ± 0.348
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.477 ± 0.498
ProCys: 0.331 ± 0.132
ProAsp: 2.484 ± 0.372
ProGlu: 3.201 ± 0.553
ProPhe: 1.38 ± 0.289
ProGly: 3.643 ± 0.555
ProHis: 0.607 ± 0.197
ProIle: 1.987 ± 0.355
ProLys: 2.594 ± 0.445
ProLeu: 2.428 ± 0.326
ProMet: 0.883 ± 0.218
ProAsn: 1.711 ± 0.351
ProPro: 2.097 ± 0.419
ProGln: 1.214 ± 0.272
ProArg: 1.876 ± 0.373
ProSer: 3.146 ± 0.378
ProThr: 2.704 ± 0.374
ProVal: 3.091 ± 0.302
ProTrp: 0.386 ± 0.137
ProTyr: 1.38 ± 0.311
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.76 ± 0.503
GlnCys: 0.055 ± 0.053
GlnAsp: 1.435 ± 0.258
GlnGlu: 2.318 ± 0.426
GlnPhe: 1.214 ± 0.198
GlnGly: 2.208 ± 0.353
GlnHis: 0.717 ± 0.231
GlnIle: 2.539 ± 0.287
GlnLys: 3.256 ± 0.57
GlnLeu: 3.256 ± 0.481
GlnMet: 0.773 ± 0.186
GlnAsn: 1.49 ± 0.258
GlnPro: 1.159 ± 0.261
GlnGln: 1.435 ± 0.298
GlnArg: 1.932 ± 0.378
GlnSer: 1.159 ± 0.282
GlnThr: 1.821 ± 0.346
GlnVal: 2.484 ± 0.392
GlnTrp: 0.497 ± 0.186
GlnTyr: 1.49 ± 0.282
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.808 ± 0.552
ArgCys: 0.11 ± 0.079
ArgAsp: 2.263 ± 0.46
ArgGlu: 3.256 ± 0.393
ArgPhe: 1.435 ± 0.276
ArgGly: 2.815 ± 0.328
ArgHis: 1.104 ± 0.274
ArgIle: 3.367 ± 0.453
ArgLys: 4.581 ± 0.588
ArgLeu: 4.746 ± 0.568
ArgMet: 1.325 ± 0.289
ArgAsn: 2.925 ± 0.4
ArgPro: 1.435 ± 0.315
ArgGln: 2.263 ± 0.428
ArgArg: 4.084 ± 0.783
ArgSer: 2.815 ± 0.39
ArgThr: 3.532 ± 0.395
ArgVal: 3.643 ± 0.44
ArgTrp: 0.662 ± 0.219
ArgTyr: 1.821 ± 0.296
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.415 ± 0.583
SerCys: 0.331 ± 0.184
SerAsp: 3.698 ± 0.488
SerGlu: 3.477 ± 0.461
SerPhe: 2.484 ± 0.383
SerGly: 5.629 ± 0.69
SerHis: 0.993 ± 0.251
SerIle: 4.194 ± 0.414
SerLys: 4.36 ± 0.515
SerLeu: 5.574 ± 0.466
SerMet: 1.49 ± 0.286
SerAsn: 2.318 ± 0.323
SerPro: 2.925 ± 0.393
SerGln: 1.656 ± 0.272
SerArg: 2.98 ± 0.346
SerSer: 4.415 ± 0.63
SerThr: 3.532 ± 0.374
SerVal: 3.367 ± 0.41
SerTrp: 1.104 ± 0.252
SerTyr: 1.932 ± 0.337
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.574 ± 0.606
ThrCys: 0.276 ± 0.135
ThrAsp: 3.919 ± 0.51
ThrGlu: 3.532 ± 0.447
ThrPhe: 2.097 ± 0.375
ThrGly: 4.912 ± 0.671
ThrHis: 0.938 ± 0.256
ThrIle: 3.698 ± 0.431
ThrLys: 3.256 ± 0.326
ThrLeu: 5.464 ± 0.57
ThrMet: 1.545 ± 0.237
ThrAsn: 3.256 ± 0.392
ThrPro: 3.311 ± 0.46
ThrGln: 1.49 ± 0.268
ThrArg: 2.484 ± 0.444
ThrSer: 3.146 ± 0.425
ThrThr: 4.746 ± 0.792
ThrVal: 4.25 ± 0.554
ThrTrp: 0.938 ± 0.216
ThrTyr: 2.318 ± 0.378
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.685 ± 0.622
ValCys: 0.166 ± 0.109
ValAsp: 3.919 ± 0.472
ValGlu: 3.919 ± 0.417
ValPhe: 2.152 ± 0.403
ValGly: 4.912 ± 0.511
ValHis: 1.159 ± 0.277
ValIle: 4.25 ± 0.449
ValLys: 6.016 ± 0.652
ValLeu: 4.857 ± 0.629
ValMet: 1.876 ± 0.332
ValAsn: 3.643 ± 0.494
ValPro: 2.594 ± 0.351
ValGln: 2.484 ± 0.457
ValArg: 3.863 ± 0.657
ValSer: 4.415 ± 0.523
ValThr: 4.084 ± 0.609
ValVal: 4.802 ± 0.531
ValTrp: 0.717 ± 0.178
ValTyr: 2.76 ± 0.488
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.883 ± 0.214
TrpCys: 0.11 ± 0.088
TrpAsp: 1.269 ± 0.301
TrpGlu: 1.104 ± 0.301
TrpPhe: 0.552 ± 0.188
TrpGly: 1.435 ± 0.236
TrpHis: 0.276 ± 0.132
TrpIle: 0.662 ± 0.252
TrpLys: 0.607 ± 0.188
TrpLeu: 1.214 ± 0.279
TrpMet: 0.331 ± 0.134
TrpAsn: 0.662 ± 0.223
TrpPro: 0.166 ± 0.081
TrpGln: 0.662 ± 0.183
TrpArg: 0.938 ± 0.321
TrpSer: 0.773 ± 0.162
TrpThr: 1.545 ± 0.275
TrpVal: 1.104 ± 0.324
TrpTrp: 0.276 ± 0.116
TrpTyr: 0.497 ± 0.167
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.201 ± 0.438
TyrCys: 0.276 ± 0.116
TyrAsp: 2.373 ± 0.439
TyrGlu: 1.876 ± 0.332
TyrPhe: 1.545 ± 0.308
TyrGly: 2.428 ± 0.355
TyrHis: 0.717 ± 0.218
TyrIle: 2.042 ± 0.359
TyrLys: 2.318 ± 0.3
TyrLeu: 2.373 ± 0.292
TyrMet: 0.938 ± 0.229
TyrAsn: 1.545 ± 0.277
TyrPro: 1.435 ± 0.278
TyrGln: 1.104 ± 0.24
TyrArg: 1.601 ± 0.269
TyrSer: 2.484 ± 0.327
TyrThr: 1.987 ± 0.321
TyrVal: 2.152 ± 0.322
TyrTrp: 0.386 ± 0.158
TyrTyr: 1.049 ± 0.28
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 92 proteins (18120 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
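Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is an illustration under stated assumptions, not the database's implementation. The function names `per_mille` and `bootstrap_error` are hypothetical, and using the sample standard deviation as the error measure is an assumption, as is normalising by the total number of dipeptides.

```python
import random
import statistics


def per_mille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide across a list of sequences."""
    total = hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(per_mille(resample, dipeptide))
    return statistics.stdev(estimates)


# Toy sequences (not real phage proteins).
toy = ["AAGAA", "GAGAG", "AAAA", "GGAG"]
print(per_mille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the error reflects variation between proteins.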

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski