Amino acid dipepetide frequency for Gordonia phage Mcklovin

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.167AlaAla: 17.167 ± 1.717
0.882AlaCys: 0.882 ± 0.245
8.172AlaAsp: 8.172 ± 0.678
7.584AlaGlu: 7.584 ± 0.859
3.116AlaPhe: 3.116 ± 0.499
9.289AlaGly: 9.289 ± 0.879
1.881AlaHis: 1.881 ± 0.462
5.35AlaIle: 5.35 ± 0.506
4.292AlaLys: 4.292 ± 0.477
8.701AlaLeu: 8.701 ± 0.693
3.469AlaMet: 3.469 ± 0.581
3.528AlaAsn: 3.528 ± 0.447
4.468AlaPro: 4.468 ± 0.64
4.351AlaGln: 4.351 ± 0.83
8.172AlaArg: 8.172 ± 0.711
5.997AlaSer: 5.997 ± 0.748
8.113AlaThr: 8.113 ± 0.584
7.702AlaVal: 7.702 ± 1.144
1.764AlaTrp: 1.764 ± 0.386
3.057AlaTyr: 3.057 ± 0.372
0.0AlaXaa: 0.0 ± 0.0
Cys
0.882CysAla: 0.882 ± 0.239
0.235CysCys: 0.235 ± 0.116
0.941CysAsp: 0.941 ± 0.275
0.588CysGlu: 0.588 ± 0.242
0.118CysPhe: 0.118 ± 0.08
1.411CysGly: 1.411 ± 0.295
0.47CysHis: 0.47 ± 0.238
0.353CysIle: 0.353 ± 0.153
0.294CysLys: 0.294 ± 0.138
0.412CysLeu: 0.412 ± 0.146
0.059CysMet: 0.059 ± 0.058
0.47CysAsn: 0.47 ± 0.155
0.588CysPro: 0.588 ± 0.202
0.294CysGln: 0.294 ± 0.178
0.999CysArg: 0.999 ± 0.303
0.47CysSer: 0.47 ± 0.158
0.412CysThr: 0.412 ± 0.154
0.529CysVal: 0.529 ± 0.171
0.176CysTrp: 0.176 ± 0.112
0.588CysTyr: 0.588 ± 0.185
0.0CysXaa: 0.0 ± 0.0
Asp
6.82AspAla: 6.82 ± 0.519
0.588AspCys: 0.588 ± 0.197
7.467AspAsp: 7.467 ± 1.0
5.233AspGlu: 5.233 ± 0.695
1.529AspPhe: 1.529 ± 0.241
5.762AspGly: 5.762 ± 0.639
1.881AspHis: 1.881 ± 0.439
2.234AspIle: 2.234 ± 0.401
1.94AspLys: 1.94 ± 0.338
6.585AspLeu: 6.585 ± 0.547
1.411AspMet: 1.411 ± 0.272
1.764AspAsn: 1.764 ± 0.292
5.115AspPro: 5.115 ± 0.487
2.058AspGln: 2.058 ± 0.359
5.056AspArg: 5.056 ± 0.816
2.763AspSer: 2.763 ± 0.331
3.822AspThr: 3.822 ± 0.454
5.291AspVal: 5.291 ± 0.546
1.352AspTrp: 1.352 ± 0.256
1.293AspTyr: 1.293 ± 0.264
0.0AspXaa: 0.0 ± 0.0
Glu
6.056GluAla: 6.056 ± 0.845
0.647GluCys: 0.647 ± 0.273
2.998GluAsp: 2.998 ± 0.525
3.234GluGlu: 3.234 ± 0.457
2.175GluPhe: 2.175 ± 0.376
3.351GluGly: 3.351 ± 0.48
0.823GluHis: 0.823 ± 0.248
2.469GluIle: 2.469 ± 0.423
1.705GluLys: 1.705 ± 0.357
5.585GluLeu: 5.585 ± 0.512
0.999GluMet: 0.999 ± 0.246
1.176GluAsn: 1.176 ± 0.372
2.352GluPro: 2.352 ± 0.493
3.057GluGln: 3.057 ± 0.545
5.526GluArg: 5.526 ± 0.767
3.292GluSer: 3.292 ± 0.329
3.175GluThr: 3.175 ± 0.44
5.233GluVal: 5.233 ± 0.719
1.235GluTrp: 1.235 ± 0.247
1.176GluTyr: 1.176 ± 0.235
0.0GluXaa: 0.0 ± 0.0
Phe
3.057PheAla: 3.057 ± 0.422
0.118PheCys: 0.118 ± 0.073
2.41PheAsp: 2.41 ± 0.352
1.47PheGlu: 1.47 ± 0.265
0.647PhePhe: 0.647 ± 0.235
2.41PheGly: 2.41 ± 0.497
0.529PheHis: 0.529 ± 0.168
0.999PheIle: 0.999 ± 0.233
0.764PheLys: 0.764 ± 0.212
1.587PheLeu: 1.587 ± 0.433
0.412PheMet: 0.412 ± 0.134
0.823PheAsn: 0.823 ± 0.239
1.176PhePro: 1.176 ± 0.251
0.706PheGln: 0.706 ± 0.195
1.646PheArg: 1.646 ± 0.334
1.235PheSer: 1.235 ± 0.273
2.058PheThr: 2.058 ± 0.411
2.058PheVal: 2.058 ± 0.447
0.647PheTrp: 0.647 ± 0.182
0.647PheTyr: 0.647 ± 0.215
0.0PheXaa: 0.0 ± 0.0
Gly
8.936GlyAla: 8.936 ± 0.788
0.647GlyCys: 0.647 ± 0.175
5.115GlyAsp: 5.115 ± 0.53
3.645GlyGlu: 3.645 ± 0.371
2.058GlyPhe: 2.058 ± 0.366
7.408GlyGly: 7.408 ± 0.876
1.587GlyHis: 1.587 ± 0.325
4.115GlyIle: 4.115 ± 0.429
3.528GlyLys: 3.528 ± 0.428
6.408GlyLeu: 6.408 ± 0.737
2.234GlyMet: 2.234 ± 0.41
3.528GlyAsn: 3.528 ± 0.4
3.88GlyPro: 3.88 ± 0.545
3.057GlyGln: 3.057 ± 0.492
6.879GlyArg: 6.879 ± 0.73
4.88GlySer: 4.88 ± 0.574
5.644GlyThr: 5.644 ± 0.617
5.938GlyVal: 5.938 ± 0.641
1.352GlyTrp: 1.352 ± 0.28
2.352GlyTyr: 2.352 ± 0.307
0.0GlyXaa: 0.0 ± 0.0
His
1.705HisAla: 1.705 ± 0.413
0.294HisCys: 0.294 ± 0.13
1.47HisAsp: 1.47 ± 0.29
0.882HisGlu: 0.882 ± 0.273
0.294HisPhe: 0.294 ± 0.133
2.293HisGly: 2.293 ± 0.532
0.588HisHis: 0.588 ± 0.248
0.647HisIle: 0.647 ± 0.168
0.529HisLys: 0.529 ± 0.166
1.881HisLeu: 1.881 ± 0.416
0.47HisMet: 0.47 ± 0.166
0.235HisAsn: 0.235 ± 0.11
1.411HisPro: 1.411 ± 0.299
1.117HisGln: 1.117 ± 0.224
1.823HisArg: 1.823 ± 0.346
0.999HisSer: 0.999 ± 0.222
1.646HisThr: 1.646 ± 0.391
1.587HisVal: 1.587 ± 0.268
0.47HisTrp: 0.47 ± 0.144
0.706HisTyr: 0.706 ± 0.223
0.0HisXaa: 0.0 ± 0.0
Ile
6.35IleAla: 6.35 ± 0.737
0.353IleCys: 0.353 ± 0.195
3.704IleAsp: 3.704 ± 0.455
3.351IleGlu: 3.351 ± 0.434
0.823IlePhe: 0.823 ± 0.256
3.645IleGly: 3.645 ± 0.551
0.823IleHis: 0.823 ± 0.207
1.117IleIle: 1.117 ± 0.257
0.941IleLys: 0.941 ± 0.212
2.881IleLeu: 2.881 ± 0.439
0.764IleMet: 0.764 ± 0.241
1.352IleAsn: 1.352 ± 0.32
2.704IlePro: 2.704 ± 0.353
1.94IleGln: 1.94 ± 0.346
3.41IleArg: 3.41 ± 0.465
1.999IleSer: 1.999 ± 0.293
2.94IleThr: 2.94 ± 0.493
3.469IleVal: 3.469 ± 0.492
0.706IleTrp: 0.706 ± 0.254
1.176IleTyr: 1.176 ± 0.244
0.0IleXaa: 0.0 ± 0.0
Lys
4.174LysAla: 4.174 ± 0.464
0.176LysCys: 0.176 ± 0.102
1.764LysAsp: 1.764 ± 0.307
1.47LysGlu: 1.47 ± 0.331
0.529LysPhe: 0.529 ± 0.208
2.293LysGly: 2.293 ± 0.334
0.823LysHis: 0.823 ± 0.196
1.176LysIle: 1.176 ± 0.276
1.176LysLys: 1.176 ± 0.307
2.704LysLeu: 2.704 ± 0.446
0.529LysMet: 0.529 ± 0.153
1.176LysAsn: 1.176 ± 0.236
3.234LysPro: 3.234 ± 0.41
1.293LysGln: 1.293 ± 0.438
2.469LysArg: 2.469 ± 0.396
1.117LysSer: 1.117 ± 0.205
2.175LysThr: 2.175 ± 0.39
2.293LysVal: 2.293 ± 0.333
0.706LysTrp: 0.706 ± 0.202
0.647LysTyr: 0.647 ± 0.183
0.0LysXaa: 0.0 ± 0.0
Leu
9.936LeuAla: 9.936 ± 0.922
1.058LeuCys: 1.058 ± 0.284
6.291LeuAsp: 6.291 ± 0.778
4.115LeuGlu: 4.115 ± 0.582
2.646LeuPhe: 2.646 ± 0.629
6.173LeuGly: 6.173 ± 0.737
1.764LeuHis: 1.764 ± 0.43
3.528LeuIle: 3.528 ± 0.376
2.41LeuLys: 2.41 ± 0.336
5.233LeuLeu: 5.233 ± 0.538
1.94LeuMet: 1.94 ± 0.318
2.234LeuAsn: 2.234 ± 0.365
3.939LeuPro: 3.939 ± 0.47
2.058LeuGln: 2.058 ± 0.313
7.349LeuArg: 7.349 ± 0.655
3.939LeuSer: 3.939 ± 0.489
6.644LeuThr: 6.644 ± 0.75
5.82LeuVal: 5.82 ± 0.884
1.529LeuTrp: 1.529 ± 0.318
1.293LeuTyr: 1.293 ± 0.264
0.0LeuXaa: 0.0 ± 0.0
Met
2.646MetAla: 2.646 ± 0.709
0.294MetCys: 0.294 ± 0.137
0.764MetAsp: 0.764 ± 0.229
0.47MetGlu: 0.47 ± 0.169
0.47MetPhe: 0.47 ± 0.152
1.705MetGly: 1.705 ± 0.473
0.588MetHis: 0.588 ± 0.175
0.588MetIle: 0.588 ± 0.229
0.764MetLys: 0.764 ± 0.193
1.646MetLeu: 1.646 ± 0.344
0.176MetMet: 0.176 ± 0.098
0.588MetAsn: 0.588 ± 0.196
1.293MetPro: 1.293 ± 0.22
0.999MetGln: 0.999 ± 0.287
1.881MetArg: 1.881 ± 0.277
2.117MetSer: 2.117 ± 0.335
2.94MetThr: 2.94 ± 0.315
1.235MetVal: 1.235 ± 0.251
0.529MetTrp: 0.529 ± 0.195
0.059MetTyr: 0.059 ± 0.061
0.0MetXaa: 0.0 ± 0.0
Asn
3.234AsnAla: 3.234 ± 0.533
0.118AsnCys: 0.118 ± 0.088
2.058AsnAsp: 2.058 ± 0.381
1.176AsnGlu: 1.176 ± 0.226
0.529AsnPhe: 0.529 ± 0.176
2.763AsnGly: 2.763 ± 0.478
1.117AsnHis: 1.117 ± 0.271
1.058AsnIle: 1.058 ± 0.219
1.058AsnLys: 1.058 ± 0.307
1.94AsnLeu: 1.94 ± 0.326
0.412AsnMet: 0.412 ± 0.149
1.293AsnAsn: 1.293 ± 0.28
1.999AsnPro: 1.999 ± 0.361
1.058AsnGln: 1.058 ± 0.302
2.646AsnArg: 2.646 ± 0.369
1.764AsnSer: 1.764 ± 0.334
2.117AsnThr: 2.117 ± 0.365
2.117AsnVal: 2.117 ± 0.448
0.47AsnTrp: 0.47 ± 0.129
0.706AsnTyr: 0.706 ± 0.253
0.0AsnXaa: 0.0 ± 0.0
Pro
5.938ProAla: 5.938 ± 0.587
0.706ProCys: 0.706 ± 0.238
4.645ProAsp: 4.645 ± 0.794
3.939ProGlu: 3.939 ± 0.557
1.293ProPhe: 1.293 ± 0.295
5.938ProGly: 5.938 ± 0.708
0.647ProHis: 0.647 ± 0.178
2.469ProIle: 2.469 ± 0.29
2.058ProLys: 2.058 ± 0.405
4.233ProLeu: 4.233 ± 0.411
1.293ProMet: 1.293 ± 0.298
1.176ProAsn: 1.176 ± 0.26
2.646ProPro: 2.646 ± 0.463
1.411ProGln: 1.411 ± 0.255
3.88ProArg: 3.88 ± 0.6
2.763ProSer: 2.763 ± 0.415
4.527ProThr: 4.527 ± 0.518
3.586ProVal: 3.586 ± 0.436
1.705ProTrp: 1.705 ± 0.335
0.999ProTyr: 0.999 ± 0.201
0.0ProXaa: 0.0 ± 0.0
Gln
4.468GlnAla: 4.468 ± 0.79
0.47GlnCys: 0.47 ± 0.142
1.646GlnAsp: 1.646 ± 0.361
1.529GlnGlu: 1.529 ± 0.283
1.235GlnPhe: 1.235 ± 0.254
2.293GlnGly: 2.293 ± 0.465
0.529GlnHis: 0.529 ± 0.19
2.469GlnIle: 2.469 ± 0.411
1.117GlnLys: 1.117 ± 0.309
4.174GlnLeu: 4.174 ± 0.384
0.999GlnMet: 0.999 ± 0.25
0.706GlnAsn: 0.706 ± 0.233
1.999GlnPro: 1.999 ± 0.348
1.47GlnGln: 1.47 ± 0.331
4.057GlnArg: 4.057 ± 0.624
1.47GlnSer: 1.47 ± 0.32
1.587GlnThr: 1.587 ± 0.364
2.646GlnVal: 2.646 ± 0.401
0.882GlnTrp: 0.882 ± 0.229
0.353GlnTyr: 0.353 ± 0.167
0.0GlnXaa: 0.0 ± 0.0
Arg
8.172ArgAla: 8.172 ± 0.716
1.352ArgCys: 1.352 ± 0.439
4.939ArgAsp: 4.939 ± 0.592
4.468ArgGlu: 4.468 ± 0.628
1.94ArgPhe: 1.94 ± 0.356
5.35ArgGly: 5.35 ± 0.464
1.823ArgHis: 1.823 ± 0.377
4.645ArgIle: 4.645 ± 0.61
2.763ArgLys: 2.763 ± 0.433
7.702ArgLeu: 7.702 ± 0.542
2.293ArgMet: 2.293 ± 0.347
2.998ArgAsn: 2.998 ± 0.453
3.998ArgPro: 3.998 ± 0.607
2.763ArgGln: 2.763 ± 0.386
6.467ArgArg: 6.467 ± 1.02
4.292ArgSer: 4.292 ± 0.506
5.174ArgThr: 5.174 ± 0.638
4.468ArgVal: 4.468 ± 0.451
2.234ArgTrp: 2.234 ± 0.392
2.352ArgTyr: 2.352 ± 0.348
0.0ArgXaa: 0.0 ± 0.0
Ser
5.82SerAla: 5.82 ± 0.597
0.294SerCys: 0.294 ± 0.144
3.998SerAsp: 3.998 ± 0.555
2.293SerGlu: 2.293 ± 0.344
1.47SerPhe: 1.47 ± 0.284
5.409SerGly: 5.409 ± 0.618
0.882SerHis: 0.882 ± 0.207
2.41SerIle: 2.41 ± 0.289
1.411SerLys: 1.411 ± 0.279
3.822SerLeu: 3.822 ± 0.391
1.47SerMet: 1.47 ± 0.369
1.646SerAsn: 1.646 ± 0.449
3.292SerPro: 3.292 ± 0.414
2.763SerGln: 2.763 ± 0.471
2.94SerArg: 2.94 ± 0.544
3.586SerSer: 3.586 ± 0.587
4.057SerThr: 4.057 ± 0.377
3.528SerVal: 3.528 ± 0.521
1.058SerTrp: 1.058 ± 0.211
1.47SerTyr: 1.47 ± 0.365
0.0SerXaa: 0.0 ± 0.0
Thr
8.878ThrAla: 8.878 ± 0.802
0.647ThrCys: 0.647 ± 0.186
4.233ThrAsp: 4.233 ± 0.502
3.586ThrGlu: 3.586 ± 0.46
1.352ThrPhe: 1.352 ± 0.295
6.585ThrGly: 6.585 ± 0.751
1.411ThrHis: 1.411 ± 0.228
3.822ThrIle: 3.822 ± 0.467
2.058ThrLys: 2.058 ± 0.336
5.174ThrLeu: 5.174 ± 0.588
1.176ThrMet: 1.176 ± 0.262
1.646ThrAsn: 1.646 ± 0.304
5.174ThrPro: 5.174 ± 0.575
1.529ThrGln: 1.529 ± 0.249
6.114ThrArg: 6.114 ± 0.857
3.704ThrSer: 3.704 ± 0.499
5.526ThrThr: 5.526 ± 0.601
4.821ThrVal: 4.821 ± 0.543
1.293ThrTrp: 1.293 ± 0.289
1.587ThrTyr: 1.587 ± 0.339
0.0ThrXaa: 0.0 ± 0.0
Val
8.701ValAla: 8.701 ± 0.654
0.882ValCys: 0.882 ± 0.273
4.703ValAsp: 4.703 ± 0.469
4.703ValGlu: 4.703 ± 0.558
1.881ValPhe: 1.881 ± 0.408
5.762ValGly: 5.762 ± 0.833
1.646ValHis: 1.646 ± 0.255
3.528ValIle: 3.528 ± 0.342
2.234ValLys: 2.234 ± 0.454
4.997ValLeu: 4.997 ± 0.595
0.706ValMet: 0.706 ± 0.186
1.999ValAsn: 1.999 ± 0.316
4.057ValPro: 4.057 ± 0.525
2.469ValGln: 2.469 ± 0.407
5.409ValArg: 5.409 ± 0.448
4.527ValSer: 4.527 ± 0.526
5.115ValThr: 5.115 ± 0.562
4.527ValVal: 4.527 ± 0.524
1.058ValTrp: 1.058 ± 0.324
1.411ValTyr: 1.411 ± 0.377
0.0ValXaa: 0.0 ± 0.0
Trp
1.705TrpAla: 1.705 ± 0.229
0.353TrpCys: 0.353 ± 0.13
1.293TrpAsp: 1.293 ± 0.299
1.117TrpGlu: 1.117 ± 0.231
0.647TrpPhe: 0.647 ± 0.201
0.999TrpGly: 0.999 ± 0.258
0.764TrpHis: 0.764 ± 0.252
0.764TrpIle: 0.764 ± 0.199
0.529TrpLys: 0.529 ± 0.197
1.823TrpLeu: 1.823 ± 0.311
0.412TrpMet: 0.412 ± 0.157
0.588TrpAsn: 0.588 ± 0.171
1.235TrpPro: 1.235 ± 0.266
0.882TrpGln: 0.882 ± 0.186
1.823TrpArg: 1.823 ± 0.366
1.117TrpSer: 1.117 ± 0.247
1.47TrpThr: 1.47 ± 0.277
1.529TrpVal: 1.529 ± 0.324
0.529TrpTrp: 0.529 ± 0.171
0.412TrpTyr: 0.412 ± 0.165
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.469TyrAla: 2.469 ± 0.338
0.176TyrCys: 0.176 ± 0.103
1.352TyrAsp: 1.352 ± 0.32
1.411TyrGlu: 1.411 ± 0.328
0.647TyrPhe: 0.647 ± 0.159
2.352TyrGly: 2.352 ± 0.34
0.47TyrHis: 0.47 ± 0.224
0.706TyrIle: 0.706 ± 0.259
0.353TyrLys: 0.353 ± 0.134
2.41TyrLeu: 2.41 ± 0.525
0.47TyrMet: 0.47 ± 0.165
0.706TyrAsn: 0.706 ± 0.236
1.293TyrPro: 1.293 ± 0.24
0.823TyrGln: 0.823 ± 0.19
1.646TyrArg: 1.646 ± 0.353
1.587TyrSer: 1.587 ± 0.276
1.176TyrThr: 1.176 ± 0.244
1.94TyrVal: 1.94 ± 0.369
0.294TyrTrp: 0.294 ± 0.16
0.47TyrTyr: 0.47 ± 0.133
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 77 proteins (17010 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski