Amino acid dipeptide frequency for Arthrobacter phage Grekaycon

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
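As a rough illustration of what such a calculation involves, here is a minimal Python sketch. It assumes that overlapping pairs are counted within each protein (never across protein boundaries) and that counts are normalized by the total number of pairs; the page does not state the exact normalization used, so treat both as assumptions. The function name dipeptide_per_mille and the toy sequences are hypothetical, and one-letter codes are used instead of the three-letter codes shown in the table.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for one-letter sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs inside one protein; pairs never span
        # two different proteins (an assumption, see the text above).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the Grekaycon proteome.
toy_proteins = ["MAAAG", "AAK"]
freqs = dipeptide_per_mille(toy_proteins)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

By construction the returned frequencies sum to 1000‰.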

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
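The uniform baseline and the per-mille scaling described above can be sketched in a few lines of Python. This is only an illustration: the page does not say whether its red/blue marking uses the 400- or the 441-dipeptide baseline, or whether any tolerance is applied, so the strict comparison against 2.5‰ below is an assumption. The names classify and per_mille_to_fraction are hypothetical; the three example values are copied from the table.

```python
# Uniform baselines: every dipeptide equally likely.
EXPECTED_20 = 1000.0 / (20 * 20)  # 2.5 per mille, i.e. 0.25 %
EXPECTED_21 = 1000.0 / (21 * 21)  # roughly 2.268 per mille when Xaa is included


def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=EXPECTED_20):
    """Label a dipeptide relative to the uniform baseline (assumed strict)."""
    if value_per_mille > expected:
        return "over"    # would be marked red
    if value_per_mille < expected:
        return "under"   # would be marked blue
    return "as expected"


# Values copied from the table.
examples = {"AlaAla": 19.019, "IleLeu": 2.432, "CysCys": 0.0}
labels = {name: classify(value) for name, value in examples.items()}
print(labels)
print(per_mille_to_fraction(examples["AlaAla"]))
```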

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 19.019 ± 2.033
AlaCys: 0.686 ± 0.238
AlaAsp: 7.358 ± 0.831
AlaGlu: 8.044 ± 0.931
AlaPhe: 3.055 ± 0.525
AlaGly: 9.353 ± 1.029
AlaHis: 2.245 ± 0.352
AlaIle: 4.801 ± 0.558
AlaLys: 4.864 ± 0.548
AlaLeu: 9.977 ± 1.011
AlaMet: 2.931 ± 0.445
AlaAsn: 3.055 ± 0.347
AlaPro: 6.173 ± 0.83
AlaGln: 4.988 ± 0.627
AlaArg: 8.293 ± 0.796
AlaSer: 6.049 ± 0.787
AlaThr: 6.859 ± 0.969
AlaVal: 7.483 ± 0.822
AlaTrp: 1.559 ± 0.294
AlaTyr: 2.619 ± 0.36
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.312 ± 0.156
CysCys: 0.0 ± 0.0
CysAsp: 0.499 ± 0.226
CysGlu: 0.436 ± 0.147
CysPhe: 0.187 ± 0.1
CysGly: 0.686 ± 0.248
CysHis: 0.125 ± 0.085
CysIle: 0.187 ± 0.128
CysLys: 0.249 ± 0.136
CysLeu: 0.748 ± 0.209
CysMet: 0.249 ± 0.161
CysAsn: 0.187 ± 0.133
CysPro: 0.374 ± 0.141
CysGln: 0.125 ± 0.09
CysArg: 0.374 ± 0.207
CysSer: 0.561 ± 0.209
CysThr: 0.374 ± 0.152
CysVal: 0.624 ± 0.198
CysTrp: 0.187 ± 0.144
CysTyr: 0.187 ± 0.112
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.106 ± 0.765
AspCys: 0.561 ± 0.195
AspAsp: 5.176 ± 0.705
AspGlu: 5.674 ± 0.726
AspPhe: 1.746 ± 0.31
AspGly: 5.924 ± 0.756
AspHis: 1.309 ± 0.266
AspIle: 3.055 ± 0.408
AspLys: 3.741 ± 0.494
AspLeu: 3.991 ± 0.462
AspMet: 0.935 ± 0.228
AspAsn: 2.182 ± 0.33
AspPro: 2.806 ± 0.435
AspGln: 1.684 ± 0.345
AspArg: 2.806 ± 0.35
AspSer: 2.557 ± 0.375
AspThr: 3.554 ± 0.44
AspVal: 6.111 ± 0.738
AspTrp: 1.372 ± 0.343
AspTyr: 1.372 ± 0.281
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.921 ± 0.826
GluCys: 0.499 ± 0.233
GluAsp: 3.554 ± 0.573
GluGlu: 3.866 ± 0.471
GluPhe: 1.746 ± 0.345
GluGly: 3.492 ± 0.548
GluHis: 1.746 ± 0.339
GluIle: 3.554 ± 0.479
GluLys: 2.931 ± 0.482
GluLeu: 6.797 ± 0.817
GluMet: 1.871 ± 0.296
GluAsn: 1.933 ± 0.293
GluPro: 3.741 ± 0.398
GluGln: 3.305 ± 0.581
GluArg: 4.24 ± 0.57
GluSer: 3.679 ± 0.476
GluThr: 3.118 ± 0.434
GluVal: 3.679 ± 0.498
GluTrp: 0.686 ± 0.2
GluTyr: 1.808 ± 0.384
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.305 ± 0.537
PheCys: 0.187 ± 0.096
PheAsp: 1.995 ± 0.315
PheGlu: 2.432 ± 0.419
PhePhe: 0.748 ± 0.251
PheGly: 2.494 ± 0.393
PheHis: 0.249 ± 0.155
PheIle: 1.309 ± 0.233
PheLys: 1.871 ± 0.394
PheLeu: 1.434 ± 0.268
PheMet: 0.998 ± 0.25
PheAsn: 1.06 ± 0.263
PhePro: 1.309 ± 0.365
PheGln: 0.561 ± 0.189
PheArg: 1.309 ± 0.371
PheSer: 2.058 ± 0.371
PheThr: 2.12 ± 0.345
PheVal: 2.993 ± 0.323
PheTrp: 0.624 ± 0.211
PheTyr: 0.686 ± 0.237
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.418 ± 0.826
GlyCys: 0.187 ± 0.116
GlyAsp: 5.238 ± 0.629
GlyGlu: 3.866 ± 0.515
GlyPhe: 2.307 ± 0.367
GlyGly: 6.423 ± 1.302
GlyHis: 1.372 ± 0.295
GlyIle: 3.305 ± 0.539
GlyLys: 3.554 ± 0.413
GlyLeu: 4.739 ± 0.711
GlyMet: 1.746 ± 0.287
GlyAsn: 3.18 ± 0.554
GlyPro: 2.868 ± 0.51
GlyGln: 1.933 ± 0.348
GlyArg: 5.113 ± 0.53
GlySer: 4.739 ± 0.596
GlyThr: 4.926 ± 0.662
GlyVal: 6.36 ± 0.58
GlyTrp: 1.621 ± 0.428
GlyTyr: 2.245 ± 0.365
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.494 ± 0.433
HisCys: 0.0 ± 0.0
HisAsp: 1.746 ± 0.293
HisGlu: 0.998 ± 0.228
HisPhe: 0.998 ± 0.227
HisGly: 0.249 ± 0.137
HisHis: 0.436 ± 0.175
HisIle: 1.122 ± 0.246
HisLys: 0.811 ± 0.257
HisLeu: 1.559 ± 0.327
HisMet: 0.374 ± 0.165
HisAsn: 0.748 ± 0.215
HisPro: 0.811 ± 0.256
HisGln: 0.436 ± 0.149
HisArg: 0.873 ± 0.26
HisSer: 0.312 ± 0.146
HisThr: 1.746 ± 0.436
HisVal: 1.871 ± 0.315
HisTrp: 0.436 ± 0.142
HisTyr: 0.561 ± 0.19
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.986 ± 0.683
IleCys: 0.249 ± 0.117
IleAsp: 4.178 ± 0.447
IleGlu: 4.365 ± 0.516
IlePhe: 1.497 ± 0.275
IleGly: 4.614 ± 0.656
IleHis: 0.499 ± 0.159
IleIle: 2.806 ± 0.46
IleLys: 2.744 ± 0.348
IleLeu: 2.432 ± 0.41
IleMet: 1.559 ± 0.37
IleAsn: 1.684 ± 0.333
IlePro: 1.621 ± 0.328
IleGln: 1.185 ± 0.275
IleArg: 1.995 ± 0.423
IleSer: 2.993 ± 0.428
IleThr: 3.367 ± 0.53
IleVal: 4.864 ± 0.596
IleTrp: 0.561 ± 0.172
IleTyr: 1.372 ± 0.311
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.988 ± 0.593
LysCys: 0.312 ± 0.157
LysAsp: 1.684 ± 0.347
LysGlu: 1.808 ± 0.337
LysPhe: 1.746 ± 0.355
LysGly: 3.43 ± 0.448
LysHis: 1.309 ± 0.336
LysIle: 2.681 ± 0.493
LysLys: 1.122 ± 0.357
LysLeu: 5.737 ± 0.646
LysMet: 1.559 ± 0.329
LysAsn: 0.873 ± 0.308
LysPro: 2.931 ± 0.536
LysGln: 2.806 ± 0.492
LysArg: 4.115 ± 0.65
LysSer: 2.245 ± 0.41
LysThr: 3.305 ± 0.476
LysVal: 2.557 ± 0.459
LysTrp: 0.748 ± 0.2
LysTyr: 1.185 ± 0.288
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.169 ± 0.627
LeuCys: 0.249 ± 0.145
LeuAsp: 5.363 ± 0.545
LeuGlu: 3.804 ± 0.489
LeuPhe: 2.058 ± 0.331
LeuGly: 5.176 ± 1.057
LeuHis: 1.497 ± 0.312
LeuIle: 4.053 ± 0.571
LeuLys: 3.43 ± 0.411
LeuLeu: 7.296 ± 0.76
LeuMet: 2.182 ± 0.478
LeuAsn: 2.681 ± 0.396
LeuPro: 3.991 ± 0.433
LeuGln: 3.492 ± 0.477
LeuArg: 5.986 ± 0.601
LeuSer: 5.176 ± 0.661
LeuThr: 6.61 ± 0.628
LeuVal: 5.737 ± 0.566
LeuTrp: 0.624 ± 0.23
LeuTyr: 1.684 ± 0.307
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.868 ± 0.512
MetCys: 0.249 ± 0.123
MetAsp: 0.624 ± 0.222
MetGlu: 0.748 ± 0.285
MetPhe: 0.561 ± 0.18
MetGly: 1.247 ± 0.319
MetHis: 0.312 ± 0.147
MetIle: 0.935 ± 0.23
MetLys: 1.185 ± 0.292
MetLeu: 1.808 ± 0.334
MetMet: 0.561 ± 0.196
MetAsn: 0.811 ± 0.243
MetPro: 1.559 ± 0.433
MetGln: 1.309 ± 0.223
MetArg: 2.12 ± 0.311
MetSer: 2.12 ± 0.309
MetThr: 3.118 ± 0.43
MetVal: 0.998 ± 0.252
MetTrp: 0.187 ± 0.102
MetTyr: 0.374 ± 0.145
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.741 ± 0.452
AsnCys: 0.436 ± 0.183
AsnAsp: 1.746 ± 0.356
AsnGlu: 2.557 ± 0.408
AsnPhe: 0.811 ± 0.177
AsnGly: 3.18 ± 0.438
AsnHis: 0.748 ± 0.215
AsnIle: 2.182 ± 0.352
AsnLys: 1.995 ± 0.389
AsnLeu: 2.432 ± 0.32
AsnMet: 0.811 ± 0.198
AsnAsn: 1.247 ± 0.423
AsnPro: 2.37 ± 0.473
AsnGln: 1.122 ± 0.37
AsnArg: 1.933 ± 0.388
AsnSer: 1.372 ± 0.217
AsnThr: 2.182 ± 0.385
AsnVal: 2.432 ± 0.349
AsnTrp: 0.935 ± 0.196
AsnTyr: 0.686 ± 0.236
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.919 ± 1.036
ProCys: 0.374 ± 0.139
ProAsp: 3.43 ± 0.713
ProGlu: 3.367 ± 0.443
ProPhe: 0.873 ± 0.245
ProGly: 3.991 ± 0.603
ProHis: 1.122 ± 0.274
ProIle: 3.367 ± 0.632
ProLys: 1.684 ± 0.306
ProLeu: 3.554 ± 0.466
ProMet: 0.561 ± 0.182
ProAsn: 2.182 ± 0.476
ProPro: 2.619 ± 0.796
ProGln: 1.559 ± 0.32
ProArg: 2.245 ± 0.384
ProSer: 2.494 ± 0.443
ProThr: 3.243 ± 0.517
ProVal: 2.744 ± 0.427
ProTrp: 0.811 ± 0.226
ProTyr: 1.372 ± 0.325
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.053 ± 0.4
GlnCys: 0.561 ± 0.219
GlnAsp: 1.684 ± 0.318
GlnGlu: 1.684 ± 0.362
GlnPhe: 1.185 ± 0.3
GlnGly: 2.806 ± 0.431
GlnHis: 0.998 ± 0.226
GlnIle: 1.933 ± 0.341
GlnLys: 1.808 ± 0.308
GlnLeu: 4.053 ± 0.667
GlnMet: 0.998 ± 0.241
GlnAsn: 1.122 ± 0.212
GlnPro: 1.933 ± 0.374
GlnGln: 2.307 ± 0.478
GlnArg: 3.243 ± 0.595
GlnSer: 2.12 ± 0.362
GlnThr: 2.307 ± 0.479
GlnVal: 2.245 ± 0.383
GlnTrp: 0.436 ± 0.137
GlnTyr: 0.624 ± 0.231
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.046 ± 0.627
ArgCys: 0.686 ± 0.202
ArgAsp: 4.178 ± 0.403
ArgGlu: 5.363 ± 0.633
ArgPhe: 2.182 ± 0.403
ArgGly: 3.617 ± 0.479
ArgHis: 0.873 ± 0.228
ArgIle: 2.868 ± 0.41
ArgLys: 3.741 ± 0.534
ArgLeu: 5.051 ± 0.513
ArgMet: 1.122 ± 0.294
ArgAsn: 2.432 ± 0.386
ArgPro: 2.993 ± 0.536
ArgGln: 2.681 ± 0.41
ArgArg: 3.243 ± 0.52
ArgSer: 3.928 ± 0.507
ArgThr: 3.43 ± 0.542
ArgVal: 4.49 ± 0.76
ArgTrp: 1.434 ± 0.361
ArgTyr: 1.684 ± 0.318
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.111 ± 0.751
SerCys: 0.249 ± 0.142
SerAsp: 3.804 ± 0.477
SerGlu: 3.243 ± 0.364
SerPhe: 1.871 ± 0.324
SerGly: 4.614 ± 0.847
SerHis: 1.06 ± 0.276
SerIle: 3.367 ± 0.447
SerLys: 2.931 ± 0.491
SerLeu: 3.679 ± 0.514
SerMet: 1.621 ± 0.301
SerAsn: 2.182 ± 0.409
SerPro: 2.058 ± 0.339
SerGln: 1.621 ± 0.293
SerArg: 2.681 ± 0.489
SerSer: 3.617 ± 0.647
SerThr: 4.614 ± 0.595
SerVal: 4.926 ± 0.593
SerTrp: 0.624 ± 0.2
SerTyr: 0.998 ± 0.207
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.293 ± 1.109
ThrCys: 0.561 ± 0.197
ThrAsp: 3.492 ± 0.419
ThrGlu: 3.804 ± 0.548
ThrPhe: 2.806 ± 0.371
ThrGly: 4.614 ± 0.572
ThrHis: 1.559 ± 0.406
ThrIle: 3.367 ± 0.474
ThrLys: 2.494 ± 0.421
ThrLeu: 5.487 ± 0.603
ThrMet: 1.309 ± 0.315
ThrAsn: 1.995 ± 0.422
ThrPro: 4.053 ± 0.585
ThrGln: 2.182 ± 0.33
ThrArg: 4.365 ± 0.469
ThrSer: 2.993 ± 0.564
ThrThr: 5.176 ± 0.531
ThrVal: 6.173 ± 0.724
ThrTrp: 0.811 ± 0.263
ThrTyr: 1.497 ± 0.348
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.545 ± 0.695
ValCys: 0.187 ± 0.109
ValAsp: 6.236 ± 0.628
ValGlu: 5.051 ± 0.583
ValPhe: 2.058 ± 0.307
ValGly: 4.677 ± 0.685
ValHis: 0.624 ± 0.203
ValIle: 4.49 ± 0.535
ValLys: 3.991 ± 0.493
ValLeu: 5.799 ± 0.647
ValMet: 1.309 ± 0.243
ValAsn: 3.43 ± 0.464
ValPro: 3.617 ± 0.545
ValGln: 3.617 ± 0.493
ValArg: 5.113 ± 0.683
ValSer: 4.864 ± 0.522
ValThr: 5.425 ± 0.731
ValVal: 6.236 ± 0.681
ValTrp: 0.811 ± 0.202
ValTyr: 1.372 ± 0.278
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.684 ± 0.339
TrpCys: 0.249 ± 0.145
TrpAsp: 0.873 ± 0.226
TrpGlu: 0.998 ± 0.252
TrpPhe: 0.561 ± 0.244
TrpGly: 0.998 ± 0.244
TrpHis: 0.312 ± 0.151
TrpIle: 0.686 ± 0.259
TrpLys: 0.748 ± 0.218
TrpLeu: 1.497 ± 0.349
TrpMet: 0.374 ± 0.181
TrpAsn: 0.686 ± 0.194
TrpPro: 0.686 ± 0.221
TrpGln: 0.312 ± 0.117
TrpArg: 1.247 ± 0.301
TrpSer: 0.686 ± 0.187
TrpThr: 0.686 ± 0.209
TrpVal: 1.247 ± 0.242
TrpTrp: 0.312 ± 0.138
TrpTyr: 0.249 ± 0.107
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.432 ± 0.351
TyrCys: 0.187 ± 0.117
TyrAsp: 1.808 ± 0.368
TyrGlu: 1.06 ± 0.334
TyrPhe: 0.748 ± 0.189
TyrGly: 2.557 ± 0.421
TyrHis: 0.187 ± 0.109
TyrIle: 0.499 ± 0.195
TyrLys: 1.247 ± 0.343
TyrLeu: 1.185 ± 0.253
TyrMet: 0.561 ± 0.191
TyrAsn: 1.247 ± 0.212
TyrPro: 0.998 ± 0.244
TyrGln: 0.873 ± 0.236
TyrArg: 1.746 ± 0.405
TyrSer: 1.372 ± 0.322
TyrThr: 0.873 ± 0.194
TyrVal: 2.681 ± 0.425
TyrTrp: 0.312 ± 0.14
TyrTyr: 0.374 ± 0.164
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 78 proteins (16038 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
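For readers unfamiliar with protein-level bootstrapping, the following Python sketch shows one way such an error could be obtained: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. Taking the standard deviation of the replicates as the error is an assumption, as are the overlapping-pair counting and the fixed random seed; the page does not describe these details. The names pair_counts and bootstrap_error and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each replicate resamples whole proteins with replacement, so residues
    from the same protein always stay together.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input, not taken from the Grekaycon proteome.
toy_proteins = ["MAAAG", "AAK", "GGAA", "MKLV"]
mean_aa, sd_aa = bootstrap_error(toy_proteins, "AA")
print(f"AA: {mean_aa:.3f} ± {sd_aa:.3f}")
```

Resampling proteins rather than individual residues keeps the composition of each protein intact, which matters when a few proteins are unusually rich in one dipeptide.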

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski