Amino acid dipepetide frequency for Arthrobacter phage KellEzio

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.957AlaAla: 11.957 ± 0.845
0.815AlaCys: 0.815 ± 0.222
5.978AlaAsp: 5.978 ± 0.53
6.087AlaGlu: 6.087 ± 0.577
2.065AlaPhe: 2.065 ± 0.347
6.848AlaGly: 6.848 ± 0.737
2.065AlaHis: 2.065 ± 0.455
4.837AlaIle: 4.837 ± 0.74
5.924AlaLys: 5.924 ± 0.7
9.13AlaLeu: 9.13 ± 0.908
2.989AlaMet: 2.989 ± 0.472
3.75AlaAsn: 3.75 ± 0.641
4.511AlaPro: 4.511 ± 0.487
3.424AlaGln: 3.424 ± 0.548
4.891AlaArg: 4.891 ± 0.466
4.511AlaSer: 4.511 ± 0.452
6.196AlaThr: 6.196 ± 0.673
8.587AlaVal: 8.587 ± 0.712
2.011AlaTrp: 2.011 ± 0.349
2.663AlaTyr: 2.663 ± 0.37
0.0AlaXaa: 0.0 ± 0.0
Cys
0.652CysAla: 0.652 ± 0.169
0.272CysCys: 0.272 ± 0.124
0.435CysAsp: 0.435 ± 0.224
0.707CysGlu: 0.707 ± 0.205
0.163CysPhe: 0.163 ± 0.111
0.707CysGly: 0.707 ± 0.214
0.38CysHis: 0.38 ± 0.145
0.38CysIle: 0.38 ± 0.145
0.163CysLys: 0.163 ± 0.096
0.707CysLeu: 0.707 ± 0.176
0.217CysMet: 0.217 ± 0.114
0.326CysAsn: 0.326 ± 0.118
0.652CysPro: 0.652 ± 0.189
0.326CysGln: 0.326 ± 0.127
0.707CysArg: 0.707 ± 0.19
0.435CysSer: 0.435 ± 0.151
0.326CysThr: 0.326 ± 0.146
0.489CysVal: 0.489 ± 0.162
0.109CysTrp: 0.109 ± 0.064
0.217CysTyr: 0.217 ± 0.106
0.0CysXaa: 0.0 ± 0.0
Asp
5.598AspAla: 5.598 ± 0.478
0.815AspCys: 0.815 ± 0.219
4.891AspAsp: 4.891 ± 0.873
4.783AspGlu: 4.783 ± 0.56
2.283AspPhe: 2.283 ± 0.344
6.793AspGly: 6.793 ± 0.605
1.196AspHis: 1.196 ± 0.319
2.391AspIle: 2.391 ± 0.323
2.554AspLys: 2.554 ± 0.365
7.011AspLeu: 7.011 ± 0.671
2.228AspMet: 2.228 ± 0.398
1.793AspAsn: 1.793 ± 0.347
3.641AspPro: 3.641 ± 0.415
1.957AspGln: 1.957 ± 0.333
3.207AspArg: 3.207 ± 0.423
3.152AspSer: 3.152 ± 0.504
3.533AspThr: 3.533 ± 0.531
5.217AspVal: 5.217 ± 0.541
2.065AspTrp: 2.065 ± 0.34
1.957AspTyr: 1.957 ± 0.309
0.0AspXaa: 0.0 ± 0.0
Glu
6.63GluAla: 6.63 ± 0.756
0.489GluCys: 0.489 ± 0.179
4.402GluAsp: 4.402 ± 0.652
5.38GluGlu: 5.38 ± 0.686
2.12GluPhe: 2.12 ± 0.385
4.783GluGly: 4.783 ± 0.566
1.522GluHis: 1.522 ± 0.352
4.783GluIle: 4.783 ± 0.596
3.315GluLys: 3.315 ± 0.62
6.522GluLeu: 6.522 ± 0.625
1.413GluMet: 1.413 ± 0.285
2.12GluAsn: 2.12 ± 0.371
2.717GluPro: 2.717 ± 0.382
3.37GluGln: 3.37 ± 0.44
4.022GluArg: 4.022 ± 0.575
3.152GluSer: 3.152 ± 0.368
3.37GluThr: 3.37 ± 0.378
4.891GluVal: 4.891 ± 0.527
1.63GluTrp: 1.63 ± 0.32
1.902GluTyr: 1.902 ± 0.383
0.0GluXaa: 0.0 ± 0.0
Phe
2.772PheAla: 2.772 ± 0.447
0.0PheCys: 0.0 ± 0.0
1.685PheAsp: 1.685 ± 0.387
2.174PheGlu: 2.174 ± 0.343
0.761PhePhe: 0.761 ± 0.216
2.663PheGly: 2.663 ± 0.453
0.543PheHis: 0.543 ± 0.214
1.413PheIle: 1.413 ± 0.467
1.141PheLys: 1.141 ± 0.244
2.88PheLeu: 2.88 ± 0.352
0.707PheMet: 0.707 ± 0.164
1.522PheAsn: 1.522 ± 0.287
1.359PhePro: 1.359 ± 0.273
0.761PheGln: 0.761 ± 0.168
1.848PheArg: 1.848 ± 0.313
1.739PheSer: 1.739 ± 0.296
2.772PheThr: 2.772 ± 0.374
2.174PheVal: 2.174 ± 0.426
0.924PheTrp: 0.924 ± 0.198
0.652PheTyr: 0.652 ± 0.183
0.0PheXaa: 0.0 ± 0.0
Gly
7.609GlyAla: 7.609 ± 1.257
0.543GlyCys: 0.543 ± 0.185
4.565GlyAsp: 4.565 ± 0.678
4.891GlyGlu: 4.891 ± 0.57
3.261GlyPhe: 3.261 ± 0.717
5.978GlyGly: 5.978 ± 0.641
1.359GlyHis: 1.359 ± 0.291
3.696GlyIle: 3.696 ± 0.496
5.163GlyLys: 5.163 ± 0.478
5.707GlyLeu: 5.707 ± 0.666
2.446GlyMet: 2.446 ± 0.382
3.478GlyAsn: 3.478 ± 0.451
3.587GlyPro: 3.587 ± 0.598
2.88GlyGln: 2.88 ± 0.371
4.62GlyArg: 4.62 ± 0.45
5.0GlySer: 5.0 ± 0.903
5.0GlyThr: 5.0 ± 0.796
6.902GlyVal: 6.902 ± 0.601
1.793GlyTrp: 1.793 ± 0.319
2.174GlyTyr: 2.174 ± 0.396
0.0GlyXaa: 0.0 ± 0.0
His
1.739HisAla: 1.739 ± 0.319
0.217HisCys: 0.217 ± 0.106
1.413HisAsp: 1.413 ± 0.317
1.359HisGlu: 1.359 ± 0.322
0.598HisPhe: 0.598 ± 0.197
1.685HisGly: 1.685 ± 0.306
0.598HisHis: 0.598 ± 0.224
0.924HisIle: 0.924 ± 0.221
0.87HisLys: 0.87 ± 0.208
1.848HisLeu: 1.848 ± 0.352
0.435HisMet: 0.435 ± 0.175
0.38HisAsn: 0.38 ± 0.154
1.25HisPro: 1.25 ± 0.345
0.38HisGln: 0.38 ± 0.157
1.25HisArg: 1.25 ± 0.289
0.924HisSer: 0.924 ± 0.267
0.978HisThr: 0.978 ± 0.254
2.337HisVal: 2.337 ± 0.382
0.761HisTrp: 0.761 ± 0.221
0.435HisTyr: 0.435 ± 0.146
0.0HisXaa: 0.0 ± 0.0
Ile
4.674IleAla: 4.674 ± 0.352
0.543IleCys: 0.543 ± 0.146
3.75IleAsp: 3.75 ± 0.411
3.913IleGlu: 3.913 ± 0.535
1.467IlePhe: 1.467 ± 0.391
3.696IleGly: 3.696 ± 0.606
0.598IleHis: 0.598 ± 0.188
2.228IleIle: 2.228 ± 0.316
2.174IleLys: 2.174 ± 0.342
3.913IleLeu: 3.913 ± 0.446
1.141IleMet: 1.141 ± 0.335
1.739IleAsn: 1.739 ± 0.356
2.011IlePro: 2.011 ± 0.357
2.337IleGln: 2.337 ± 0.304
2.826IleArg: 2.826 ± 0.33
2.88IleSer: 2.88 ± 0.38
3.37IleThr: 3.37 ± 0.444
2.609IleVal: 2.609 ± 0.427
1.359IleTrp: 1.359 ± 0.537
1.196IleTyr: 1.196 ± 0.244
0.0IleXaa: 0.0 ± 0.0
Lys
5.326LysAla: 5.326 ± 0.563
0.109LysCys: 0.109 ± 0.082
3.37LysAsp: 3.37 ± 0.422
4.13LysGlu: 4.13 ± 0.621
1.467LysPhe: 1.467 ± 0.292
3.207LysGly: 3.207 ± 0.469
1.087LysHis: 1.087 ± 0.225
2.717LysIle: 2.717 ± 0.391
3.043LysLys: 3.043 ± 0.327
4.13LysLeu: 4.13 ± 0.479
1.359LysMet: 1.359 ± 0.296
1.63LysAsn: 1.63 ± 0.283
2.935LysPro: 2.935 ± 0.489
2.12LysGln: 2.12 ± 0.338
2.446LysArg: 2.446 ± 0.354
2.174LysSer: 2.174 ± 0.278
2.609LysThr: 2.609 ± 0.3
3.37LysVal: 3.37 ± 0.368
0.761LysTrp: 0.761 ± 0.192
1.413LysTyr: 1.413 ± 0.274
0.0LysXaa: 0.0 ± 0.0
Leu
7.554LeuAla: 7.554 ± 0.505
0.598LeuCys: 0.598 ± 0.194
5.815LeuAsp: 5.815 ± 0.607
5.054LeuGlu: 5.054 ± 0.521
1.63LeuPhe: 1.63 ± 0.277
6.033LeuGly: 6.033 ± 0.835
2.283LeuHis: 2.283 ± 0.416
3.804LeuIle: 3.804 ± 0.332
2.989LeuLys: 2.989 ± 0.445
6.522LeuLeu: 6.522 ± 0.634
2.12LeuMet: 2.12 ± 0.328
3.424LeuAsn: 3.424 ± 0.372
4.402LeuPro: 4.402 ± 0.514
3.098LeuGln: 3.098 ± 0.418
4.728LeuArg: 4.728 ± 0.452
4.348LeuSer: 4.348 ± 0.462
6.196LeuThr: 6.196 ± 0.539
5.435LeuVal: 5.435 ± 0.584
1.957LeuTrp: 1.957 ± 0.301
1.739LeuTyr: 1.739 ± 0.285
0.0LeuXaa: 0.0 ± 0.0
Met
3.967MetAla: 3.967 ± 0.413
0.109MetCys: 0.109 ± 0.085
2.011MetAsp: 2.011 ± 0.274
2.174MetGlu: 2.174 ± 0.36
0.761MetPhe: 0.761 ± 0.185
1.685MetGly: 1.685 ± 0.274
0.815MetHis: 0.815 ± 0.222
0.924MetIle: 0.924 ± 0.209
1.141MetLys: 1.141 ± 0.261
1.63MetLeu: 1.63 ± 0.291
0.652MetMet: 0.652 ± 0.21
1.033MetAsn: 1.033 ± 0.272
1.087MetPro: 1.087 ± 0.233
0.815MetGln: 0.815 ± 0.246
1.413MetArg: 1.413 ± 0.263
1.304MetSer: 1.304 ± 0.219
2.065MetThr: 2.065 ± 0.31
1.467MetVal: 1.467 ± 0.251
0.598MetTrp: 0.598 ± 0.188
0.38MetTyr: 0.38 ± 0.144
0.0MetXaa: 0.0 ± 0.0
Asn
4.022AsnAla: 4.022 ± 0.547
0.217AsnCys: 0.217 ± 0.101
1.793AsnAsp: 1.793 ± 0.283
2.174AsnGlu: 2.174 ± 0.284
1.033AsnPhe: 1.033 ± 0.26
4.13AsnGly: 4.13 ± 0.626
0.815AsnHis: 0.815 ± 0.195
1.576AsnIle: 1.576 ± 0.22
1.141AsnLys: 1.141 ± 0.249
2.337AsnLeu: 2.337 ± 0.337
1.087AsnMet: 1.087 ± 0.215
1.304AsnAsn: 1.304 ± 0.309
2.228AsnPro: 2.228 ± 0.341
1.522AsnGln: 1.522 ± 0.266
1.793AsnArg: 1.793 ± 0.343
2.935AsnSer: 2.935 ± 0.486
1.685AsnThr: 1.685 ± 0.347
2.663AsnVal: 2.663 ± 0.511
1.304AsnTrp: 1.304 ± 0.296
1.087AsnTyr: 1.087 ± 0.194
0.0AsnXaa: 0.0 ± 0.0
Pro
4.674ProAla: 4.674 ± 0.451
0.272ProCys: 0.272 ± 0.148
4.511ProAsp: 4.511 ± 0.606
4.348ProGlu: 4.348 ± 0.646
1.576ProPhe: 1.576 ± 0.292
4.239ProGly: 4.239 ± 0.622
0.707ProHis: 0.707 ± 0.183
1.685ProIle: 1.685 ± 0.327
2.011ProLys: 2.011 ± 0.402
3.261ProLeu: 3.261 ± 0.472
1.033ProMet: 1.033 ± 0.286
1.576ProAsn: 1.576 ± 0.273
3.043ProPro: 3.043 ± 0.595
1.902ProGln: 1.902 ± 0.308
2.935ProArg: 2.935 ± 0.486
2.609ProSer: 2.609 ± 0.418
3.315ProThr: 3.315 ± 0.42
4.402ProVal: 4.402 ± 0.475
1.033ProTrp: 1.033 ± 0.198
1.359ProTyr: 1.359 ± 0.251
0.0ProXaa: 0.0 ± 0.0
Gln
3.913GlnAla: 3.913 ± 0.457
0.272GlnCys: 0.272 ± 0.168
1.793GlnAsp: 1.793 ± 0.388
2.391GlnGlu: 2.391 ± 0.321
1.413GlnPhe: 1.413 ± 0.284
2.609GlnGly: 2.609 ± 0.365
1.196GlnHis: 1.196 ± 0.308
2.228GlnIle: 2.228 ± 0.412
1.793GlnLys: 1.793 ± 0.362
3.533GlnLeu: 3.533 ± 0.432
1.087GlnMet: 1.087 ± 0.24
1.033GlnAsn: 1.033 ± 0.179
1.902GlnPro: 1.902 ± 0.341
1.902GlnGln: 1.902 ± 0.353
2.283GlnArg: 2.283 ± 0.428
1.902GlnSer: 1.902 ± 0.318
1.685GlnThr: 1.685 ± 0.344
2.283GlnVal: 2.283 ± 0.338
0.761GlnTrp: 0.761 ± 0.212
0.924GlnTyr: 0.924 ± 0.249
0.0GlnXaa: 0.0 ± 0.0
Arg
4.783ArgAla: 4.783 ± 0.428
0.598ArgCys: 0.598 ± 0.176
3.261ArgAsp: 3.261 ± 0.393
4.565ArgGlu: 4.565 ± 0.504
2.174ArgPhe: 2.174 ± 0.358
3.913ArgGly: 3.913 ± 0.406
1.413ArgHis: 1.413 ± 0.274
3.152ArgIle: 3.152 ± 0.343
4.457ArgLys: 4.457 ± 0.528
3.533ArgLeu: 3.533 ± 0.409
2.12ArgMet: 2.12 ± 0.349
1.739ArgAsn: 1.739 ± 0.372
2.5ArgPro: 2.5 ± 0.388
2.065ArgGln: 2.065 ± 0.351
4.022ArgArg: 4.022 ± 0.525
2.228ArgSer: 2.228 ± 0.288
4.185ArgThr: 4.185 ± 0.417
2.717ArgVal: 2.717 ± 0.377
1.304ArgTrp: 1.304 ± 0.296
1.739ArgTyr: 1.739 ± 0.298
0.0ArgXaa: 0.0 ± 0.0
Ser
4.783SerAla: 4.783 ± 0.568
0.543SerCys: 0.543 ± 0.187
3.315SerAsp: 3.315 ± 0.416
3.315SerGlu: 3.315 ± 0.394
2.174SerPhe: 2.174 ± 0.3
5.87SerGly: 5.87 ± 0.857
0.652SerHis: 0.652 ± 0.199
2.935SerIle: 2.935 ± 0.499
2.5SerLys: 2.5 ± 0.31
3.75SerLeu: 3.75 ± 0.538
1.413SerMet: 1.413 ± 0.222
2.174SerAsn: 2.174 ± 0.306
2.228SerPro: 2.228 ± 0.367
1.957SerGln: 1.957 ± 0.262
2.446SerArg: 2.446 ± 0.409
3.587SerSer: 3.587 ± 0.53
3.967SerThr: 3.967 ± 0.584
4.293SerVal: 4.293 ± 0.633
1.467SerTrp: 1.467 ± 0.265
1.087SerTyr: 1.087 ± 0.256
0.0SerXaa: 0.0 ± 0.0
Thr
7.663ThrAla: 7.663 ± 0.801
0.543ThrCys: 0.543 ± 0.171
4.511ThrAsp: 4.511 ± 0.644
4.185ThrGlu: 4.185 ± 0.427
1.848ThrPhe: 1.848 ± 0.327
6.141ThrGly: 6.141 ± 1.042
0.87ThrHis: 0.87 ± 0.234
2.88ThrIle: 2.88 ± 0.392
2.228ThrLys: 2.228 ± 0.355
4.946ThrLeu: 4.946 ± 0.525
1.033ThrMet: 1.033 ± 0.215
2.554ThrAsn: 2.554 ± 0.434
3.913ThrPro: 3.913 ± 0.448
2.011ThrGln: 2.011 ± 0.384
2.826ThrArg: 2.826 ± 0.345
3.696ThrSer: 3.696 ± 0.845
4.891ThrThr: 4.891 ± 0.537
4.837ThrVal: 4.837 ± 0.529
0.87ThrTrp: 0.87 ± 0.265
1.739ThrTyr: 1.739 ± 0.293
0.0ThrXaa: 0.0 ± 0.0
Val
6.63ValAla: 6.63 ± 0.661
0.87ValCys: 0.87 ± 0.227
6.304ValAsp: 6.304 ± 0.624
3.913ValGlu: 3.913 ± 0.524
2.065ValPhe: 2.065 ± 0.463
5.489ValGly: 5.489 ± 0.569
1.196ValHis: 1.196 ± 0.258
3.587ValIle: 3.587 ± 0.426
4.62ValLys: 4.62 ± 0.555
5.272ValLeu: 5.272 ± 0.564
1.467ValMet: 1.467 ± 0.274
2.826ValAsn: 2.826 ± 0.495
3.75ValPro: 3.75 ± 0.531
2.826ValGln: 2.826 ± 0.281
4.946ValArg: 4.946 ± 0.5
3.75ValSer: 3.75 ± 0.547
4.674ValThr: 4.674 ± 0.665
4.293ValVal: 4.293 ± 0.499
2.011ValTrp: 2.011 ± 0.48
1.685ValTyr: 1.685 ± 0.294
0.0ValXaa: 0.0 ± 0.0
Trp
1.957TrpAla: 1.957 ± 0.241
0.326TrpCys: 0.326 ± 0.124
1.63TrpAsp: 1.63 ± 0.333
1.685TrpGlu: 1.685 ± 0.389
0.761TrpPhe: 0.761 ± 0.185
1.63TrpGly: 1.63 ± 0.315
0.652TrpHis: 0.652 ± 0.209
1.413TrpIle: 1.413 ± 0.304
1.141TrpLys: 1.141 ± 0.208
1.467TrpLeu: 1.467 ± 0.264
0.489TrpMet: 0.489 ± 0.17
1.63TrpAsn: 1.63 ± 0.598
0.978TrpPro: 0.978 ± 0.236
0.598TrpGln: 0.598 ± 0.141
0.978TrpArg: 0.978 ± 0.209
1.848TrpSer: 1.848 ± 0.339
2.12TrpThr: 2.12 ± 0.303
1.63TrpVal: 1.63 ± 0.276
1.033TrpTrp: 1.033 ± 0.273
0.543TrpTyr: 0.543 ± 0.177
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.337TyrAla: 2.337 ± 0.421
0.217TyrCys: 0.217 ± 0.101
1.63TyrAsp: 1.63 ± 0.274
1.304TyrGlu: 1.304 ± 0.25
0.87TyrPhe: 0.87 ± 0.207
2.554TyrGly: 2.554 ± 0.264
0.435TyrHis: 0.435 ± 0.156
0.87TyrIle: 0.87 ± 0.214
1.033TyrLys: 1.033 ± 0.274
1.63TyrLeu: 1.63 ± 0.286
0.543TyrMet: 0.543 ± 0.168
0.815TyrAsn: 0.815 ± 0.223
1.793TyrPro: 1.793 ± 0.333
0.707TyrGln: 0.707 ± 0.18
2.283TyrArg: 2.283 ± 0.396
2.283TyrSer: 2.283 ± 0.359
1.25TyrThr: 1.25 ± 0.308
1.576TyrVal: 1.576 ± 0.277
0.761TyrTrp: 0.761 ± 0.19
0.761TyrTyr: 0.761 ± 0.188
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 98 proteins (18401 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski