Amino acid dipepetide frequency for Citrobacter phage CR8

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.114AlaAla: 9.114 ± 1.076
0.598AlaCys: 0.598 ± 0.221
6.425AlaAsp: 6.425 ± 0.731
6.574AlaGlu: 6.574 ± 0.79
2.615AlaPhe: 2.615 ± 0.367
8.965AlaGly: 8.965 ± 0.825
1.195AlaHis: 1.195 ± 0.336
4.408AlaIle: 4.408 ± 0.721
6.574AlaLys: 6.574 ± 0.686
7.769AlaLeu: 7.769 ± 0.775
2.988AlaMet: 2.988 ± 0.462
4.109AlaAsn: 4.109 ± 0.454
2.689AlaPro: 2.689 ± 0.336
3.362AlaGln: 3.362 ± 0.574
4.034AlaArg: 4.034 ± 0.567
4.557AlaSer: 4.557 ± 0.655
3.885AlaThr: 3.885 ± 0.559
7.396AlaVal: 7.396 ± 0.922
1.569AlaTrp: 1.569 ± 0.332
2.54AlaTyr: 2.54 ± 0.424
0.0AlaXaa: 0.0 ± 0.0
Cys
0.448CysAla: 0.448 ± 0.187
0.0CysCys: 0.0 ± 0.0
0.598CysAsp: 0.598 ± 0.314
0.523CysGlu: 0.523 ± 0.23
0.374CysPhe: 0.374 ± 0.167
0.971CysGly: 0.971 ± 0.237
0.448CysHis: 0.448 ± 0.223
0.149CysIle: 0.149 ± 0.094
0.971CysLys: 0.971 ± 0.273
0.896CysLeu: 0.896 ± 0.328
0.299CysMet: 0.299 ± 0.227
0.448CysAsn: 0.448 ± 0.26
0.598CysPro: 0.598 ± 0.232
0.224CysGln: 0.224 ± 0.131
0.598CysArg: 0.598 ± 0.311
0.747CysSer: 0.747 ± 0.211
0.149CysThr: 0.149 ± 0.135
0.971CysVal: 0.971 ± 0.272
0.224CysTrp: 0.224 ± 0.143
0.149CysTyr: 0.149 ± 0.135
0.0CysXaa: 0.0 ± 0.0
Asp
5.827AspAla: 5.827 ± 0.775
0.822AspCys: 0.822 ± 0.356
4.109AspAsp: 4.109 ± 0.617
4.183AspGlu: 4.183 ± 0.541
2.54AspPhe: 2.54 ± 0.506
6.574AspGly: 6.574 ± 0.716
1.419AspHis: 1.419 ± 0.298
2.839AspIle: 2.839 ± 0.393
3.959AspLys: 3.959 ± 0.496
4.632AspLeu: 4.632 ± 0.477
1.644AspMet: 1.644 ± 0.279
2.615AspAsn: 2.615 ± 0.396
2.764AspPro: 2.764 ± 0.464
2.391AspGln: 2.391 ± 0.531
2.316AspArg: 2.316 ± 0.38
3.661AspSer: 3.661 ± 0.524
3.063AspThr: 3.063 ± 0.623
4.632AspVal: 4.632 ± 0.519
0.896AspTrp: 0.896 ± 0.201
2.092AspTyr: 2.092 ± 0.293
0.0AspXaa: 0.0 ± 0.0
Glu
8.74GluAla: 8.74 ± 0.984
0.523GluCys: 0.523 ± 0.232
4.931GluAsp: 4.931 ± 0.672
4.706GluGlu: 4.706 ± 0.936
2.166GluPhe: 2.166 ± 0.246
5.08GluGly: 5.08 ± 0.552
0.971GluHis: 0.971 ± 0.349
2.764GluIle: 2.764 ± 0.409
3.287GluLys: 3.287 ± 0.657
5.304GluLeu: 5.304 ± 0.729
2.764GluMet: 2.764 ± 0.492
2.391GluAsn: 2.391 ± 0.427
1.718GluPro: 1.718 ± 0.374
2.988GluGln: 2.988 ± 0.53
3.362GluArg: 3.362 ± 0.459
4.258GluSer: 4.258 ± 0.573
4.333GluThr: 4.333 ± 0.464
4.632GluVal: 4.632 ± 0.652
0.672GluTrp: 0.672 ± 0.26
2.839GluTyr: 2.839 ± 0.441
0.0GluXaa: 0.0 ± 0.0
Phe
2.839PheAla: 2.839 ± 0.419
0.299PheCys: 0.299 ± 0.139
3.212PheAsp: 3.212 ± 0.495
2.166PheGlu: 2.166 ± 0.344
0.747PhePhe: 0.747 ± 0.234
2.316PheGly: 2.316 ± 0.436
0.971PheHis: 0.971 ± 0.28
1.494PheIle: 1.494 ± 0.381
2.764PheLys: 2.764 ± 0.464
2.764PheLeu: 2.764 ± 0.484
1.345PheMet: 1.345 ± 0.329
1.868PheAsn: 1.868 ± 0.377
1.27PhePro: 1.27 ± 0.289
1.121PheGln: 1.121 ± 0.271
1.569PheArg: 1.569 ± 0.279
2.241PheSer: 2.241 ± 0.423
2.241PheThr: 2.241 ± 0.314
2.689PheVal: 2.689 ± 0.471
0.299PheTrp: 0.299 ± 0.168
0.896PheTyr: 0.896 ± 0.224
0.0PheXaa: 0.0 ± 0.0
Gly
6.798GlyAla: 6.798 ± 0.787
0.672GlyCys: 0.672 ± 0.223
4.706GlyAsp: 4.706 ± 0.607
5.976GlyGlu: 5.976 ± 0.876
3.138GlyPhe: 3.138 ± 0.452
5.603GlyGly: 5.603 ± 0.489
1.644GlyHis: 1.644 ± 0.334
4.781GlyIle: 4.781 ± 0.549
5.379GlyLys: 5.379 ± 0.751
6.798GlyLeu: 6.798 ± 0.69
2.391GlyMet: 2.391 ± 0.376
2.689GlyAsn: 2.689 ± 0.399
0.448GlyPro: 0.448 ± 0.213
2.689GlyGln: 2.689 ± 0.429
4.632GlyArg: 4.632 ± 0.471
6.35GlySer: 6.35 ± 0.699
3.735GlyThr: 3.735 ± 0.472
5.155GlyVal: 5.155 ± 0.647
0.747GlyTrp: 0.747 ± 0.209
3.287GlyTyr: 3.287 ± 0.68
0.0GlyXaa: 0.0 ± 0.0
His
0.896HisAla: 0.896 ± 0.243
0.299HisCys: 0.299 ± 0.242
1.046HisAsp: 1.046 ± 0.353
1.644HisGlu: 1.644 ± 0.309
0.747HisPhe: 0.747 ± 0.234
1.494HisGly: 1.494 ± 0.419
0.224HisHis: 0.224 ± 0.144
0.971HisIle: 0.971 ± 0.262
1.121HisLys: 1.121 ± 0.286
1.868HisLeu: 1.868 ± 0.417
0.523HisMet: 0.523 ± 0.185
0.896HisAsn: 0.896 ± 0.228
0.523HisPro: 0.523 ± 0.186
0.523HisGln: 0.523 ± 0.166
1.27HisArg: 1.27 ± 0.367
1.195HisSer: 1.195 ± 0.321
0.747HisThr: 0.747 ± 0.187
1.195HisVal: 1.195 ± 0.31
0.374HisTrp: 0.374 ± 0.146
0.672HisTyr: 0.672 ± 0.21
0.0HisXaa: 0.0 ± 0.0
Ile
4.258IleAla: 4.258 ± 0.641
0.523IleCys: 0.523 ± 0.251
3.063IleAsp: 3.063 ± 0.525
3.138IleGlu: 3.138 ± 0.391
0.896IlePhe: 0.896 ± 0.276
3.885IleGly: 3.885 ± 0.522
1.046IleHis: 1.046 ± 0.265
2.465IleIle: 2.465 ± 0.471
4.632IleLys: 4.632 ± 0.475
3.661IleLeu: 3.661 ± 0.516
1.046IleMet: 1.046 ± 0.232
1.569IleAsn: 1.569 ± 0.352
1.942IlePro: 1.942 ± 0.329
2.316IleGln: 2.316 ± 0.433
3.212IleArg: 3.212 ± 0.487
2.391IleSer: 2.391 ± 0.409
2.988IleThr: 2.988 ± 0.388
2.913IleVal: 2.913 ± 0.341
0.822IleTrp: 0.822 ± 0.229
1.345IleTyr: 1.345 ± 0.302
0.0IleXaa: 0.0 ± 0.0
Lys
7.695LysAla: 7.695 ± 1.018
0.598LysCys: 0.598 ± 0.218
3.885LysAsp: 3.885 ± 0.499
4.632LysGlu: 4.632 ± 0.595
2.465LysPhe: 2.465 ± 0.367
4.109LysGly: 4.109 ± 0.447
1.644LysHis: 1.644 ± 0.432
2.316LysIle: 2.316 ± 0.405
3.959LysLys: 3.959 ± 0.734
6.35LysLeu: 6.35 ± 0.635
2.092LysMet: 2.092 ± 0.343
2.092LysAsn: 2.092 ± 0.398
2.316LysPro: 2.316 ± 0.517
1.942LysGln: 1.942 ± 0.403
3.063LysArg: 3.063 ± 0.492
6.126LysSer: 6.126 ± 0.571
4.258LysThr: 4.258 ± 0.703
5.229LysVal: 5.229 ± 0.675
1.195LysTrp: 1.195 ± 0.306
2.465LysTyr: 2.465 ± 0.376
0.0LysXaa: 0.0 ± 0.0
Leu
6.873LeuAla: 6.873 ± 0.772
0.598LeuCys: 0.598 ± 0.284
4.258LeuAsp: 4.258 ± 0.577
5.752LeuGlu: 5.752 ± 0.702
2.316LeuPhe: 2.316 ± 0.432
5.304LeuGly: 5.304 ± 0.552
1.121LeuHis: 1.121 ± 0.316
3.063LeuIle: 3.063 ± 0.447
7.097LeuLys: 7.097 ± 0.73
4.856LeuLeu: 4.856 ± 0.7
3.138LeuMet: 3.138 ± 0.569
3.959LeuAsn: 3.959 ± 0.522
2.913LeuPro: 2.913 ± 0.533
4.408LeuGln: 4.408 ± 0.708
4.482LeuArg: 4.482 ± 0.54
6.35LeuSer: 6.35 ± 0.754
5.902LeuThr: 5.902 ± 0.76
4.408LeuVal: 4.408 ± 0.451
0.822LeuTrp: 0.822 ± 0.24
2.913LeuTyr: 2.913 ± 0.513
0.0LeuXaa: 0.0 ± 0.0
Met
3.436MetAla: 3.436 ± 0.394
0.374MetCys: 0.374 ± 0.175
2.092MetAsp: 2.092 ± 0.348
1.718MetGlu: 1.718 ± 0.261
1.494MetPhe: 1.494 ± 0.307
2.988MetGly: 2.988 ± 0.752
0.299MetHis: 0.299 ± 0.138
1.121MetIle: 1.121 ± 0.296
2.017MetLys: 2.017 ± 0.432
3.063MetLeu: 3.063 ± 0.407
0.747MetMet: 0.747 ± 0.258
0.971MetAsn: 0.971 ± 0.282
0.896MetPro: 0.896 ± 0.252
1.195MetGln: 1.195 ± 0.268
1.718MetArg: 1.718 ± 0.422
1.345MetSer: 1.345 ± 0.251
2.241MetThr: 2.241 ± 0.345
2.988MetVal: 2.988 ± 0.418
0.224MetTrp: 0.224 ± 0.128
0.822MetTyr: 0.822 ± 0.195
0.0MetXaa: 0.0 ± 0.0
Asn
3.661AsnAla: 3.661 ± 0.524
0.598AsnCys: 0.598 ± 0.197
2.092AsnAsp: 2.092 ± 0.469
2.54AsnGlu: 2.54 ± 0.431
1.195AsnPhe: 1.195 ± 0.247
3.661AsnGly: 3.661 ± 0.533
1.046AsnHis: 1.046 ± 0.366
2.166AsnIle: 2.166 ± 0.425
2.764AsnLys: 2.764 ± 0.415
3.212AsnLeu: 3.212 ± 0.42
0.896AsnMet: 0.896 ± 0.219
1.868AsnAsn: 1.868 ± 0.446
2.913AsnPro: 2.913 ± 0.468
1.868AsnGln: 1.868 ± 0.37
1.644AsnArg: 1.644 ± 0.444
2.316AsnSer: 2.316 ± 0.271
2.54AsnThr: 2.54 ± 0.535
2.764AsnVal: 2.764 ± 0.493
0.598AsnTrp: 0.598 ± 0.209
2.316AsnTyr: 2.316 ± 0.422
0.0AsnXaa: 0.0 ± 0.0
Pro
3.735ProAla: 3.735 ± 0.569
0.374ProCys: 0.374 ± 0.196
2.166ProAsp: 2.166 ± 0.429
3.436ProGlu: 3.436 ± 0.468
0.896ProPhe: 0.896 ± 0.324
0.299ProGly: 0.299 ± 0.164
1.046ProHis: 1.046 ± 0.283
0.896ProIle: 0.896 ± 0.268
2.241ProLys: 2.241 ± 0.409
1.569ProLeu: 1.569 ± 0.353
1.345ProMet: 1.345 ± 0.276
2.241ProAsn: 2.241 ± 0.433
0.523ProPro: 0.523 ± 0.175
1.793ProGln: 1.793 ± 0.435
1.494ProArg: 1.494 ± 0.461
2.316ProSer: 2.316 ± 0.397
2.092ProThr: 2.092 ± 0.385
3.586ProVal: 3.586 ± 0.459
0.822ProTrp: 0.822 ± 0.243
1.494ProTyr: 1.494 ± 0.258
0.0ProXaa: 0.0 ± 0.0
Gln
3.885GlnAla: 3.885 ± 0.497
0.149GlnCys: 0.149 ± 0.131
2.913GlnAsp: 2.913 ± 0.5
2.913GlnGlu: 2.913 ± 0.398
1.868GlnPhe: 1.868 ± 0.315
2.764GlnGly: 2.764 ± 0.375
0.523GlnHis: 0.523 ± 0.211
2.241GlnIle: 2.241 ± 0.383
2.689GlnLys: 2.689 ± 0.551
4.333GlnLeu: 4.333 ± 0.625
1.644GlnMet: 1.644 ± 0.274
1.718GlnAsn: 1.718 ± 0.365
0.971GlnPro: 0.971 ± 0.209
1.868GlnGln: 1.868 ± 0.52
2.391GlnArg: 2.391 ± 0.498
1.942GlnSer: 1.942 ± 0.408
2.092GlnThr: 2.092 ± 0.37
1.569GlnVal: 1.569 ± 0.323
0.523GlnTrp: 0.523 ± 0.204
1.569GlnTyr: 1.569 ± 0.432
0.0GlnXaa: 0.0 ± 0.0
Arg
3.586ArgAla: 3.586 ± 0.545
0.896ArgCys: 0.896 ± 0.261
3.661ArgAsp: 3.661 ± 0.366
3.735ArgGlu: 3.735 ± 0.566
2.017ArgPhe: 2.017 ± 0.369
3.959ArgGly: 3.959 ± 0.572
0.747ArgHis: 0.747 ± 0.201
2.764ArgIle: 2.764 ± 0.503
3.586ArgLys: 3.586 ± 0.512
4.706ArgLeu: 4.706 ± 0.46
1.345ArgMet: 1.345 ± 0.329
2.54ArgAsn: 2.54 ± 0.476
1.494ArgPro: 1.494 ± 0.296
2.166ArgGln: 2.166 ± 0.356
2.092ArgArg: 2.092 ± 0.309
2.913ArgSer: 2.913 ± 0.526
2.54ArgThr: 2.54 ± 0.396
3.586ArgVal: 3.586 ± 0.503
0.672ArgTrp: 0.672 ± 0.278
1.27ArgTyr: 1.27 ± 0.341
0.0ArgXaa: 0.0 ± 0.0
Ser
5.005SerAla: 5.005 ± 0.75
0.896SerCys: 0.896 ± 0.421
4.482SerAsp: 4.482 ± 0.647
3.735SerGlu: 3.735 ± 0.641
3.212SerPhe: 3.212 ± 0.398
5.902SerGly: 5.902 ± 1.043
1.419SerHis: 1.419 ± 0.287
3.511SerIle: 3.511 ± 0.563
4.183SerLys: 4.183 ± 0.527
3.661SerLeu: 3.661 ± 0.523
2.465SerMet: 2.465 ± 0.338
2.391SerAsn: 2.391 ± 0.423
2.465SerPro: 2.465 ± 0.368
2.988SerGln: 2.988 ± 0.429
3.212SerArg: 3.212 ± 0.502
3.138SerSer: 3.138 ± 0.458
3.436SerThr: 3.436 ± 0.607
3.959SerVal: 3.959 ± 0.599
1.121SerTrp: 1.121 ± 0.278
2.166SerTyr: 2.166 ± 0.402
0.0SerXaa: 0.0 ± 0.0
Thr
4.408ThrAla: 4.408 ± 0.566
0.822ThrCys: 0.822 ± 0.252
2.913ThrAsp: 2.913 ± 0.534
3.735ThrGlu: 3.735 ± 0.51
2.391ThrPhe: 2.391 ± 0.451
4.706ThrGly: 4.706 ± 0.591
0.523ThrHis: 0.523 ± 0.189
3.586ThrIle: 3.586 ± 0.576
4.034ThrLys: 4.034 ± 0.472
4.781ThrLeu: 4.781 ± 0.537
1.345ThrMet: 1.345 ± 0.281
2.092ThrAsn: 2.092 ± 0.394
3.362ThrPro: 3.362 ± 0.437
2.017ThrGln: 2.017 ± 0.424
2.988ThrArg: 2.988 ± 0.49
2.689ThrSer: 2.689 ± 0.692
3.212ThrThr: 3.212 ± 0.522
4.109ThrVal: 4.109 ± 0.553
0.822ThrTrp: 0.822 ± 0.216
1.644ThrTyr: 1.644 ± 0.363
0.0ThrXaa: 0.0 ± 0.0
Val
5.827ValAla: 5.827 ± 0.491
0.299ValCys: 0.299 ± 0.159
3.212ValAsp: 3.212 ± 0.378
5.005ValGlu: 5.005 ± 0.532
2.465ValPhe: 2.465 ± 0.428
5.528ValGly: 5.528 ± 0.624
0.971ValHis: 0.971 ± 0.312
4.183ValIle: 4.183 ± 0.488
4.258ValLys: 4.258 ± 0.634
5.902ValLeu: 5.902 ± 0.725
2.465ValMet: 2.465 ± 0.446
3.287ValAsn: 3.287 ± 0.466
2.839ValPro: 2.839 ± 0.339
2.764ValGln: 2.764 ± 0.42
3.735ValArg: 3.735 ± 0.38
5.827ValSer: 5.827 ± 0.644
3.735ValThr: 3.735 ± 0.537
5.528ValVal: 5.528 ± 0.636
0.747ValTrp: 0.747 ± 0.32
2.615ValTyr: 2.615 ± 0.554
0.0ValXaa: 0.0 ± 0.0
Trp
0.747TrpAla: 0.747 ± 0.222
0.149TrpCys: 0.149 ± 0.111
0.672TrpAsp: 0.672 ± 0.232
0.747TrpGlu: 0.747 ± 0.251
0.374TrpPhe: 0.374 ± 0.185
1.195TrpGly: 1.195 ± 0.336
0.224TrpHis: 0.224 ± 0.149
0.224TrpIle: 0.224 ± 0.137
1.121TrpLys: 1.121 ± 0.362
1.718TrpLeu: 1.718 ± 0.368
0.149TrpMet: 0.149 ± 0.081
0.971TrpAsn: 0.971 ± 0.243
0.299TrpPro: 0.299 ± 0.15
0.523TrpGln: 0.523 ± 0.193
0.672TrpArg: 0.672 ± 0.259
1.345TrpSer: 1.345 ± 0.336
0.747TrpThr: 0.747 ± 0.255
1.195TrpVal: 1.195 ± 0.368
0.075TrpTrp: 0.075 ± 0.09
0.598TrpTyr: 0.598 ± 0.203
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.586TyrAla: 3.586 ± 0.405
0.299TyrCys: 0.299 ± 0.176
2.54TyrAsp: 2.54 ± 0.436
1.569TyrGlu: 1.569 ± 0.386
1.195TyrPhe: 1.195 ± 0.361
2.316TyrGly: 2.316 ± 0.475
0.747TyrHis: 0.747 ± 0.308
2.166TyrIle: 2.166 ± 0.481
1.793TyrLys: 1.793 ± 0.45
2.689TyrLeu: 2.689 ± 0.394
1.046TyrMet: 1.046 ± 0.216
1.942TyrAsn: 1.942 ± 0.406
1.419TyrPro: 1.419 ± 0.298
1.419TyrGln: 1.419 ± 0.362
1.793TyrArg: 1.793 ± 0.384
1.718TyrSer: 1.718 ± 0.356
2.241TyrThr: 2.241 ± 0.484
2.689TyrVal: 2.689 ± 0.574
0.523TyrTrp: 0.523 ± 0.238
1.644TyrTyr: 1.644 ± 0.355
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (13387 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski