Amino acid dipepetide frequency for Microbacterium phage Kelcole

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.882AlaAla: 14.882 ± 1.1
1.002AlaCys: 1.002 ± 0.272
7.266AlaAsp: 7.266 ± 0.762
8.117AlaGlu: 8.117 ± 0.702
2.806AlaPhe: 2.806 ± 0.46
7.817AlaGly: 7.817 ± 0.786
2.054AlaHis: 2.054 ± 0.318
4.961AlaIle: 4.961 ± 0.588
3.357AlaLys: 3.357 ± 0.463
10.122AlaLeu: 10.122 ± 0.799
3.207AlaMet: 3.207 ± 0.41
3.257AlaAsn: 3.257 ± 0.475
4.961AlaPro: 4.961 ± 0.622
4.109AlaGln: 4.109 ± 0.543
6.664AlaArg: 6.664 ± 0.614
6.965AlaSer: 6.965 ± 0.702
7.817AlaThr: 7.817 ± 0.937
8.619AlaVal: 8.619 ± 1.164
1.754AlaTrp: 1.754 ± 0.299
2.756AlaTyr: 2.756 ± 0.461
0.0AlaXaa: 0.0 ± 0.0
Cys
0.601CysAla: 0.601 ± 0.236
0.1CysCys: 0.1 ± 0.076
0.451CysAsp: 0.451 ± 0.145
0.702CysGlu: 0.702 ± 0.2
0.251CysPhe: 0.251 ± 0.121
1.102CysGly: 1.102 ± 0.239
0.251CysHis: 0.251 ± 0.111
0.301CysIle: 0.301 ± 0.121
0.251CysLys: 0.251 ± 0.097
0.401CysLeu: 0.401 ± 0.154
0.05CysMet: 0.05 ± 0.048
0.351CysAsn: 0.351 ± 0.127
0.702CysPro: 0.702 ± 0.222
0.251CysGln: 0.251 ± 0.1
0.651CysArg: 0.651 ± 0.16
0.451CysSer: 0.451 ± 0.187
0.451CysThr: 0.451 ± 0.136
0.501CysVal: 0.501 ± 0.15
0.251CysTrp: 0.251 ± 0.107
0.301CysTyr: 0.301 ± 0.136
0.0CysXaa: 0.0 ± 0.0
Asp
7.967AspAla: 7.967 ± 0.672
0.401AspCys: 0.401 ± 0.094
4.46AspAsp: 4.46 ± 0.498
4.56AspGlu: 4.56 ± 0.435
1.954AspPhe: 1.954 ± 0.354
6.063AspGly: 6.063 ± 0.549
1.102AspHis: 1.102 ± 0.309
2.906AspIle: 2.906 ± 0.381
1.453AspLys: 1.453 ± 0.321
4.51AspLeu: 4.51 ± 0.596
1.553AspMet: 1.553 ± 0.302
1.854AspAsn: 1.854 ± 0.319
3.658AspPro: 3.658 ± 0.469
2.355AspGln: 2.355 ± 0.442
3.959AspArg: 3.959 ± 0.542
3.357AspSer: 3.357 ± 0.419
2.505AspThr: 2.505 ± 0.315
3.959AspVal: 3.959 ± 0.447
1.503AspTrp: 1.503 ± 0.236
1.654AspTyr: 1.654 ± 0.503
0.0AspXaa: 0.0 ± 0.0
Glu
7.115GluAla: 7.115 ± 0.857
0.601GluCys: 0.601 ± 0.195
4.009GluAsp: 4.009 ± 0.518
5.612GluGlu: 5.612 ± 0.757
2.004GluPhe: 2.004 ± 0.362
4.66GluGly: 4.66 ± 0.608
1.854GluHis: 1.854 ± 0.321
4.059GluIle: 4.059 ± 0.652
2.806GluLys: 2.806 ± 0.449
6.915GluLeu: 6.915 ± 0.729
1.403GluMet: 1.403 ± 0.282
2.054GluAsn: 2.054 ± 0.317
3.357GluPro: 3.357 ± 0.557
3.407GluGln: 3.407 ± 0.465
4.61GluArg: 4.61 ± 0.52
3.357GluSer: 3.357 ± 0.328
3.658GluThr: 3.658 ± 0.537
5.662GluVal: 5.662 ± 0.483
1.453GluTrp: 1.453 ± 0.223
2.305GluTyr: 2.305 ± 0.336
0.0GluXaa: 0.0 ± 0.0
Phe
3.307PheAla: 3.307 ± 0.453
0.301PheCys: 0.301 ± 0.12
2.105PheAsp: 2.105 ± 0.419
1.954PheGlu: 1.954 ± 0.307
0.702PhePhe: 0.702 ± 0.144
3.608PheGly: 3.608 ± 0.528
0.501PheHis: 0.501 ± 0.172
0.802PheIle: 0.802 ± 0.156
0.752PheLys: 0.752 ± 0.227
1.854PheLeu: 1.854 ± 0.338
0.501PheMet: 0.501 ± 0.146
0.601PheAsn: 0.601 ± 0.171
1.253PhePro: 1.253 ± 0.238
0.601PheGln: 0.601 ± 0.152
1.754PheArg: 1.754 ± 0.244
1.854PheSer: 1.854 ± 0.284
2.305PheThr: 2.305 ± 0.4
1.654PheVal: 1.654 ± 0.269
0.351PheTrp: 0.351 ± 0.149
0.752PheTyr: 0.752 ± 0.19
0.0PheXaa: 0.0 ± 0.0
Gly
8.619GlyAla: 8.619 ± 0.923
0.802GlyCys: 0.802 ± 0.261
5.562GlyAsp: 5.562 ± 0.602
5.011GlyGlu: 5.011 ± 0.772
2.555GlyPhe: 2.555 ± 0.373
8.819GlyGly: 8.819 ± 0.836
1.553GlyHis: 1.553 ± 0.283
3.558GlyIle: 3.558 ± 0.449
3.808GlyLys: 3.808 ± 0.635
5.913GlyLeu: 5.913 ± 0.609
2.305GlyMet: 2.305 ± 0.322
2.856GlyAsn: 2.856 ± 0.412
2.856GlyPro: 2.856 ± 0.361
2.706GlyGln: 2.706 ± 0.492
5.061GlyArg: 5.061 ± 0.54
4.81GlySer: 4.81 ± 0.774
6.965GlyThr: 6.965 ± 0.854
6.965GlyVal: 6.965 ± 0.498
1.754GlyTrp: 1.754 ± 0.328
2.906GlyTyr: 2.906 ± 0.312
0.0GlyXaa: 0.0 ± 0.0
His
2.054HisAla: 2.054 ± 0.317
0.351HisCys: 0.351 ± 0.146
0.752HisAsp: 0.752 ± 0.233
1.754HisGlu: 1.754 ± 0.386
0.401HisPhe: 0.401 ± 0.113
1.353HisGly: 1.353 ± 0.261
0.451HisHis: 0.451 ± 0.146
0.902HisIle: 0.902 ± 0.29
0.301HisLys: 0.301 ± 0.11
1.654HisLeu: 1.654 ± 0.364
0.601HisMet: 0.601 ± 0.162
0.551HisAsn: 0.551 ± 0.161
1.403HisPro: 1.403 ± 0.341
0.651HisGln: 0.651 ± 0.19
1.002HisArg: 1.002 ± 0.259
0.702HisSer: 0.702 ± 0.195
0.902HisThr: 0.902 ± 0.182
1.654HisVal: 1.654 ± 0.309
0.351HisTrp: 0.351 ± 0.126
0.451HisTyr: 0.451 ± 0.127
0.0HisXaa: 0.0 ± 0.0
Ile
5.512IleAla: 5.512 ± 0.697
0.301IleCys: 0.301 ± 0.12
3.808IleAsp: 3.808 ± 0.384
4.409IleGlu: 4.409 ± 0.412
1.303IlePhe: 1.303 ± 0.243
4.109IleGly: 4.109 ± 0.467
0.852IleHis: 0.852 ± 0.215
2.155IleIle: 2.155 ± 0.436
1.102IleLys: 1.102 ± 0.211
2.956IleLeu: 2.956 ± 0.357
0.651IleMet: 0.651 ± 0.181
1.603IleAsn: 1.603 ± 0.296
3.658IlePro: 3.658 ± 0.978
1.453IleGln: 1.453 ± 0.288
3.107IleArg: 3.107 ± 0.395
2.856IleSer: 2.856 ± 0.382
3.658IleThr: 3.658 ± 0.364
3.758IleVal: 3.758 ± 0.515
0.702IleTrp: 0.702 ± 0.176
0.802IleTyr: 0.802 ± 0.178
0.0IleXaa: 0.0 ± 0.0
Lys
3.858LysAla: 3.858 ± 0.522
0.251LysCys: 0.251 ± 0.125
1.152LysAsp: 1.152 ± 0.233
2.155LysGlu: 2.155 ± 0.329
1.052LysPhe: 1.052 ± 0.281
2.555LysGly: 2.555 ± 0.446
0.651LysHis: 0.651 ± 0.177
1.704LysIle: 1.704 ± 0.267
1.854LysLys: 1.854 ± 0.377
2.956LysLeu: 2.956 ± 0.419
0.852LysMet: 0.852 ± 0.255
0.902LysAsn: 0.902 ± 0.196
1.704LysPro: 1.704 ± 0.336
0.802LysGln: 0.802 ± 0.241
2.255LysArg: 2.255 ± 0.294
1.954LysSer: 1.954 ± 0.318
1.904LysThr: 1.904 ± 0.337
2.706LysVal: 2.706 ± 0.416
0.551LysTrp: 0.551 ± 0.182
0.702LysTyr: 0.702 ± 0.171
0.0LysXaa: 0.0 ± 0.0
Leu
9.671LeuAla: 9.671 ± 0.91
0.351LeuCys: 0.351 ± 0.123
5.311LeuAsp: 5.311 ± 0.511
5.963LeuGlu: 5.963 ± 0.612
1.754LeuPhe: 1.754 ± 0.375
6.314LeuGly: 6.314 ± 0.78
1.403LeuHis: 1.403 ± 0.313
4.009LeuIle: 4.009 ± 0.466
2.405LeuLys: 2.405 ± 0.348
5.662LeuLeu: 5.662 ± 0.645
1.704LeuMet: 1.704 ± 0.29
2.956LeuAsn: 2.956 ± 0.438
3.407LeuPro: 3.407 ± 0.475
1.704LeuGln: 1.704 ± 0.316
6.364LeuArg: 6.364 ± 0.58
4.86LeuSer: 4.86 ± 0.508
6.364LeuThr: 6.364 ± 0.665
4.359LeuVal: 4.359 ± 0.529
0.852LeuTrp: 0.852 ± 0.215
1.854LeuTyr: 1.854 ± 0.29
0.0LeuXaa: 0.0 ± 0.0
Met
2.806MetAla: 2.806 ± 0.429
0.1MetCys: 0.1 ± 0.063
0.852MetAsp: 0.852 ± 0.219
0.651MetGlu: 0.651 ± 0.173
0.351MetPhe: 0.351 ± 0.141
1.603MetGly: 1.603 ± 0.325
0.501MetHis: 0.501 ± 0.151
1.403MetIle: 1.403 ± 0.289
0.752MetLys: 0.752 ± 0.193
1.804MetLeu: 1.804 ± 0.32
0.451MetMet: 0.451 ± 0.159
1.002MetAsn: 1.002 ± 0.225
1.603MetPro: 1.603 ± 0.285
0.702MetGln: 0.702 ± 0.193
2.155MetArg: 2.155 ± 0.321
2.505MetSer: 2.505 ± 0.365
1.904MetThr: 1.904 ± 0.372
1.654MetVal: 1.654 ± 0.264
0.251MetTrp: 0.251 ± 0.106
0.451MetTyr: 0.451 ± 0.193
0.0MetXaa: 0.0 ± 0.0
Asn
3.057AsnAla: 3.057 ± 0.41
0.15AsnCys: 0.15 ± 0.095
2.054AsnAsp: 2.054 ± 0.378
1.603AsnGlu: 1.603 ± 0.277
0.702AsnPhe: 0.702 ± 0.225
3.157AsnGly: 3.157 ± 0.408
0.301AsnHis: 0.301 ± 0.14
1.052AsnIle: 1.052 ± 0.178
0.451AsnLys: 0.451 ± 0.146
2.205AsnLeu: 2.205 ± 0.526
0.351AsnMet: 0.351 ± 0.136
0.651AsnAsn: 0.651 ± 0.166
2.956AsnPro: 2.956 ± 0.807
0.852AsnGln: 0.852 ± 0.221
1.754AsnArg: 1.754 ± 0.384
1.453AsnSer: 1.453 ± 0.274
2.455AsnThr: 2.455 ± 0.357
2.155AsnVal: 2.155 ± 0.339
0.501AsnTrp: 0.501 ± 0.16
0.902AsnTyr: 0.902 ± 0.207
0.0AsnXaa: 0.0 ± 0.0
Pro
7.366ProAla: 7.366 ± 2.12
0.501ProCys: 0.501 ± 0.157
3.608ProAsp: 3.608 ± 0.639
4.409ProGlu: 4.409 ± 0.603
1.052ProPhe: 1.052 ± 0.24
5.161ProGly: 5.161 ± 0.767
0.852ProHis: 0.852 ± 0.239
2.105ProIle: 2.105 ± 0.315
1.603ProLys: 1.603 ± 0.324
3.708ProLeu: 3.708 ± 0.467
1.002ProMet: 1.002 ± 0.194
1.904ProAsn: 1.904 ± 0.436
3.006ProPro: 3.006 ± 0.431
2.004ProGln: 2.004 ± 0.628
3.006ProArg: 3.006 ± 0.446
3.006ProSer: 3.006 ± 0.42
4.109ProThr: 4.109 ± 0.578
4.81ProVal: 4.81 ± 0.595
1.052ProTrp: 1.052 ± 0.24
1.152ProTyr: 1.152 ± 0.266
0.0ProXaa: 0.0 ± 0.0
Gln
3.959GlnAla: 3.959 ± 0.414
0.15GlnCys: 0.15 ± 0.087
0.802GlnAsp: 0.802 ± 0.186
1.804GlnGlu: 1.804 ± 0.37
0.902GlnPhe: 0.902 ± 0.234
2.956GlnGly: 2.956 ± 0.65
0.702GlnHis: 0.702 ± 0.182
1.854GlnIle: 1.854 ± 0.317
0.902GlnLys: 0.902 ± 0.216
2.606GlnLeu: 2.606 ± 0.337
1.303GlnMet: 1.303 ± 0.306
1.203GlnAsn: 1.203 ± 0.295
1.954GlnPro: 1.954 ± 0.504
2.756GlnGln: 2.756 ± 1.087
2.806GlnArg: 2.806 ± 0.381
1.704GlnSer: 1.704 ± 0.227
1.954GlnThr: 1.954 ± 0.308
2.205GlnVal: 2.205 ± 0.381
0.702GlnTrp: 0.702 ± 0.153
0.902GlnTyr: 0.902 ± 0.227
0.0GlnXaa: 0.0 ± 0.0
Arg
6.013ArgAla: 6.013 ± 0.464
0.952ArgCys: 0.952 ± 0.244
3.457ArgAsp: 3.457 ± 0.51
5.061ArgGlu: 5.061 ± 0.498
2.455ArgPhe: 2.455 ± 0.409
4.961ArgGly: 4.961 ± 0.516
1.102ArgHis: 1.102 ± 0.254
3.357ArgIle: 3.357 ± 0.428
2.906ArgLys: 2.906 ± 0.494
5.011ArgLeu: 5.011 ± 0.461
2.455ArgMet: 2.455 ± 0.378
1.854ArgAsn: 1.854 ± 0.311
3.407ArgPro: 3.407 ± 0.525
2.706ArgGln: 2.706 ± 0.423
5.662ArgArg: 5.662 ± 0.695
3.608ArgSer: 3.608 ± 0.382
4.159ArgThr: 4.159 ± 0.39
4.961ArgVal: 4.961 ± 0.457
1.654ArgTrp: 1.654 ± 0.334
1.654ArgTyr: 1.654 ± 0.301
0.0ArgXaa: 0.0 ± 0.0
Ser
5.812SerAla: 5.812 ± 0.815
0.802SerCys: 0.802 ± 0.236
3.457SerAsp: 3.457 ± 0.427
3.407SerGlu: 3.407 ± 0.412
1.854SerPhe: 1.854 ± 0.276
4.86SerGly: 4.86 ± 0.621
0.902SerHis: 0.902 ± 0.245
2.555SerIle: 2.555 ± 0.404
2.105SerLys: 2.105 ± 0.315
4.76SerLeu: 4.76 ± 0.476
1.303SerMet: 1.303 ± 0.33
1.453SerAsn: 1.453 ± 0.306
3.307SerPro: 3.307 ± 0.374
1.704SerGln: 1.704 ± 0.298
4.359SerArg: 4.359 ± 0.49
4.159SerSer: 4.159 ± 0.754
5.412SerThr: 5.412 ± 0.712
3.959SerVal: 3.959 ± 0.412
0.802SerTrp: 0.802 ± 0.244
1.253SerTyr: 1.253 ± 0.196
0.0SerXaa: 0.0 ± 0.0
Thr
6.915ThrAla: 6.915 ± 0.784
0.251ThrCys: 0.251 ± 0.142
4.009ThrAsp: 4.009 ± 0.534
5.161ThrGlu: 5.161 ± 0.561
2.155ThrPhe: 2.155 ± 0.42
6.314ThrGly: 6.314 ± 0.613
1.152ThrHis: 1.152 ± 0.256
5.011ThrIle: 5.011 ± 0.722
2.455ThrLys: 2.455 ± 0.385
5.712ThrLeu: 5.712 ± 0.588
1.553ThrMet: 1.553 ± 0.308
1.253ThrAsn: 1.253 ± 0.279
4.86ThrPro: 4.86 ± 0.871
1.804ThrGln: 1.804 ± 0.272
4.46ThrArg: 4.46 ± 0.531
4.51ThrSer: 4.51 ± 0.741
4.46ThrThr: 4.46 ± 0.674
5.512ThrVal: 5.512 ± 0.605
1.854ThrTrp: 1.854 ± 0.319
1.553ThrTyr: 1.553 ± 0.333
0.0ThrXaa: 0.0 ± 0.0
Val
8.168ValAla: 8.168 ± 0.769
0.651ValCys: 0.651 ± 0.162
5.211ValAsp: 5.211 ± 0.512
5.311ValGlu: 5.311 ± 0.64
2.054ValPhe: 2.054 ± 0.345
6.163ValGly: 6.163 ± 0.611
1.303ValHis: 1.303 ± 0.267
4.109ValIle: 4.109 ± 0.397
1.954ValLys: 1.954 ± 0.41
5.762ValLeu: 5.762 ± 0.652
1.453ValMet: 1.453 ± 0.233
1.754ValAsn: 1.754 ± 0.287
5.362ValPro: 5.362 ± 1.303
2.205ValGln: 2.205 ± 0.303
4.46ValArg: 4.46 ± 0.463
3.959ValSer: 3.959 ± 0.463
5.963ValThr: 5.963 ± 0.595
5.762ValVal: 5.762 ± 0.53
1.754ValTrp: 1.754 ± 0.323
1.203ValTyr: 1.203 ± 0.252
0.0ValXaa: 0.0 ± 0.0
Trp
2.155TrpAla: 2.155 ± 0.356
0.15TrpCys: 0.15 ± 0.084
1.203TrpAsp: 1.203 ± 0.213
1.152TrpGlu: 1.152 ± 0.279
0.702TrpPhe: 0.702 ± 0.155
1.102TrpGly: 1.102 ± 0.236
0.401TrpHis: 0.401 ± 0.155
1.102TrpIle: 1.102 ± 0.221
0.702TrpLys: 0.702 ± 0.18
1.503TrpLeu: 1.503 ± 0.275
0.401TrpMet: 0.401 ± 0.134
0.301TrpAsn: 0.301 ± 0.11
0.852TrpPro: 0.852 ± 0.215
0.551TrpGln: 0.551 ± 0.173
1.353TrpArg: 1.353 ± 0.288
1.203TrpSer: 1.203 ± 0.238
2.105TrpThr: 2.105 ± 0.364
1.453TrpVal: 1.453 ± 0.244
0.702TrpTrp: 0.702 ± 0.263
0.401TrpTyr: 0.401 ± 0.126
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.656TyrAla: 2.656 ± 0.398
0.2TyrCys: 0.2 ± 0.091
2.205TyrAsp: 2.205 ± 0.316
2.105TyrGlu: 2.105 ± 0.299
0.501TyrPhe: 0.501 ± 0.161
2.706TyrGly: 2.706 ± 0.343
0.351TyrHis: 0.351 ± 0.122
0.852TyrIle: 0.852 ± 0.179
0.601TyrLys: 0.601 ± 0.181
1.253TyrLeu: 1.253 ± 0.214
0.351TyrMet: 0.351 ± 0.127
0.451TyrAsn: 0.451 ± 0.136
1.152TyrPro: 1.152 ± 0.241
0.802TyrGln: 0.802 ± 0.18
1.954TyrArg: 1.954 ± 0.336
0.952TyrSer: 0.952 ± 0.236
1.904TyrThr: 1.904 ± 0.323
2.255TyrVal: 2.255 ± 0.739
0.702TyrTrp: 0.702 ± 0.235
0.451TyrTyr: 0.451 ± 0.179
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 105 proteins (19958 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski