Amino acid dipepetide frequency for Arthrobacter phage Vibaki

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
26.366AlaAla: 26.366 ± 2.435
1.121AlaCys: 1.121 ± 0.328
10.217AlaAsp: 10.217 ± 1.051
10.151AlaGlu: 10.151 ± 0.987
4.219AlaPhe: 4.219 ± 0.661
10.81AlaGly: 10.81 ± 0.915
2.373AlaHis: 2.373 ± 0.479
4.944AlaIle: 4.944 ± 0.791
5.339AlaLys: 5.339 ± 0.822
11.997AlaLeu: 11.997 ± 1.047
3.625AlaMet: 3.625 ± 0.561
3.098AlaAsn: 3.098 ± 0.492
9.096AlaPro: 9.096 ± 1.095
3.691AlaGln: 3.691 ± 0.625
8.108AlaArg: 8.108 ± 0.93
5.405AlaSer: 5.405 ± 0.79
9.953AlaThr: 9.953 ± 1.195
8.767AlaVal: 8.767 ± 0.875
1.582AlaTrp: 1.582 ± 0.39
2.966AlaTyr: 2.966 ± 0.492
0.0AlaXaa: 0.0 ± 0.0
Cys
0.791CysAla: 0.791 ± 0.217
0.198CysCys: 0.198 ± 0.106
0.461CysAsp: 0.461 ± 0.207
0.132CysGlu: 0.132 ± 0.097
0.198CysPhe: 0.198 ± 0.104
0.593CysGly: 0.593 ± 0.23
0.198CysHis: 0.198 ± 0.117
0.198CysIle: 0.198 ± 0.112
0.33CysLys: 0.33 ± 0.188
0.461CysLeu: 0.461 ± 0.174
0.132CysMet: 0.132 ± 0.094
0.198CysAsn: 0.198 ± 0.123
0.857CysPro: 0.857 ± 0.292
0.33CysGln: 0.33 ± 0.171
0.264CysArg: 0.264 ± 0.154
0.461CysSer: 0.461 ± 0.166
0.264CysThr: 0.264 ± 0.127
0.461CysVal: 0.461 ± 0.18
0.132CysTrp: 0.132 ± 0.081
0.198CysTyr: 0.198 ± 0.1
0.0CysXaa: 0.0 ± 0.0
Asp
8.503AspAla: 8.503 ± 0.996
0.791AspCys: 0.791 ± 0.224
3.955AspAsp: 3.955 ± 0.479
3.494AspGlu: 3.494 ± 0.501
1.318AspPhe: 1.318 ± 0.374
6.657AspGly: 6.657 ± 0.647
1.252AspHis: 1.252 ± 0.271
0.923AspIle: 0.923 ± 0.211
2.307AspLys: 2.307 ± 0.405
7.383AspLeu: 7.383 ± 0.636
1.582AspMet: 1.582 ± 0.292
1.648AspAsn: 1.648 ± 0.305
3.955AspPro: 3.955 ± 0.52
2.439AspGln: 2.439 ± 0.344
3.032AspArg: 3.032 ± 0.536
2.307AspSer: 2.307 ± 0.417
3.889AspThr: 3.889 ± 0.555
4.482AspVal: 4.482 ± 0.563
1.582AspTrp: 1.582 ± 0.389
1.912AspTyr: 1.912 ± 0.35
0.0AspXaa: 0.0 ± 0.0
Glu
7.251GluAla: 7.251 ± 0.913
0.264GluCys: 0.264 ± 0.137
2.307GluAsp: 2.307 ± 0.385
2.241GluGlu: 2.241 ± 0.447
1.582GluPhe: 1.582 ± 0.37
3.823GluGly: 3.823 ± 0.491
0.989GluHis: 0.989 ± 0.235
3.428GluIle: 3.428 ± 0.48
1.055GluLys: 1.055 ± 0.244
5.998GluLeu: 5.998 ± 0.763
0.857GluMet: 0.857 ± 0.228
1.186GluAsn: 1.186 ± 0.256
3.164GluPro: 3.164 ± 0.584
3.625GluGln: 3.625 ± 0.571
3.296GluArg: 3.296 ± 0.534
2.571GluSer: 2.571 ± 0.468
4.153GluThr: 4.153 ± 0.556
3.23GluVal: 3.23 ± 0.408
1.121GluTrp: 1.121 ± 0.286
1.648GluTyr: 1.648 ± 0.31
0.0GluXaa: 0.0 ± 0.0
Phe
2.834PheAla: 2.834 ± 0.409
0.0PheCys: 0.0 ± 0.0
2.241PheAsp: 2.241 ± 0.439
2.439PheGlu: 2.439 ± 0.375
1.252PhePhe: 1.252 ± 0.343
2.703PheGly: 2.703 ± 0.421
0.264PheHis: 0.264 ± 0.135
1.055PheIle: 1.055 ± 0.255
0.857PheLys: 0.857 ± 0.223
1.121PheLeu: 1.121 ± 0.3
0.989PheMet: 0.989 ± 0.353
0.923PheAsn: 0.923 ± 0.275
1.45PhePro: 1.45 ± 0.333
0.593PheGln: 0.593 ± 0.229
1.384PheArg: 1.384 ± 0.287
1.45PheSer: 1.45 ± 0.28
2.439PheThr: 2.439 ± 0.328
2.307PheVal: 2.307 ± 0.389
0.395PheTrp: 0.395 ± 0.209
0.791PheTyr: 0.791 ± 0.245
0.0PheXaa: 0.0 ± 0.0
Gly
11.074GlyAla: 11.074 ± 1.148
0.198GlyCys: 0.198 ± 0.107
4.35GlyAsp: 4.35 ± 0.635
3.625GlyGlu: 3.625 ± 0.583
2.505GlyPhe: 2.505 ± 0.508
7.383GlyGly: 7.383 ± 1.006
2.175GlyHis: 2.175 ± 0.354
3.559GlyIle: 3.559 ± 0.495
4.284GlyLys: 4.284 ± 0.434
6.328GlyLeu: 6.328 ± 0.873
1.714GlyMet: 1.714 ± 0.331
3.494GlyAsn: 3.494 ± 0.704
4.284GlyPro: 4.284 ± 0.625
2.439GlyGln: 2.439 ± 0.354
5.998GlyArg: 5.998 ± 0.518
3.428GlySer: 3.428 ± 0.608
5.339GlyThr: 5.339 ± 0.572
5.998GlyVal: 5.998 ± 0.737
1.78GlyTrp: 1.78 ± 0.323
3.296GlyTyr: 3.296 ± 0.504
0.0GlyXaa: 0.0 ± 0.0
His
1.977HisAla: 1.977 ± 0.419
0.198HisCys: 0.198 ± 0.143
1.45HisAsp: 1.45 ± 0.254
1.121HisGlu: 1.121 ± 0.29
0.725HisPhe: 0.725 ± 0.236
1.318HisGly: 1.318 ± 0.301
0.857HisHis: 0.857 ± 0.266
0.725HisIle: 0.725 ± 0.224
0.395HisLys: 0.395 ± 0.163
1.516HisLeu: 1.516 ± 0.299
0.132HisMet: 0.132 ± 0.08
0.923HisAsn: 0.923 ± 0.243
1.516HisPro: 1.516 ± 0.367
0.659HisGln: 0.659 ± 0.218
1.846HisArg: 1.846 ± 0.497
0.791HisSer: 0.791 ± 0.248
0.923HisThr: 0.923 ± 0.304
1.45HisVal: 1.45 ± 0.277
0.264HisTrp: 0.264 ± 0.117
0.33HisTyr: 0.33 ± 0.162
0.0HisXaa: 0.0 ± 0.0
Ile
6.262IleAla: 6.262 ± 0.608
0.264IleCys: 0.264 ± 0.114
2.768IleAsp: 2.768 ± 0.446
3.559IleGlu: 3.559 ± 0.551
0.725IlePhe: 0.725 ± 0.212
3.23IleGly: 3.23 ± 0.628
0.923IleHis: 0.923 ± 0.263
1.912IleIle: 1.912 ± 0.303
1.121IleLys: 1.121 ± 0.227
1.912IleLeu: 1.912 ± 0.267
1.186IleMet: 1.186 ± 0.255
1.384IleAsn: 1.384 ± 0.308
2.703IlePro: 2.703 ± 0.387
0.725IleGln: 0.725 ± 0.245
2.9IleArg: 2.9 ± 0.373
2.373IleSer: 2.373 ± 0.308
3.428IleThr: 3.428 ± 0.503
3.757IleVal: 3.757 ± 0.642
0.132IleTrp: 0.132 ± 0.088
1.186IleTyr: 1.186 ± 0.275
0.0IleXaa: 0.0 ± 0.0
Lys
6.394LysAla: 6.394 ± 0.848
0.264LysCys: 0.264 ± 0.116
1.384LysAsp: 1.384 ± 0.266
1.318LysGlu: 1.318 ± 0.317
0.923LysPhe: 0.923 ± 0.23
3.164LysGly: 3.164 ± 0.447
0.593LysHis: 0.593 ± 0.191
1.45LysIle: 1.45 ± 0.267
1.78LysLys: 1.78 ± 0.411
3.691LysLeu: 3.691 ± 0.436
1.318LysMet: 1.318 ± 0.379
0.989LysAsn: 0.989 ± 0.264
3.032LysPro: 3.032 ± 0.58
1.582LysGln: 1.582 ± 0.373
2.768LysArg: 2.768 ± 0.408
1.186LysSer: 1.186 ± 0.289
3.494LysThr: 3.494 ± 0.6
2.703LysVal: 2.703 ± 0.501
0.527LysTrp: 0.527 ± 0.164
0.791LysTyr: 0.791 ± 0.266
0.0LysXaa: 0.0 ± 0.0
Leu
10.678LeuAla: 10.678 ± 0.949
0.132LeuCys: 0.132 ± 0.089
5.998LeuAsp: 5.998 ± 0.72
3.296LeuGlu: 3.296 ± 0.533
2.109LeuPhe: 2.109 ± 0.398
6.328LeuGly: 6.328 ± 0.697
1.648LeuHis: 1.648 ± 0.346
3.691LeuIle: 3.691 ± 0.53
2.966LeuLys: 2.966 ± 0.49
5.998LeuLeu: 5.998 ± 0.991
1.186LeuMet: 1.186 ± 0.375
2.768LeuAsn: 2.768 ± 0.435
4.746LeuPro: 4.746 ± 0.69
3.23LeuGln: 3.23 ± 0.533
5.207LeuArg: 5.207 ± 0.757
4.35LeuSer: 4.35 ± 0.599
7.185LeuThr: 7.185 ± 0.775
5.537LeuVal: 5.537 ± 0.568
0.857LeuTrp: 0.857 ± 0.246
1.318LeuTyr: 1.318 ± 0.314
0.0LeuXaa: 0.0 ± 0.0
Met
1.977MetAla: 1.977 ± 0.345
0.132MetCys: 0.132 ± 0.08
0.791MetAsp: 0.791 ± 0.215
0.791MetGlu: 0.791 ± 0.216
0.264MetPhe: 0.264 ± 0.13
0.923MetGly: 0.923 ± 0.245
0.395MetHis: 0.395 ± 0.193
1.055MetIle: 1.055 ± 0.287
1.055MetLys: 1.055 ± 0.261
1.384MetLeu: 1.384 ± 0.299
0.264MetMet: 0.264 ± 0.117
0.461MetAsn: 0.461 ± 0.152
2.043MetPro: 2.043 ± 0.36
0.395MetGln: 0.395 ± 0.203
1.384MetArg: 1.384 ± 0.33
1.78MetSer: 1.78 ± 0.322
3.494MetThr: 3.494 ± 0.452
1.252MetVal: 1.252 ± 0.29
0.198MetTrp: 0.198 ± 0.119
0.33MetTyr: 0.33 ± 0.166
0.0MetXaa: 0.0 ± 0.0
Asn
4.416AsnAla: 4.416 ± 0.55
0.264AsnCys: 0.264 ± 0.129
1.846AsnAsp: 1.846 ± 0.327
0.593AsnGlu: 0.593 ± 0.181
0.923AsnPhe: 0.923 ± 0.295
3.559AsnGly: 3.559 ± 0.553
0.198AsnHis: 0.198 ± 0.135
1.055AsnIle: 1.055 ± 0.299
1.121AsnLys: 1.121 ± 0.274
2.505AsnLeu: 2.505 ± 0.405
0.395AsnMet: 0.395 ± 0.161
0.923AsnAsn: 0.923 ± 0.286
2.834AsnPro: 2.834 ± 0.407
0.593AsnGln: 0.593 ± 0.191
1.714AsnArg: 1.714 ± 0.325
1.121AsnSer: 1.121 ± 0.233
1.977AsnThr: 1.977 ± 0.416
1.516AsnVal: 1.516 ± 0.333
0.857AsnTrp: 0.857 ± 0.231
0.659AsnTyr: 0.659 ± 0.212
0.0AsnXaa: 0.0 ± 0.0
Pro
11.733ProAla: 11.733 ± 0.947
0.527ProCys: 0.527 ± 0.208
3.559ProAsp: 3.559 ± 0.519
4.878ProGlu: 4.878 ± 0.669
1.648ProPhe: 1.648 ± 0.351
5.932ProGly: 5.932 ± 0.757
1.055ProHis: 1.055 ± 0.222
2.505ProIle: 2.505 ± 0.39
3.362ProLys: 3.362 ± 0.682
3.757ProLeu: 3.757 ± 0.477
0.791ProMet: 0.791 ± 0.228
1.714ProAsn: 1.714 ± 0.39
3.032ProPro: 3.032 ± 0.689
1.318ProGln: 1.318 ± 0.308
4.021ProArg: 4.021 ± 0.531
2.834ProSer: 2.834 ± 0.506
4.416ProThr: 4.416 ± 0.602
5.075ProVal: 5.075 ± 0.637
0.593ProTrp: 0.593 ± 0.2
0.659ProTyr: 0.659 ± 0.241
0.0ProXaa: 0.0 ± 0.0
Gln
5.669GlnAla: 5.669 ± 0.777
0.198GlnCys: 0.198 ± 0.127
1.186GlnAsp: 1.186 ± 0.215
1.121GlnGlu: 1.121 ± 0.282
1.45GlnPhe: 1.45 ± 0.322
2.043GlnGly: 2.043 ± 0.351
0.791GlnHis: 0.791 ± 0.272
2.505GlnIle: 2.505 ± 0.361
0.791GlnLys: 0.791 ± 0.233
2.571GlnLeu: 2.571 ± 0.352
0.659GlnMet: 0.659 ± 0.219
0.923GlnAsn: 0.923 ± 0.226
2.373GlnPro: 2.373 ± 0.413
1.582GlnGln: 1.582 ± 0.389
2.439GlnArg: 2.439 ± 0.473
1.648GlnSer: 1.648 ± 0.294
1.45GlnThr: 1.45 ± 0.276
2.373GlnVal: 2.373 ± 0.433
0.527GlnTrp: 0.527 ± 0.184
0.857GlnTyr: 0.857 ± 0.255
0.0GlnXaa: 0.0 ± 0.0
Arg
8.437ArgAla: 8.437 ± 0.702
0.725ArgCys: 0.725 ± 0.231
4.812ArgAsp: 4.812 ± 0.601
2.571ArgGlu: 2.571 ± 0.405
1.714ArgPhe: 1.714 ± 0.309
4.021ArgGly: 4.021 ± 0.548
1.516ArgHis: 1.516 ± 0.391
1.912ArgIle: 1.912 ± 0.336
2.966ArgLys: 2.966 ± 0.641
5.471ArgLeu: 5.471 ± 0.556
1.45ArgMet: 1.45 ± 0.348
1.318ArgAsn: 1.318 ± 0.356
3.889ArgPro: 3.889 ± 0.699
3.032ArgGln: 3.032 ± 0.484
4.944ArgArg: 4.944 ± 0.636
3.362ArgSer: 3.362 ± 0.438
4.35ArgThr: 4.35 ± 0.663
3.23ArgVal: 3.23 ± 0.585
0.923ArgTrp: 0.923 ± 0.234
2.307ArgTyr: 2.307 ± 0.355
0.0ArgXaa: 0.0 ± 0.0
Ser
6.657SerAla: 6.657 ± 0.931
0.198SerCys: 0.198 ± 0.115
2.109SerAsp: 2.109 ± 0.293
1.912SerGlu: 1.912 ± 0.337
1.318SerPhe: 1.318 ± 0.258
5.471SerGly: 5.471 ± 0.71
0.725SerHis: 0.725 ± 0.164
1.912SerIle: 1.912 ± 0.388
1.45SerLys: 1.45 ± 0.357
3.691SerLeu: 3.691 ± 0.576
0.857SerMet: 0.857 ± 0.253
1.648SerAsn: 1.648 ± 0.273
2.241SerPro: 2.241 ± 0.349
1.186SerGln: 1.186 ± 0.263
3.032SerArg: 3.032 ± 0.458
2.373SerSer: 2.373 ± 0.364
3.889SerThr: 3.889 ± 0.588
3.625SerVal: 3.625 ± 0.6
0.989SerTrp: 0.989 ± 0.268
1.384SerTyr: 1.384 ± 0.412
0.0SerXaa: 0.0 ± 0.0
Thr
10.217ThrAla: 10.217 ± 1.149
0.593ThrCys: 0.593 ± 0.226
4.087ThrAsp: 4.087 ± 0.438
4.219ThrGlu: 4.219 ± 0.597
1.846ThrPhe: 1.846 ± 0.288
6.328ThrGly: 6.328 ± 0.628
1.252ThrHis: 1.252 ± 0.256
4.284ThrIle: 4.284 ± 0.505
2.703ThrLys: 2.703 ± 0.5
5.01ThrLeu: 5.01 ± 0.54
1.186ThrMet: 1.186 ± 0.279
1.846ThrAsn: 1.846 ± 0.307
6.46ThrPro: 6.46 ± 0.663
1.78ThrGln: 1.78 ± 0.38
4.35ThrArg: 4.35 ± 0.543
3.757ThrSer: 3.757 ± 0.494
6.526ThrThr: 6.526 ± 1.028
6.064ThrVal: 6.064 ± 0.698
1.121ThrTrp: 1.121 ± 0.322
1.318ThrTyr: 1.318 ± 0.224
0.0ThrXaa: 0.0 ± 0.0
Val
8.239ValAla: 8.239 ± 0.779
0.527ValCys: 0.527 ± 0.171
6.789ValAsp: 6.789 ± 0.643
4.35ValGlu: 4.35 ± 0.527
1.582ValPhe: 1.582 ± 0.267
4.944ValGly: 4.944 ± 0.899
0.857ValHis: 0.857 ± 0.266
3.625ValIle: 3.625 ± 0.469
3.757ValLys: 3.757 ± 0.595
5.01ValLeu: 5.01 ± 0.564
1.121ValMet: 1.121 ± 0.241
1.78ValAsn: 1.78 ± 0.335
3.691ValPro: 3.691 ± 0.417
2.439ValGln: 2.439 ± 0.39
3.559ValArg: 3.559 ± 0.512
3.098ValSer: 3.098 ± 0.485
5.537ValThr: 5.537 ± 0.796
5.075ValVal: 5.075 ± 0.547
1.648ValTrp: 1.648 ± 0.334
1.846ValTyr: 1.846 ± 0.384
0.0ValXaa: 0.0 ± 0.0
Trp
1.714TrpAla: 1.714 ± 0.354
0.066TrpCys: 0.066 ± 0.07
1.318TrpAsp: 1.318 ± 0.37
0.593TrpGlu: 0.593 ± 0.17
0.527TrpPhe: 0.527 ± 0.21
1.252TrpGly: 1.252 ± 0.313
0.395TrpHis: 0.395 ± 0.207
1.252TrpIle: 1.252 ± 0.379
0.659TrpLys: 0.659 ± 0.183
1.318TrpLeu: 1.318 ± 0.335
0.132TrpMet: 0.132 ± 0.093
0.989TrpAsn: 0.989 ± 0.465
0.527TrpPro: 0.527 ± 0.177
0.527TrpGln: 0.527 ± 0.187
0.989TrpArg: 0.989 ± 0.241
0.857TrpSer: 0.857 ± 0.223
0.989TrpThr: 0.989 ± 0.27
0.923TrpVal: 0.923 ± 0.275
0.527TrpTrp: 0.527 ± 0.176
0.593TrpTyr: 0.593 ± 0.189
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.703TyrAla: 2.703 ± 0.49
0.198TyrCys: 0.198 ± 0.109
1.78TyrAsp: 1.78 ± 0.343
1.384TyrGlu: 1.384 ± 0.286
0.659TyrPhe: 0.659 ± 0.205
2.768TyrGly: 2.768 ± 0.485
0.659TyrHis: 0.659 ± 0.211
0.461TyrIle: 0.461 ± 0.177
1.055TyrLys: 1.055 ± 0.263
2.109TyrLeu: 2.109 ± 0.368
0.725TyrMet: 0.725 ± 0.188
0.857TyrAsn: 0.857 ± 0.231
1.516TyrPro: 1.516 ± 0.381
0.989TyrGln: 0.989 ± 0.308
1.714TyrArg: 1.714 ± 0.371
1.516TyrSer: 1.516 ± 0.311
1.252TyrThr: 1.252 ± 0.234
1.648TyrVal: 1.648 ± 0.455
0.395TyrTrp: 0.395 ± 0.154
0.659TyrTyr: 0.659 ± 0.214
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 78 proteins (15172 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski