Amino acid dipepetide frequency for Microbacterium phage Pioneer3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.228AlaAla: 12.228 ± 0.984
0.476AlaCys: 0.476 ± 0.177
4.923AlaAsp: 4.923 ± 0.531
8.576AlaGlu: 8.576 ± 0.88
3.441AlaPhe: 3.441 ± 0.453
7.887AlaGly: 7.887 ± 0.954
1.588AlaHis: 1.588 ± 0.357
4.817AlaIle: 4.817 ± 0.488
4.658AlaLys: 4.658 ± 0.523
8.999AlaLeu: 8.999 ± 0.859
3.176AlaMet: 3.176 ± 0.403
4.605AlaAsn: 4.605 ± 0.579
3.864AlaPro: 3.864 ± 0.392
3.917AlaGln: 3.917 ± 0.448
7.517AlaArg: 7.517 ± 0.718
5.241AlaSer: 5.241 ± 0.504
5.876AlaThr: 5.876 ± 0.661
6.458AlaVal: 6.458 ± 0.678
1.694AlaTrp: 1.694 ± 0.297
2.7AlaTyr: 2.7 ± 0.331
0.0AlaXaa: 0.0 ± 0.0
Cys
0.371CysAla: 0.371 ± 0.138
0.053CysCys: 0.053 ± 0.063
0.371CysAsp: 0.371 ± 0.176
0.318CysGlu: 0.318 ± 0.156
0.212CysPhe: 0.212 ± 0.086
0.423CysGly: 0.423 ± 0.183
0.106CysHis: 0.106 ± 0.073
0.318CysIle: 0.318 ± 0.128
0.476CysLys: 0.476 ± 0.173
0.106CysLeu: 0.106 ± 0.072
0.0CysMet: 0.0 ± 0.0
0.053CysAsn: 0.053 ± 0.05
0.106CysPro: 0.106 ± 0.07
0.159CysGln: 0.159 ± 0.091
0.265CysArg: 0.265 ± 0.109
0.212CysSer: 0.212 ± 0.1
0.265CysThr: 0.265 ± 0.132
0.106CysVal: 0.106 ± 0.084
0.106CysTrp: 0.106 ± 0.075
0.159CysTyr: 0.159 ± 0.103
0.0CysXaa: 0.0 ± 0.0
Asp
6.458AspAla: 6.458 ± 0.704
0.371AspCys: 0.371 ± 0.148
4.394AspAsp: 4.394 ± 0.545
5.823AspGlu: 5.823 ± 0.598
2.012AspPhe: 2.012 ± 0.305
5.558AspGly: 5.558 ± 0.534
0.847AspHis: 0.847 ± 0.263
2.7AspIle: 2.7 ± 0.372
2.7AspLys: 2.7 ± 0.562
5.188AspLeu: 5.188 ± 0.459
1.588AspMet: 1.588 ± 0.373
1.747AspAsn: 1.747 ± 0.29
3.494AspPro: 3.494 ± 0.448
1.429AspGln: 1.429 ± 0.267
3.441AspArg: 3.441 ± 0.572
4.182AspSer: 4.182 ± 0.454
3.123AspThr: 3.123 ± 0.398
4.711AspVal: 4.711 ± 0.532
1.323AspTrp: 1.323 ± 0.272
2.276AspTyr: 2.276 ± 0.334
0.0AspXaa: 0.0 ± 0.0
Glu
7.252GluAla: 7.252 ± 0.766
0.582GluCys: 0.582 ± 0.21
4.235GluAsp: 4.235 ± 0.525
5.029GluGlu: 5.029 ± 0.615
2.753GluPhe: 2.753 ± 0.36
5.135GluGly: 5.135 ± 0.605
1.694GluHis: 1.694 ± 0.31
4.658GluIle: 4.658 ± 0.57
4.023GluLys: 4.023 ± 0.498
4.605GluLeu: 4.605 ± 0.54
1.8GluMet: 1.8 ± 0.362
2.594GluAsn: 2.594 ± 0.37
2.7GluPro: 2.7 ± 0.4
3.176GluGln: 3.176 ± 0.366
5.876GluArg: 5.876 ± 0.698
3.705GluSer: 3.705 ± 0.477
4.023GluThr: 4.023 ± 0.448
6.14GluVal: 6.14 ± 0.718
1.323GluTrp: 1.323 ± 0.244
2.435GluTyr: 2.435 ± 0.413
0.0GluXaa: 0.0 ± 0.0
Phe
3.388PheAla: 3.388 ± 0.359
0.053PheCys: 0.053 ± 0.051
2.594PheAsp: 2.594 ± 0.377
2.7PheGlu: 2.7 ± 0.442
0.794PhePhe: 0.794 ± 0.195
2.7PheGly: 2.7 ± 0.379
0.529PheHis: 0.529 ± 0.18
1.641PheIle: 1.641 ± 0.229
1.218PheLys: 1.218 ± 0.227
1.8PheLeu: 1.8 ± 0.31
1.218PheMet: 1.218 ± 0.237
1.376PheAsn: 1.376 ± 0.289
1.323PhePro: 1.323 ± 0.221
1.006PheGln: 1.006 ± 0.26
2.064PheArg: 2.064 ± 0.31
2.012PheSer: 2.012 ± 0.427
2.541PheThr: 2.541 ± 0.383
2.647PheVal: 2.647 ± 0.421
0.741PheTrp: 0.741 ± 0.187
0.953PheTyr: 0.953 ± 0.254
0.0PheXaa: 0.0 ± 0.0
Gly
6.776GlyAla: 6.776 ± 0.684
0.423GlyCys: 0.423 ± 0.147
6.035GlyAsp: 6.035 ± 1.096
6.299GlyGlu: 6.299 ± 0.9
3.811GlyPhe: 3.811 ± 0.505
7.993GlyGly: 7.993 ± 0.913
1.323GlyHis: 1.323 ± 0.245
4.076GlyIle: 4.076 ± 0.494
2.488GlyLys: 2.488 ± 0.327
6.511GlyLeu: 6.511 ± 0.707
2.17GlyMet: 2.17 ± 0.339
2.7GlyAsn: 2.7 ± 0.432
3.07GlyPro: 3.07 ± 0.809
3.864GlyGln: 3.864 ± 0.494
6.458GlyArg: 6.458 ± 0.613
4.817GlySer: 4.817 ± 0.444
5.876GlyThr: 5.876 ± 0.688
6.776GlyVal: 6.776 ± 0.608
1.588GlyTrp: 1.588 ± 0.348
2.647GlyTyr: 2.647 ± 0.311
0.0GlyXaa: 0.0 ± 0.0
His
1.588HisAla: 1.588 ± 0.288
0.053HisCys: 0.053 ± 0.058
1.27HisAsp: 1.27 ± 0.296
1.059HisGlu: 1.059 ± 0.233
0.688HisPhe: 0.688 ± 0.188
1.006HisGly: 1.006 ± 0.255
0.476HisHis: 0.476 ± 0.13
1.006HisIle: 1.006 ± 0.244
0.741HisLys: 0.741 ± 0.205
1.059HisLeu: 1.059 ± 0.219
0.423HisMet: 0.423 ± 0.148
0.582HisAsn: 0.582 ± 0.168
1.112HisPro: 1.112 ± 0.318
0.635HisGln: 0.635 ± 0.189
1.006HisArg: 1.006 ± 0.205
0.953HisSer: 0.953 ± 0.234
1.535HisThr: 1.535 ± 0.299
1.27HisVal: 1.27 ± 0.263
0.476HisTrp: 0.476 ± 0.122
0.582HisTyr: 0.582 ± 0.166
0.0HisXaa: 0.0 ± 0.0
Ile
6.088IleAla: 6.088 ± 0.646
0.159IleCys: 0.159 ± 0.103
3.811IleAsp: 3.811 ± 0.412
5.294IleGlu: 5.294 ± 0.479
1.165IlePhe: 1.165 ± 0.215
3.229IleGly: 3.229 ± 0.397
0.741IleHis: 0.741 ± 0.198
2.276IleIle: 2.276 ± 0.327
2.7IleLys: 2.7 ± 0.41
2.964IleLeu: 2.964 ± 0.432
0.9IleMet: 0.9 ± 0.205
2.117IleAsn: 2.117 ± 0.357
2.435IlePro: 2.435 ± 0.401
2.223IleGln: 2.223 ± 0.321
3.6IleArg: 3.6 ± 0.491
2.911IleSer: 2.911 ± 0.337
3.229IleThr: 3.229 ± 0.392
3.229IleVal: 3.229 ± 0.399
0.847IleTrp: 0.847 ± 0.18
1.535IleTyr: 1.535 ± 0.304
0.0IleXaa: 0.0 ± 0.0
Lys
5.505LysAla: 5.505 ± 0.704
0.0LysCys: 0.0 ± 0.0
2.117LysAsp: 2.117 ± 0.415
2.329LysGlu: 2.329 ± 0.418
1.641LysPhe: 1.641 ± 0.356
3.547LysGly: 3.547 ± 0.755
0.847LysHis: 0.847 ± 0.199
2.382LysIle: 2.382 ± 0.404
3.335LysLys: 3.335 ± 0.594
3.388LysLeu: 3.388 ± 0.522
0.953LysMet: 0.953 ± 0.239
0.953LysAsn: 0.953 ± 0.256
1.747LysPro: 1.747 ± 0.384
1.429LysGln: 1.429 ± 0.273
2.911LysArg: 2.911 ± 0.54
2.012LysSer: 2.012 ± 0.415
2.753LysThr: 2.753 ± 0.451
2.964LysVal: 2.964 ± 0.417
0.847LysTrp: 0.847 ± 0.212
1.535LysTyr: 1.535 ± 0.335
0.0LysXaa: 0.0 ± 0.0
Leu
7.411LeuAla: 7.411 ± 0.618
0.159LeuCys: 0.159 ± 0.084
4.658LeuAsp: 4.658 ± 0.474
4.605LeuGlu: 4.605 ± 0.478
1.853LeuPhe: 1.853 ± 0.282
6.193LeuGly: 6.193 ± 0.799
1.323LeuHis: 1.323 ± 0.259
3.547LeuIle: 3.547 ± 0.523
2.541LeuLys: 2.541 ± 0.365
4.023LeuLeu: 4.023 ± 0.554
1.906LeuMet: 1.906 ± 0.355
2.964LeuAsn: 2.964 ± 0.517
3.97LeuPro: 3.97 ± 0.388
2.17LeuGln: 2.17 ± 0.432
5.611LeuArg: 5.611 ± 0.515
4.764LeuSer: 4.764 ± 0.428
4.499LeuThr: 4.499 ± 0.513
4.288LeuVal: 4.288 ± 0.427
1.482LeuTrp: 1.482 ± 0.256
1.959LeuTyr: 1.959 ± 0.303
0.0LeuXaa: 0.0 ± 0.0
Met
2.435MetAla: 2.435 ± 0.359
0.159MetCys: 0.159 ± 0.101
1.323MetAsp: 1.323 ± 0.396
1.323MetGlu: 1.323 ± 0.31
0.9MetPhe: 0.9 ± 0.191
1.8MetGly: 1.8 ± 0.33
0.529MetHis: 0.529 ± 0.155
0.847MetIle: 0.847 ± 0.319
0.847MetLys: 0.847 ± 0.226
1.059MetLeu: 1.059 ± 0.226
0.529MetMet: 0.529 ± 0.15
1.059MetAsn: 1.059 ± 0.237
1.27MetPro: 1.27 ± 0.319
0.688MetGln: 0.688 ± 0.191
2.012MetArg: 2.012 ± 0.34
2.276MetSer: 2.276 ± 0.341
2.223MetThr: 2.223 ± 0.367
1.429MetVal: 1.429 ± 0.332
0.529MetTrp: 0.529 ± 0.172
0.476MetTyr: 0.476 ± 0.151
0.0MetXaa: 0.0 ± 0.0
Asn
3.653AsnAla: 3.653 ± 0.364
0.0AsnCys: 0.0 ± 0.0
1.376AsnAsp: 1.376 ± 0.206
2.329AsnGlu: 2.329 ± 0.376
1.006AsnPhe: 1.006 ± 0.179
4.87AsnGly: 4.87 ± 0.564
0.794AsnHis: 0.794 ± 0.246
1.588AsnIle: 1.588 ± 0.275
0.741AsnLys: 0.741 ± 0.247
2.382AsnLeu: 2.382 ± 0.317
1.006AsnMet: 1.006 ± 0.279
1.906AsnAsn: 1.906 ± 0.369
2.488AsnPro: 2.488 ± 0.309
1.218AsnGln: 1.218 ± 0.308
2.806AsnArg: 2.806 ± 0.466
2.435AsnSer: 2.435 ± 0.384
2.012AsnThr: 2.012 ± 0.335
3.176AsnVal: 3.176 ± 0.38
0.688AsnTrp: 0.688 ± 0.166
1.429AsnTyr: 1.429 ± 0.351
0.0AsnXaa: 0.0 ± 0.0
Pro
3.917ProAla: 3.917 ± 0.566
0.106ProCys: 0.106 ± 0.1
3.441ProAsp: 3.441 ± 0.55
3.758ProGlu: 3.758 ± 0.523
1.429ProPhe: 1.429 ± 0.306
4.023ProGly: 4.023 ± 0.667
1.059ProHis: 1.059 ± 0.259
2.594ProIle: 2.594 ± 0.435
2.647ProLys: 2.647 ± 0.444
2.911ProLeu: 2.911 ± 0.417
0.794ProMet: 0.794 ± 0.23
1.853ProAsn: 1.853 ± 0.292
1.694ProPro: 1.694 ± 0.332
1.376ProGln: 1.376 ± 0.299
1.588ProArg: 1.588 ± 0.333
2.7ProSer: 2.7 ± 0.464
3.017ProThr: 3.017 ± 0.537
3.811ProVal: 3.811 ± 0.444
0.953ProTrp: 0.953 ± 0.29
1.165ProTyr: 1.165 ± 0.24
0.0ProXaa: 0.0 ± 0.0
Gln
3.441GlnAla: 3.441 ± 0.474
0.053GlnCys: 0.053 ± 0.052
2.117GlnAsp: 2.117 ± 0.288
2.382GlnGlu: 2.382 ± 0.326
1.112GlnPhe: 1.112 ± 0.261
3.388GlnGly: 3.388 ± 0.655
0.476GlnHis: 0.476 ± 0.19
2.064GlnIle: 2.064 ± 0.347
1.535GlnLys: 1.535 ± 0.301
2.276GlnLeu: 2.276 ± 0.367
0.635GlnMet: 0.635 ± 0.202
1.482GlnAsn: 1.482 ± 0.242
1.165GlnPro: 1.165 ± 0.255
1.059GlnGln: 1.059 ± 0.284
2.382GlnArg: 2.382 ± 0.353
1.959GlnSer: 1.959 ± 0.375
1.535GlnThr: 1.535 ± 0.305
3.494GlnVal: 3.494 ± 0.466
0.423GlnTrp: 0.423 ± 0.171
0.741GlnTyr: 0.741 ± 0.183
0.0GlnXaa: 0.0 ± 0.0
Arg
7.093ArgAla: 7.093 ± 0.736
0.476ArgCys: 0.476 ± 0.164
4.447ArgAsp: 4.447 ± 0.514
4.923ArgGlu: 4.923 ± 0.501
2.647ArgPhe: 2.647 ± 0.318
5.77ArgGly: 5.77 ± 0.742
1.27ArgHis: 1.27 ± 0.304
4.341ArgIle: 4.341 ± 0.539
2.859ArgLys: 2.859 ± 0.51
5.717ArgLeu: 5.717 ± 0.509
1.588ArgMet: 1.588 ± 0.321
3.017ArgAsn: 3.017 ± 0.363
2.859ArgPro: 2.859 ± 0.43
1.906ArgGln: 1.906 ± 0.313
6.352ArgArg: 6.352 ± 0.683
2.806ArgSer: 2.806 ± 0.376
3.335ArgThr: 3.335 ± 0.319
5.082ArgVal: 5.082 ± 0.537
1.535ArgTrp: 1.535 ± 0.275
2.012ArgTyr: 2.012 ± 0.302
0.0ArgXaa: 0.0 ± 0.0
Ser
5.188SerAla: 5.188 ± 0.521
0.106SerCys: 0.106 ± 0.067
3.864SerAsp: 3.864 ± 0.446
3.335SerGlu: 3.335 ± 0.463
2.012SerPhe: 2.012 ± 0.277
5.823SerGly: 5.823 ± 0.602
1.112SerHis: 1.112 ± 0.257
3.176SerIle: 3.176 ± 0.398
3.017SerLys: 3.017 ± 0.358
4.341SerLeu: 4.341 ± 0.529
1.429SerMet: 1.429 ± 0.32
2.064SerAsn: 2.064 ± 0.354
2.541SerPro: 2.541 ± 0.435
1.323SerGln: 1.323 ± 0.32
3.864SerArg: 3.864 ± 0.476
3.282SerSer: 3.282 ± 0.493
4.235SerThr: 4.235 ± 0.585
4.711SerVal: 4.711 ± 0.509
1.218SerTrp: 1.218 ± 0.29
1.535SerTyr: 1.535 ± 0.316
0.0SerXaa: 0.0 ± 0.0
Thr
6.617ThrAla: 6.617 ± 0.758
0.371ThrCys: 0.371 ± 0.155
3.758ThrAsp: 3.758 ± 0.482
4.129ThrGlu: 4.129 ± 0.417
2.964ThrPhe: 2.964 ± 0.395
6.67ThrGly: 6.67 ± 0.874
1.165ThrHis: 1.165 ± 0.294
2.911ThrIle: 2.911 ± 0.414
2.064ThrLys: 2.064 ± 0.344
4.711ThrLeu: 4.711 ± 0.601
1.006ThrMet: 1.006 ± 0.22
2.223ThrAsn: 2.223 ± 0.335
3.176ThrPro: 3.176 ± 0.444
2.064ThrGln: 2.064 ± 0.346
3.017ThrArg: 3.017 ± 0.371
3.494ThrSer: 3.494 ± 0.507
3.758ThrThr: 3.758 ± 0.437
4.923ThrVal: 4.923 ± 0.732
1.059ThrTrp: 1.059 ± 0.238
2.276ThrTyr: 2.276 ± 0.411
0.0ThrXaa: 0.0 ± 0.0
Val
7.676ValAla: 7.676 ± 0.71
0.371ValCys: 0.371 ± 0.164
5.294ValAsp: 5.294 ± 0.499
6.035ValGlu: 6.035 ± 0.677
1.588ValPhe: 1.588 ± 0.316
6.035ValGly: 6.035 ± 0.571
1.059ValHis: 1.059 ± 0.239
4.341ValIle: 4.341 ± 0.468
3.547ValLys: 3.547 ± 0.551
4.87ValLeu: 4.87 ± 0.533
1.482ValMet: 1.482 ± 0.243
2.594ValAsn: 2.594 ± 0.363
3.494ValPro: 3.494 ± 0.566
2.276ValGln: 2.276 ± 0.346
5.346ValArg: 5.346 ± 0.461
5.135ValSer: 5.135 ± 0.596
5.346ValThr: 5.346 ± 0.632
6.088ValVal: 6.088 ± 0.564
1.165ValTrp: 1.165 ± 0.259
1.8ValTyr: 1.8 ± 0.284
0.0ValXaa: 0.0 ± 0.0
Trp
2.012TrpAla: 2.012 ± 0.343
0.0TrpCys: 0.0 ± 0.0
1.323TrpAsp: 1.323 ± 0.272
1.165TrpGlu: 1.165 ± 0.223
0.635TrpPhe: 0.635 ± 0.198
1.959TrpGly: 1.959 ± 0.35
0.265TrpHis: 0.265 ± 0.122
0.953TrpIle: 0.953 ± 0.207
0.318TrpLys: 0.318 ± 0.128
0.9TrpLeu: 0.9 ± 0.18
0.476TrpMet: 0.476 ± 0.138
0.794TrpAsn: 0.794 ± 0.222
0.741TrpPro: 0.741 ± 0.179
0.582TrpGln: 0.582 ± 0.187
1.853TrpArg: 1.853 ± 0.294
1.218TrpSer: 1.218 ± 0.278
1.112TrpThr: 1.112 ± 0.256
1.8TrpVal: 1.8 ± 0.306
0.476TrpTrp: 0.476 ± 0.132
0.476TrpTyr: 0.476 ± 0.155
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.6TyrAla: 3.6 ± 0.418
0.212TyrCys: 0.212 ± 0.111
2.012TyrAsp: 2.012 ± 0.347
2.329TyrGlu: 2.329 ± 0.349
0.635TyrPhe: 0.635 ± 0.204
1.694TyrGly: 1.694 ± 0.308
0.318TyrHis: 0.318 ± 0.128
1.218TyrIle: 1.218 ± 0.235
0.741TyrLys: 0.741 ± 0.18
2.117TyrLeu: 2.117 ± 0.342
0.582TyrMet: 0.582 ± 0.208
1.218TyrAsn: 1.218 ± 0.253
1.588TyrPro: 1.588 ± 0.275
1.218TyrGln: 1.218 ± 0.258
2.064TyrArg: 2.064 ± 0.335
2.064TyrSer: 2.064 ± 0.36
2.064TyrThr: 2.064 ± 0.345
2.435TyrVal: 2.435 ± 0.422
0.582TyrTrp: 0.582 ± 0.197
0.953TyrTyr: 0.953 ± 0.225
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 108 proteins (18892 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski