Amino acid dipepetide frequency for Microbacterium phage Appa

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.62AlaAla: 17.62 ± 1.356
0.397AlaCys: 0.397 ± 0.201
8.096AlaAsp: 8.096 ± 0.627
9.683AlaGlu: 9.683 ± 0.747
2.778AlaPhe: 2.778 ± 0.586
9.286AlaGly: 9.286 ± 1.081
1.826AlaHis: 1.826 ± 0.377
6.191AlaIle: 6.191 ± 0.784
4.445AlaLys: 4.445 ± 0.715
11.985AlaLeu: 11.985 ± 0.876
3.175AlaMet: 3.175 ± 0.461
3.572AlaAsn: 3.572 ± 0.479
6.032AlaPro: 6.032 ± 1.038
4.127AlaGln: 4.127 ± 0.447
8.89AlaArg: 8.89 ± 1.214
6.905AlaSer: 6.905 ± 0.598
7.143AlaThr: 7.143 ± 0.69
7.858AlaVal: 7.858 ± 0.83
2.461AlaTrp: 2.461 ± 0.419
1.984AlaTyr: 1.984 ± 0.366
0.0AlaXaa: 0.0 ± 0.0
Cys
0.556CysAla: 0.556 ± 0.243
0.0CysCys: 0.0 ± 0.0
0.476CysAsp: 0.476 ± 0.192
0.238CysGlu: 0.238 ± 0.131
0.0CysPhe: 0.0 ± 0.0
0.794CysGly: 0.794 ± 0.257
0.238CysHis: 0.238 ± 0.141
0.159CysIle: 0.159 ± 0.096
0.159CysLys: 0.159 ± 0.113
0.159CysLeu: 0.159 ± 0.098
0.079CysMet: 0.079 ± 0.085
0.0CysAsn: 0.0 ± 0.0
0.397CysPro: 0.397 ± 0.165
0.079CysGln: 0.079 ± 0.081
0.556CysArg: 0.556 ± 0.227
0.238CysSer: 0.238 ± 0.154
0.159CysThr: 0.159 ± 0.123
0.476CysVal: 0.476 ± 0.229
0.159CysTrp: 0.159 ± 0.104
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
9.763AspAla: 9.763 ± 0.755
0.0AspCys: 0.0 ± 0.0
5.556AspAsp: 5.556 ± 0.924
5.953AspGlu: 5.953 ± 0.617
1.984AspPhe: 1.984 ± 0.384
6.667AspGly: 6.667 ± 0.66
1.032AspHis: 1.032 ± 0.318
3.095AspIle: 3.095 ± 0.479
2.222AspLys: 2.222 ± 0.46
5.635AspLeu: 5.635 ± 0.739
1.349AspMet: 1.349 ± 0.337
1.191AspAsn: 1.191 ± 0.375
3.969AspPro: 3.969 ± 0.634
1.746AspGln: 1.746 ± 0.439
4.365AspArg: 4.365 ± 0.518
3.175AspSer: 3.175 ± 0.592
3.492AspThr: 3.492 ± 0.573
5.635AspVal: 5.635 ± 0.685
1.27AspTrp: 1.27 ± 0.269
1.111AspTyr: 1.111 ± 0.339
0.0AspXaa: 0.0 ± 0.0
Glu
7.858GluAla: 7.858 ± 0.82
0.317GluCys: 0.317 ± 0.166
3.175GluAsp: 3.175 ± 0.569
1.27GluGlu: 1.27 ± 0.348
2.857GluPhe: 2.857 ± 0.437
4.207GluGly: 4.207 ± 0.613
3.016GluHis: 3.016 ± 0.546
4.365GluIle: 4.365 ± 0.557
1.191GluLys: 1.191 ± 0.322
3.969GluLeu: 3.969 ± 0.48
2.143GluMet: 2.143 ± 0.383
1.984GluAsn: 1.984 ± 0.348
4.365GluPro: 4.365 ± 0.75
3.492GluGln: 3.492 ± 0.476
6.35GluArg: 6.35 ± 0.783
3.572GluSer: 3.572 ± 0.485
4.842GluThr: 4.842 ± 0.619
3.175GluVal: 3.175 ± 0.597
0.873GluTrp: 0.873 ± 0.249
2.064GluTyr: 2.064 ± 0.386
0.0GluXaa: 0.0 ± 0.0
Phe
3.572PheAla: 3.572 ± 0.593
0.079PheCys: 0.079 ± 0.073
2.778PheAsp: 2.778 ± 0.463
2.064PheGlu: 2.064 ± 0.429
0.714PhePhe: 0.714 ± 0.3
3.254PheGly: 3.254 ± 0.558
0.397PheHis: 0.397 ± 0.158
1.508PheIle: 1.508 ± 0.479
0.714PheLys: 0.714 ± 0.195
1.191PheLeu: 1.191 ± 0.362
0.635PheMet: 0.635 ± 0.22
0.556PheAsn: 0.556 ± 0.21
0.317PhePro: 0.317 ± 0.141
0.714PheGln: 0.714 ± 0.224
1.508PheArg: 1.508 ± 0.419
1.746PheSer: 1.746 ± 0.399
1.905PheThr: 1.905 ± 0.369
2.461PheVal: 2.461 ± 0.3
0.476PheTrp: 0.476 ± 0.207
0.476PheTyr: 0.476 ± 0.195
0.0PheXaa: 0.0 ± 0.0
Gly
9.286GlyAla: 9.286 ± 0.976
0.397GlyCys: 0.397 ± 0.185
5.953GlyAsp: 5.953 ± 0.724
5.477GlyGlu: 5.477 ± 0.92
2.54GlyPhe: 2.54 ± 0.639
6.826GlyGly: 6.826 ± 0.743
1.191GlyHis: 1.191 ± 0.279
4.286GlyIle: 4.286 ± 0.818
2.54GlyLys: 2.54 ± 0.369
6.032GlyLeu: 6.032 ± 0.952
2.064GlyMet: 2.064 ± 0.514
2.222GlyAsn: 2.222 ± 0.447
3.651GlyPro: 3.651 ± 0.561
2.381GlyGln: 2.381 ± 0.339
5.715GlyArg: 5.715 ± 0.653
5.397GlySer: 5.397 ± 0.91
6.747GlyThr: 6.747 ± 1.036
7.54GlyVal: 7.54 ± 0.632
1.667GlyTrp: 1.667 ± 0.361
2.302GlyTyr: 2.302 ± 0.4
0.0GlyXaa: 0.0 ± 0.0
His
2.064HisAla: 2.064 ± 0.426
0.079HisCys: 0.079 ± 0.072
1.508HisAsp: 1.508 ± 0.379
1.587HisGlu: 1.587 ± 0.441
0.556HisPhe: 0.556 ± 0.217
1.587HisGly: 1.587 ± 0.365
0.556HisHis: 0.556 ± 0.252
0.714HisIle: 0.714 ± 0.208
0.317HisLys: 0.317 ± 0.185
1.27HisLeu: 1.27 ± 0.365
0.397HisMet: 0.397 ± 0.176
0.159HisAsn: 0.159 ± 0.113
1.27HisPro: 1.27 ± 0.349
0.635HisGln: 0.635 ± 0.228
1.429HisArg: 1.429 ± 0.415
0.714HisSer: 0.714 ± 0.266
0.635HisThr: 0.635 ± 0.212
1.667HisVal: 1.667 ± 0.309
0.397HisTrp: 0.397 ± 0.212
0.317HisTyr: 0.317 ± 0.154
0.0HisXaa: 0.0 ± 0.0
Ile
6.429IleAla: 6.429 ± 0.793
0.238IleCys: 0.238 ± 0.14
4.683IleAsp: 4.683 ± 0.606
4.921IleGlu: 4.921 ± 0.57
1.429IlePhe: 1.429 ± 0.321
3.81IleGly: 3.81 ± 0.58
0.476IleHis: 0.476 ± 0.196
1.746IleIle: 1.746 ± 0.358
1.27IleLys: 1.27 ± 0.311
1.429IleLeu: 1.429 ± 0.274
1.111IleMet: 1.111 ± 0.333
0.635IleAsn: 0.635 ± 0.225
1.429IlePro: 1.429 ± 0.315
1.508IleGln: 1.508 ± 0.348
3.095IleArg: 3.095 ± 0.512
2.461IleSer: 2.461 ± 0.393
3.969IleThr: 3.969 ± 0.492
5.397IleVal: 5.397 ± 0.618
0.317IleTrp: 0.317 ± 0.149
1.111IleTyr: 1.111 ± 0.378
0.0IleXaa: 0.0 ± 0.0
Lys
4.048LysAla: 4.048 ± 0.58
0.079LysCys: 0.079 ± 0.076
0.714LysAsp: 0.714 ± 0.265
1.191LysGlu: 1.191 ± 0.367
0.873LysPhe: 0.873 ± 0.328
1.905LysGly: 1.905 ± 0.441
0.714LysHis: 0.714 ± 0.223
2.302LysIle: 2.302 ± 0.496
0.952LysLys: 0.952 ± 0.29
0.714LysLeu: 0.714 ± 0.327
1.191LysMet: 1.191 ± 0.331
0.317LysAsn: 0.317 ± 0.154
2.222LysPro: 2.222 ± 0.42
0.556LysGln: 0.556 ± 0.252
3.969LysArg: 3.969 ± 0.599
1.508LysSer: 1.508 ± 0.303
2.937LysThr: 2.937 ± 0.529
3.334LysVal: 3.334 ± 0.504
0.556LysTrp: 0.556 ± 0.217
0.635LysTyr: 0.635 ± 0.215
0.0LysXaa: 0.0 ± 0.0
Leu
11.191LeuAla: 11.191 ± 1.049
0.635LeuCys: 0.635 ± 0.249
6.985LeuAsp: 6.985 ± 0.813
2.461LeuGlu: 2.461 ± 0.537
1.984LeuPhe: 1.984 ± 0.439
6.191LeuGly: 6.191 ± 0.676
0.794LeuHis: 0.794 ± 0.237
2.857LeuIle: 2.857 ± 0.43
2.222LeuLys: 2.222 ± 0.414
6.905LeuLeu: 6.905 ± 0.748
1.429LeuMet: 1.429 ± 0.369
1.746LeuAsn: 1.746 ± 0.336
3.492LeuPro: 3.492 ± 0.466
1.27LeuGln: 1.27 ± 0.278
6.191LeuArg: 6.191 ± 0.658
5.0LeuSer: 5.0 ± 0.511
5.556LeuThr: 5.556 ± 0.599
6.35LeuVal: 6.35 ± 0.659
1.032LeuTrp: 1.032 ± 0.297
1.508LeuTyr: 1.508 ± 0.334
0.0LeuXaa: 0.0 ± 0.0
Met
1.746MetAla: 1.746 ± 0.361
0.0MetCys: 0.0 ± 0.0
0.635MetAsp: 0.635 ± 0.216
0.794MetGlu: 0.794 ± 0.19
0.635MetPhe: 0.635 ± 0.263
1.191MetGly: 1.191 ± 0.344
0.238MetHis: 0.238 ± 0.122
1.429MetIle: 1.429 ± 0.353
0.556MetLys: 0.556 ± 0.21
1.905MetLeu: 1.905 ± 0.351
0.556MetMet: 0.556 ± 0.238
0.714MetAsn: 0.714 ± 0.218
1.984MetPro: 1.984 ± 0.396
1.587MetGln: 1.587 ± 0.365
2.461MetArg: 2.461 ± 0.526
3.413MetSer: 3.413 ± 0.539
3.334MetThr: 3.334 ± 0.449
1.349MetVal: 1.349 ± 0.327
0.397MetTrp: 0.397 ± 0.171
0.397MetTyr: 0.397 ± 0.161
0.0MetXaa: 0.0 ± 0.0
Asn
3.81AsnAla: 3.81 ± 0.523
0.238AsnCys: 0.238 ± 0.14
1.984AsnAsp: 1.984 ± 0.419
1.111AsnGlu: 1.111 ± 0.237
0.476AsnPhe: 0.476 ± 0.212
4.048AsnGly: 4.048 ± 0.739
0.397AsnHis: 0.397 ± 0.204
0.952AsnIle: 0.952 ± 0.227
0.556AsnLys: 0.556 ± 0.192
1.508AsnLeu: 1.508 ± 0.365
0.159AsnMet: 0.159 ± 0.109
0.317AsnAsn: 0.317 ± 0.146
1.191AsnPro: 1.191 ± 0.264
0.476AsnGln: 0.476 ± 0.176
2.302AsnArg: 2.302 ± 0.334
0.873AsnSer: 0.873 ± 0.249
1.27AsnThr: 1.27 ± 0.35
1.826AsnVal: 1.826 ± 0.344
0.476AsnTrp: 0.476 ± 0.169
0.317AsnTyr: 0.317 ± 0.189
0.0AsnXaa: 0.0 ± 0.0
Pro
4.921ProAla: 4.921 ± 0.949
0.159ProCys: 0.159 ± 0.116
3.889ProAsp: 3.889 ± 0.585
3.81ProGlu: 3.81 ± 0.662
1.032ProPhe: 1.032 ± 0.25
6.032ProGly: 6.032 ± 1.143
0.556ProHis: 0.556 ± 0.239
2.222ProIle: 2.222 ± 0.394
1.905ProLys: 1.905 ± 0.465
3.254ProLeu: 3.254 ± 0.494
1.905ProMet: 1.905 ± 0.404
1.508ProAsn: 1.508 ± 0.335
2.937ProPro: 2.937 ± 0.643
1.27ProGln: 1.27 ± 0.361
2.699ProArg: 2.699 ± 0.467
3.016ProSer: 3.016 ± 0.379
4.127ProThr: 4.127 ± 0.565
4.604ProVal: 4.604 ± 0.517
0.873ProTrp: 0.873 ± 0.239
1.111ProTyr: 1.111 ± 0.278
0.0ProXaa: 0.0 ± 0.0
Gln
3.016GlnAla: 3.016 ± 0.487
0.0GlnCys: 0.0 ± 0.0
1.111GlnAsp: 1.111 ± 0.3
1.587GlnGlu: 1.587 ± 0.416
1.111GlnPhe: 1.111 ± 0.261
2.222GlnGly: 2.222 ± 0.463
1.032GlnHis: 1.032 ± 0.272
1.349GlnIle: 1.349 ± 0.33
0.476GlnLys: 0.476 ± 0.162
0.952GlnLeu: 0.952 ± 0.242
0.952GlnMet: 0.952 ± 0.276
1.032GlnAsn: 1.032 ± 0.302
2.381GlnPro: 2.381 ± 0.589
1.826GlnGln: 1.826 ± 0.47
3.572GlnArg: 3.572 ± 0.605
2.381GlnSer: 2.381 ± 0.433
2.143GlnThr: 2.143 ± 0.395
2.699GlnVal: 2.699 ± 0.52
0.794GlnTrp: 0.794 ± 0.262
1.27GlnTyr: 1.27 ± 0.283
0.0GlnXaa: 0.0 ± 0.0
Arg
9.525ArgAla: 9.525 ± 0.966
0.794ArgCys: 0.794 ± 0.346
5.159ArgAsp: 5.159 ± 0.667
6.032ArgGlu: 6.032 ± 0.777
1.429ArgPhe: 1.429 ± 0.44
6.032ArgGly: 6.032 ± 0.672
1.349ArgHis: 1.349 ± 0.377
2.857ArgIle: 2.857 ± 0.487
2.857ArgLys: 2.857 ± 0.426
7.302ArgLeu: 7.302 ± 0.719
2.143ArgMet: 2.143 ± 0.331
1.984ArgAsn: 1.984 ± 0.407
3.254ArgPro: 3.254 ± 0.517
2.778ArgGln: 2.778 ± 0.504
7.62ArgArg: 7.62 ± 0.89
3.651ArgSer: 3.651 ± 0.41
3.81ArgThr: 3.81 ± 0.524
5.0ArgVal: 5.0 ± 0.698
1.429ArgTrp: 1.429 ± 0.288
2.381ArgTyr: 2.381 ± 0.491
0.0ArgXaa: 0.0 ± 0.0
Ser
6.27SerAla: 6.27 ± 0.861
0.079SerCys: 0.079 ± 0.075
3.81SerAsp: 3.81 ± 0.557
2.619SerGlu: 2.619 ± 0.334
1.984SerPhe: 1.984 ± 0.468
5.953SerGly: 5.953 ± 0.864
0.556SerHis: 0.556 ± 0.221
2.937SerIle: 2.937 ± 0.428
2.143SerLys: 2.143 ± 0.455
4.921SerLeu: 4.921 ± 0.66
2.222SerMet: 2.222 ± 0.369
1.27SerAsn: 1.27 ± 0.294
2.937SerPro: 2.937 ± 0.461
1.746SerGln: 1.746 ± 0.395
3.73SerArg: 3.73 ± 0.554
2.778SerSer: 2.778 ± 0.394
4.365SerThr: 4.365 ± 0.569
4.207SerVal: 4.207 ± 0.377
1.429SerTrp: 1.429 ± 0.354
0.952SerTyr: 0.952 ± 0.255
0.0SerXaa: 0.0 ± 0.0
Thr
7.937ThrAla: 7.937 ± 0.916
0.556ThrCys: 0.556 ± 0.211
3.572ThrAsp: 3.572 ± 0.45
4.921ThrGlu: 4.921 ± 0.66
2.461ThrPhe: 2.461 ± 0.504
5.08ThrGly: 5.08 ± 0.571
1.111ThrHis: 1.111 ± 0.321
2.619ThrIle: 2.619 ± 0.377
2.857ThrLys: 2.857 ± 0.412
6.508ThrLeu: 6.508 ± 0.699
1.111ThrMet: 1.111 ± 0.258
1.587ThrAsn: 1.587 ± 0.287
4.604ThrPro: 4.604 ± 0.604
2.064ThrGln: 2.064 ± 0.405
4.524ThrArg: 4.524 ± 0.503
3.254ThrSer: 3.254 ± 0.513
4.286ThrThr: 4.286 ± 0.709
4.921ThrVal: 4.921 ± 0.494
1.746ThrTrp: 1.746 ± 0.443
1.587ThrTyr: 1.587 ± 0.392
0.0ThrXaa: 0.0 ± 0.0
Val
8.969ValAla: 8.969 ± 0.643
0.556ValCys: 0.556 ± 0.225
6.826ValAsp: 6.826 ± 0.772
5.715ValGlu: 5.715 ± 0.603
1.349ValPhe: 1.349 ± 0.332
6.27ValGly: 6.27 ± 0.623
1.349ValHis: 1.349 ± 0.291
3.889ValIle: 3.889 ± 0.52
2.54ValLys: 2.54 ± 0.398
6.985ValLeu: 6.985 ± 0.768
1.667ValMet: 1.667 ± 0.368
2.064ValAsn: 2.064 ± 0.321
4.048ValPro: 4.048 ± 0.488
2.461ValGln: 2.461 ± 0.333
5.239ValArg: 5.239 ± 0.796
4.207ValSer: 4.207 ± 0.566
5.159ValThr: 5.159 ± 0.827
4.842ValVal: 4.842 ± 0.585
1.27ValTrp: 1.27 ± 0.403
1.905ValTyr: 1.905 ± 0.42
0.0ValXaa: 0.0 ± 0.0
Trp
2.302TrpAla: 2.302 ± 0.456
0.159TrpCys: 0.159 ± 0.11
1.032TrpAsp: 1.032 ± 0.269
2.064TrpGlu: 2.064 ± 0.442
0.397TrpPhe: 0.397 ± 0.162
0.873TrpGly: 0.873 ± 0.23
0.635TrpHis: 0.635 ± 0.271
1.587TrpIle: 1.587 ± 0.47
0.476TrpLys: 0.476 ± 0.199
1.587TrpLeu: 1.587 ± 0.371
0.397TrpMet: 0.397 ± 0.205
0.873TrpAsn: 0.873 ± 0.337
0.635TrpPro: 0.635 ± 0.244
0.317TrpGln: 0.317 ± 0.152
1.429TrpArg: 1.429 ± 0.309
1.191TrpSer: 1.191 ± 0.313
0.714TrpThr: 0.714 ± 0.26
0.873TrpVal: 0.873 ± 0.225
0.397TrpTrp: 0.397 ± 0.194
0.476TrpTyr: 0.476 ± 0.227
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.572TyrAla: 3.572 ± 0.421
0.159TyrCys: 0.159 ± 0.115
1.27TyrAsp: 1.27 ± 0.357
1.587TyrGlu: 1.587 ± 0.397
0.476TyrPhe: 0.476 ± 0.197
1.984TyrGly: 1.984 ± 0.492
0.397TyrHis: 0.397 ± 0.171
0.397TyrIle: 0.397 ± 0.17
0.317TyrLys: 0.317 ± 0.153
1.587TyrLeu: 1.587 ± 0.35
0.556TyrMet: 0.556 ± 0.194
0.556TyrAsn: 0.556 ± 0.222
0.714TyrPro: 0.714 ± 0.217
0.873TyrGln: 0.873 ± 0.246
1.746TyrArg: 1.746 ± 0.463
1.429TyrSer: 1.429 ± 0.371
0.794TyrThr: 0.794 ± 0.241
3.095TyrVal: 3.095 ± 0.496
0.397TyrTrp: 0.397 ± 0.174
0.635TyrTyr: 0.635 ± 0.285
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 65 proteins (12600 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski