Amino acid dipepetide frequency for Microbacterium phage Schubert

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.548AlaAla: 8.548 ± 0.938
0.814AlaCys: 0.814 ± 0.239
6.024AlaAsp: 6.024 ± 0.673
5.861AlaGlu: 5.861 ± 0.709
3.012AlaPhe: 3.012 ± 0.565
7.734AlaGly: 7.734 ± 0.952
1.465AlaHis: 1.465 ± 0.345
4.64AlaIle: 4.64 ± 0.882
6.187AlaLys: 6.187 ± 0.922
10.094AlaLeu: 10.094 ± 1.035
2.686AlaMet: 2.686 ± 0.431
3.093AlaAsn: 3.093 ± 0.493
3.826AlaPro: 3.826 ± 0.67
3.175AlaGln: 3.175 ± 0.421
6.594AlaArg: 6.594 ± 0.98
4.966AlaSer: 4.966 ± 0.609
6.594AlaThr: 6.594 ± 0.699
7.245AlaVal: 7.245 ± 0.616
1.221AlaTrp: 1.221 ± 0.25
2.686AlaTyr: 2.686 ± 0.429
0.0AlaXaa: 0.0 ± 0.0
Cys
0.326CysAla: 0.326 ± 0.142
0.0CysCys: 0.0 ± 0.0
0.326CysAsp: 0.326 ± 0.134
0.244CysGlu: 0.244 ± 0.167
0.081CysPhe: 0.081 ± 0.079
0.651CysGly: 0.651 ± 0.241
0.163CysHis: 0.163 ± 0.122
0.081CysIle: 0.081 ± 0.092
0.814CysLys: 0.814 ± 0.276
0.407CysLeu: 0.407 ± 0.181
0.163CysMet: 0.163 ± 0.134
0.244CysAsn: 0.244 ± 0.152
0.733CysPro: 0.733 ± 0.291
0.163CysGln: 0.163 ± 0.116
0.081CysArg: 0.081 ± 0.084
0.407CysSer: 0.407 ± 0.177
0.407CysThr: 0.407 ± 0.149
0.57CysVal: 0.57 ± 0.2
0.081CysTrp: 0.081 ± 0.074
0.163CysTyr: 0.163 ± 0.115
0.0CysXaa: 0.0 ± 0.0
Asp
5.129AspAla: 5.129 ± 0.58
0.407AspCys: 0.407 ± 0.159
5.291AspAsp: 5.291 ± 0.857
4.884AspGlu: 4.884 ± 0.948
2.524AspPhe: 2.524 ± 0.521
4.152AspGly: 4.152 ± 0.508
1.303AspHis: 1.303 ± 0.406
3.419AspIle: 3.419 ± 0.415
2.279AspLys: 2.279 ± 0.422
5.373AspLeu: 5.373 ± 0.704
1.221AspMet: 1.221 ± 0.269
1.71AspAsn: 1.71 ± 0.311
3.745AspPro: 3.745 ± 0.695
3.093AspGln: 3.093 ± 0.453
3.012AspArg: 3.012 ± 0.482
3.338AspSer: 3.338 ± 0.495
3.012AspThr: 3.012 ± 0.622
3.663AspVal: 3.663 ± 0.517
1.547AspTrp: 1.547 ± 0.365
2.849AspTyr: 2.849 ± 0.474
0.0AspXaa: 0.0 ± 0.0
Glu
7.408GluAla: 7.408 ± 0.843
0.244GluCys: 0.244 ± 0.17
5.291GluAsp: 5.291 ± 1.034
5.047GluGlu: 5.047 ± 0.897
2.035GluPhe: 2.035 ± 0.475
4.477GluGly: 4.477 ± 0.632
1.384GluHis: 1.384 ± 0.322
2.279GluIle: 2.279 ± 0.378
2.279GluLys: 2.279 ± 0.444
5.047GluLeu: 5.047 ± 0.724
1.71GluMet: 1.71 ± 0.378
2.361GluAsn: 2.361 ± 0.413
2.768GluPro: 2.768 ± 0.494
2.442GluGln: 2.442 ± 0.389
3.663GluArg: 3.663 ± 0.526
3.175GluSer: 3.175 ± 0.481
4.07GluThr: 4.07 ± 0.678
4.396GluVal: 4.396 ± 0.834
1.791GluTrp: 1.791 ± 0.349
2.198GluTyr: 2.198 ± 0.423
0.0GluXaa: 0.0 ± 0.0
Phe
2.117PheAla: 2.117 ± 0.392
0.163PheCys: 0.163 ± 0.113
1.71PheAsp: 1.71 ± 0.405
1.221PheGlu: 1.221 ± 0.263
0.651PhePhe: 0.651 ± 0.209
3.419PheGly: 3.419 ± 0.489
0.407PheHis: 0.407 ± 0.228
0.977PheIle: 0.977 ± 0.304
1.954PheLys: 1.954 ± 0.327
2.361PheLeu: 2.361 ± 0.428
0.733PheMet: 0.733 ± 0.248
0.895PheAsn: 0.895 ± 0.243
1.221PhePro: 1.221 ± 0.35
1.791PheGln: 1.791 ± 0.33
2.198PheArg: 2.198 ± 0.489
2.198PheSer: 2.198 ± 0.349
2.035PheThr: 2.035 ± 0.37
1.71PheVal: 1.71 ± 0.359
0.651PheTrp: 0.651 ± 0.265
0.407PheTyr: 0.407 ± 0.19
0.0PheXaa: 0.0 ± 0.0
Gly
7.652GlyAla: 7.652 ± 1.259
0.651GlyCys: 0.651 ± 0.204
3.908GlyAsp: 3.908 ± 0.536
3.419GlyGlu: 3.419 ± 0.464
3.256GlyPhe: 3.256 ± 0.456
6.106GlyGly: 6.106 ± 0.855
1.547GlyHis: 1.547 ± 0.324
5.536GlyIle: 5.536 ± 0.895
4.477GlyLys: 4.477 ± 0.6
6.594GlyLeu: 6.594 ± 0.902
2.605GlyMet: 2.605 ± 0.428
2.849GlyAsn: 2.849 ± 0.446
2.361GlyPro: 2.361 ± 0.463
3.745GlyGln: 3.745 ± 0.616
4.803GlyArg: 4.803 ± 0.703
3.989GlySer: 3.989 ± 0.49
7.082GlyThr: 7.082 ± 1.044
5.129GlyVal: 5.129 ± 0.753
1.547GlyTrp: 1.547 ± 0.294
2.442GlyTyr: 2.442 ± 0.492
0.0GlyXaa: 0.0 ± 0.0
His
1.384HisAla: 1.384 ± 0.347
0.163HisCys: 0.163 ± 0.114
0.895HisAsp: 0.895 ± 0.205
1.384HisGlu: 1.384 ± 0.436
0.488HisPhe: 0.488 ± 0.192
2.361HisGly: 2.361 ± 0.39
0.651HisHis: 0.651 ± 0.258
1.465HisIle: 1.465 ± 0.382
0.977HisLys: 0.977 ± 0.235
1.465HisLeu: 1.465 ± 0.408
0.488HisMet: 0.488 ± 0.202
0.488HisAsn: 0.488 ± 0.24
0.733HisPro: 0.733 ± 0.273
0.488HisGln: 0.488 ± 0.175
1.058HisArg: 1.058 ± 0.397
0.488HisSer: 0.488 ± 0.184
1.058HisThr: 1.058 ± 0.333
1.303HisVal: 1.303 ± 0.541
0.57HisTrp: 0.57 ± 0.214
0.895HisTyr: 0.895 ± 0.231
0.0HisXaa: 0.0 ± 0.0
Ile
4.722IleAla: 4.722 ± 0.707
0.081IleCys: 0.081 ± 0.076
3.826IleAsp: 3.826 ± 0.608
3.826IleGlu: 3.826 ± 0.637
0.733IlePhe: 0.733 ± 0.266
4.152IleGly: 4.152 ± 1.256
1.384IleHis: 1.384 ± 0.362
3.419IleIle: 3.419 ± 0.879
2.605IleLys: 2.605 ± 0.449
3.582IleLeu: 3.582 ± 0.459
1.303IleMet: 1.303 ± 0.282
2.117IleAsn: 2.117 ± 0.433
2.361IlePro: 2.361 ± 0.532
3.175IleGln: 3.175 ± 0.636
3.175IleArg: 3.175 ± 0.605
3.338IleSer: 3.338 ± 0.589
3.093IleThr: 3.093 ± 0.589
3.175IleVal: 3.175 ± 0.692
1.058IleTrp: 1.058 ± 0.289
1.384IleTyr: 1.384 ± 0.349
0.0IleXaa: 0.0 ± 0.0
Lys
6.106LysAla: 6.106 ± 0.903
0.163LysCys: 0.163 ± 0.114
2.686LysAsp: 2.686 ± 0.469
3.256LysGlu: 3.256 ± 0.655
0.488LysPhe: 0.488 ± 0.188
4.233LysGly: 4.233 ± 0.564
0.57LysHis: 0.57 ± 0.193
1.872LysIle: 1.872 ± 0.381
2.686LysLys: 2.686 ± 0.529
3.012LysLeu: 3.012 ± 0.468
1.384LysMet: 1.384 ± 0.246
1.547LysAsn: 1.547 ± 0.347
3.582LysPro: 3.582 ± 0.663
1.547LysGln: 1.547 ± 0.276
2.605LysArg: 2.605 ± 0.472
2.198LysSer: 2.198 ± 0.473
3.012LysThr: 3.012 ± 0.519
3.582LysVal: 3.582 ± 0.57
0.814LysTrp: 0.814 ± 0.264
1.058LysTyr: 1.058 ± 0.225
0.0LysXaa: 0.0 ± 0.0
Leu
9.036LeuAla: 9.036 ± 0.907
0.814LeuCys: 0.814 ± 0.29
5.21LeuAsp: 5.21 ± 0.591
5.943LeuGlu: 5.943 ± 0.666
2.117LeuPhe: 2.117 ± 0.344
5.943LeuGly: 5.943 ± 0.777
1.303LeuHis: 1.303 ± 0.309
4.152LeuIle: 4.152 ± 0.9
4.07LeuLys: 4.07 ± 0.819
7.978LeuLeu: 7.978 ± 0.905
1.628LeuMet: 1.628 ± 0.37
3.093LeuAsn: 3.093 ± 0.536
4.966LeuPro: 4.966 ± 0.91
3.093LeuGln: 3.093 ± 0.52
4.803LeuArg: 4.803 ± 0.666
4.64LeuSer: 4.64 ± 0.62
6.187LeuThr: 6.187 ± 0.975
6.431LeuVal: 6.431 ± 0.785
0.895LeuTrp: 0.895 ± 0.253
2.035LeuTyr: 2.035 ± 0.387
0.0LeuXaa: 0.0 ± 0.0
Met
3.745MetAla: 3.745 ± 0.353
0.081MetCys: 0.081 ± 0.078
1.628MetAsp: 1.628 ± 0.33
0.977MetGlu: 0.977 ± 0.285
0.977MetPhe: 0.977 ± 0.324
1.71MetGly: 1.71 ± 0.401
0.733MetHis: 0.733 ± 0.237
1.303MetIle: 1.303 ± 0.385
0.488MetLys: 0.488 ± 0.177
1.71MetLeu: 1.71 ± 0.346
0.326MetMet: 0.326 ± 0.187
0.814MetAsn: 0.814 ± 0.231
1.221MetPro: 1.221 ± 0.328
1.221MetGln: 1.221 ± 0.282
1.058MetArg: 1.058 ± 0.341
2.117MetSer: 2.117 ± 0.434
1.872MetThr: 1.872 ± 0.367
1.465MetVal: 1.465 ± 0.384
0.488MetTrp: 0.488 ± 0.165
0.407MetTyr: 0.407 ± 0.177
0.0MetXaa: 0.0 ± 0.0
Asn
2.686AsnAla: 2.686 ± 0.472
0.081AsnCys: 0.081 ± 0.075
1.791AsnAsp: 1.791 ± 0.334
1.791AsnGlu: 1.791 ± 0.421
0.977AsnPhe: 0.977 ± 0.322
3.093AsnGly: 3.093 ± 0.578
0.651AsnHis: 0.651 ± 0.189
2.035AsnIle: 2.035 ± 0.395
1.954AsnLys: 1.954 ± 0.399
3.419AsnLeu: 3.419 ± 0.632
0.488AsnMet: 0.488 ± 0.213
1.303AsnAsn: 1.303 ± 0.265
2.117AsnPro: 2.117 ± 0.404
1.547AsnGln: 1.547 ± 0.369
1.628AsnArg: 1.628 ± 0.469
1.954AsnSer: 1.954 ± 0.413
2.524AsnThr: 2.524 ± 0.406
2.686AsnVal: 2.686 ± 0.487
0.651AsnTrp: 0.651 ± 0.196
1.465AsnTyr: 1.465 ± 0.337
0.0AsnXaa: 0.0 ± 0.0
Pro
4.233ProAla: 4.233 ± 0.577
0.244ProCys: 0.244 ± 0.209
2.279ProAsp: 2.279 ± 0.492
2.768ProGlu: 2.768 ± 0.506
1.384ProPhe: 1.384 ± 0.283
3.989ProGly: 3.989 ± 0.426
0.895ProHis: 0.895 ± 0.226
1.547ProIle: 1.547 ± 0.327
2.768ProLys: 2.768 ± 0.513
4.07ProLeu: 4.07 ± 0.512
0.814ProMet: 0.814 ± 0.237
2.117ProAsn: 2.117 ± 0.426
1.384ProPro: 1.384 ± 0.364
2.361ProGln: 2.361 ± 0.695
2.605ProArg: 2.605 ± 0.632
3.5ProSer: 3.5 ± 0.392
3.5ProThr: 3.5 ± 0.48
4.559ProVal: 4.559 ± 0.46
0.733ProTrp: 0.733 ± 0.241
1.384ProTyr: 1.384 ± 0.295
0.0ProXaa: 0.0 ± 0.0
Gln
4.722GlnAla: 4.722 ± 0.635
0.163GlnCys: 0.163 ± 0.128
2.524GlnAsp: 2.524 ± 0.396
4.233GlnGlu: 4.233 ± 0.713
0.488GlnPhe: 0.488 ± 0.208
2.686GlnGly: 2.686 ± 0.359
0.814GlnHis: 0.814 ± 0.272
3.175GlnIle: 3.175 ± 0.663
1.465GlnLys: 1.465 ± 0.341
2.524GlnLeu: 2.524 ± 0.723
0.977GlnMet: 0.977 ± 0.241
1.791GlnAsn: 1.791 ± 0.355
1.384GlnPro: 1.384 ± 0.343
2.605GlnGln: 2.605 ± 0.527
2.361GlnArg: 2.361 ± 0.386
2.442GlnSer: 2.442 ± 0.48
2.686GlnThr: 2.686 ± 0.594
3.012GlnVal: 3.012 ± 0.446
0.57GlnTrp: 0.57 ± 0.178
1.14GlnTyr: 1.14 ± 0.278
0.0GlnXaa: 0.0 ± 0.0
Arg
6.187ArgAla: 6.187 ± 0.731
0.651ArgCys: 0.651 ± 0.233
3.663ArgAsp: 3.663 ± 0.583
4.233ArgGlu: 4.233 ± 0.544
1.71ArgPhe: 1.71 ± 0.39
3.989ArgGly: 3.989 ± 0.692
0.651ArgHis: 0.651 ± 0.23
3.582ArgIle: 3.582 ± 0.497
2.117ArgLys: 2.117 ± 0.473
5.78ArgLeu: 5.78 ± 0.827
2.035ArgMet: 2.035 ± 0.425
2.117ArgAsn: 2.117 ± 0.335
2.524ArgPro: 2.524 ± 0.579
1.547ArgGln: 1.547 ± 0.311
3.745ArgArg: 3.745 ± 0.73
3.582ArgSer: 3.582 ± 0.487
3.826ArgThr: 3.826 ± 0.694
4.07ArgVal: 4.07 ± 0.624
1.303ArgTrp: 1.303 ± 0.402
1.547ArgTyr: 1.547 ± 0.373
0.0ArgXaa: 0.0 ± 0.0
Ser
6.024SerAla: 6.024 ± 0.581
0.081SerCys: 0.081 ± 0.073
3.663SerAsp: 3.663 ± 0.527
3.012SerGlu: 3.012 ± 0.561
2.035SerPhe: 2.035 ± 0.422
5.617SerGly: 5.617 ± 0.881
0.977SerHis: 0.977 ± 0.303
3.338SerIle: 3.338 ± 0.521
2.442SerLys: 2.442 ± 0.409
3.989SerLeu: 3.989 ± 0.669
1.547SerMet: 1.547 ± 0.374
2.279SerAsn: 2.279 ± 0.351
2.605SerPro: 2.605 ± 0.474
2.768SerGln: 2.768 ± 0.431
4.152SerArg: 4.152 ± 0.653
3.989SerSer: 3.989 ± 0.573
4.722SerThr: 4.722 ± 0.517
3.663SerVal: 3.663 ± 0.446
1.303SerTrp: 1.303 ± 0.359
2.442SerTyr: 2.442 ± 0.456
0.0SerXaa: 0.0 ± 0.0
Thr
5.373ThrAla: 5.373 ± 0.727
0.163ThrCys: 0.163 ± 0.135
3.745ThrAsp: 3.745 ± 0.575
3.745ThrGlu: 3.745 ± 0.491
2.361ThrPhe: 2.361 ± 0.373
6.268ThrGly: 6.268 ± 0.643
1.058ThrHis: 1.058 ± 0.32
3.582ThrIle: 3.582 ± 0.651
2.605ThrLys: 2.605 ± 0.459
6.757ThrLeu: 6.757 ± 0.985
1.058ThrMet: 1.058 ± 0.311
2.035ThrAsn: 2.035 ± 0.471
3.582ThrPro: 3.582 ± 0.52
2.361ThrGln: 2.361 ± 0.492
3.663ThrArg: 3.663 ± 0.642
5.129ThrSer: 5.129 ± 0.67
4.966ThrThr: 4.966 ± 0.702
6.431ThrVal: 6.431 ± 0.882
1.872ThrTrp: 1.872 ± 0.502
2.279ThrTyr: 2.279 ± 0.493
0.0ThrXaa: 0.0 ± 0.0
Val
7.001ValAla: 7.001 ± 0.646
0.488ValCys: 0.488 ± 0.205
4.64ValAsp: 4.64 ± 0.712
4.477ValGlu: 4.477 ± 0.801
1.872ValPhe: 1.872 ± 0.368
5.21ValGly: 5.21 ± 0.74
1.465ValHis: 1.465 ± 0.342
3.908ValIle: 3.908 ± 0.7
2.035ValLys: 2.035 ± 0.43
5.617ValLeu: 5.617 ± 0.747
1.954ValMet: 1.954 ± 0.394
2.035ValAsn: 2.035 ± 0.471
3.256ValPro: 3.256 ± 0.553
3.419ValGln: 3.419 ± 0.623
4.396ValArg: 4.396 ± 0.495
4.966ValSer: 4.966 ± 0.516
5.698ValThr: 5.698 ± 0.586
5.78ValVal: 5.78 ± 1.02
1.628ValTrp: 1.628 ± 0.373
2.686ValTyr: 2.686 ± 0.489
0.0ValXaa: 0.0 ± 0.0
Trp
1.547TrpAla: 1.547 ± 0.35
0.163TrpCys: 0.163 ± 0.135
1.303TrpAsp: 1.303 ± 0.253
1.628TrpGlu: 1.628 ± 0.353
0.733TrpPhe: 0.733 ± 0.289
1.14TrpGly: 1.14 ± 0.36
0.57TrpHis: 0.57 ± 0.206
1.14TrpIle: 1.14 ± 0.322
0.977TrpLys: 0.977 ± 0.24
1.628TrpLeu: 1.628 ± 0.407
0.244TrpMet: 0.244 ± 0.136
0.57TrpAsn: 0.57 ± 0.178
1.14TrpPro: 1.14 ± 0.315
0.488TrpGln: 0.488 ± 0.227
1.058TrpArg: 1.058 ± 0.288
1.384TrpSer: 1.384 ± 0.411
1.221TrpThr: 1.221 ± 0.365
1.71TrpVal: 1.71 ± 0.347
0.57TrpTrp: 0.57 ± 0.301
0.977TrpTyr: 0.977 ± 0.275
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.524TyrAla: 2.524 ± 0.393
0.488TyrCys: 0.488 ± 0.183
1.547TyrAsp: 1.547 ± 0.332
2.198TyrGlu: 2.198 ± 0.432
0.895TyrPhe: 0.895 ± 0.23
2.768TyrGly: 2.768 ± 0.434
0.814TyrHis: 0.814 ± 0.251
1.058TyrIle: 1.058 ± 0.27
1.221TyrLys: 1.221 ± 0.387
2.931TyrLeu: 2.931 ± 0.552
0.895TyrMet: 0.895 ± 0.227
1.384TyrAsn: 1.384 ± 0.321
1.628TyrPro: 1.628 ± 0.339
0.733TyrGln: 0.733 ± 0.269
2.198TyrArg: 2.198 ± 0.389
2.849TyrSer: 2.849 ± 0.623
1.465TyrThr: 1.465 ± 0.325
1.954TyrVal: 1.954 ± 0.519
0.895TyrTrp: 0.895 ± 0.285
1.058TyrTyr: 1.058 ± 0.364
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (12285 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski