Amino acid dipepetide frequency for Gordonia phage Lahirium

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.088AlaAla: 14.088 ± 3.998
0.613AlaCys: 0.613 ± 0.177
5.032AlaAsp: 5.032 ± 0.417
7.088AlaGlu: 7.088 ± 0.687
2.888AlaPhe: 2.888 ± 0.374
7.088AlaGly: 7.088 ± 0.691
1.138AlaHis: 1.138 ± 0.25
5.382AlaIle: 5.382 ± 0.756
4.244AlaLys: 4.244 ± 0.427
8.182AlaLeu: 8.182 ± 0.941
3.019AlaMet: 3.019 ± 0.417
3.019AlaAsn: 3.019 ± 0.409
4.638AlaPro: 4.638 ± 0.452
3.894AlaGln: 3.894 ± 0.659
5.775AlaArg: 5.775 ± 0.562
5.994AlaSer: 5.994 ± 0.705
5.819AlaThr: 5.819 ± 0.526
6.738AlaVal: 6.738 ± 0.748
1.925AlaTrp: 1.925 ± 0.545
2.669AlaTyr: 2.669 ± 0.267
0.0AlaXaa: 0.0 ± 0.0
Cys
0.788CysAla: 0.788 ± 0.184
0.131CysCys: 0.131 ± 0.071
0.525CysAsp: 0.525 ± 0.162
0.394CysGlu: 0.394 ± 0.138
0.263CysPhe: 0.263 ± 0.123
1.006CysGly: 1.006 ± 0.286
0.306CysHis: 0.306 ± 0.121
0.613CysIle: 0.613 ± 0.188
0.394CysLys: 0.394 ± 0.154
0.263CysLeu: 0.263 ± 0.114
0.175CysMet: 0.175 ± 0.104
0.394CysAsn: 0.394 ± 0.13
0.656CysPro: 0.656 ± 0.174
0.263CysGln: 0.263 ± 0.108
0.744CysArg: 0.744 ± 0.233
0.7CysSer: 0.7 ± 0.194
0.7CysThr: 0.7 ± 0.201
0.656CysVal: 0.656 ± 0.178
0.088CysTrp: 0.088 ± 0.06
0.306CysTyr: 0.306 ± 0.108
0.0CysXaa: 0.0 ± 0.0
Asp
5.294AspAla: 5.294 ± 0.537
0.656AspCys: 0.656 ± 0.206
4.288AspAsp: 4.288 ± 0.666
4.944AspGlu: 4.944 ± 0.698
2.188AspPhe: 2.188 ± 0.285
4.55AspGly: 4.55 ± 0.741
1.006AspHis: 1.006 ± 0.261
3.544AspIle: 3.544 ± 0.393
2.669AspLys: 2.669 ± 0.374
4.375AspLeu: 4.375 ± 0.504
1.4AspMet: 1.4 ± 0.238
2.844AspAsn: 2.844 ± 0.369
3.981AspPro: 3.981 ± 0.586
2.275AspGln: 2.275 ± 0.334
2.669AspArg: 2.669 ± 0.477
3.325AspSer: 3.325 ± 0.332
3.413AspThr: 3.413 ± 0.403
4.244AspVal: 4.244 ± 0.605
1.138AspTrp: 1.138 ± 0.254
2.144AspTyr: 2.144 ± 0.371
0.0AspXaa: 0.0 ± 0.0
Glu
6.519GluAla: 6.519 ± 0.564
0.919GluCys: 0.919 ± 0.257
3.894GluAsp: 3.894 ± 0.494
5.863GluGlu: 5.863 ± 0.659
2.581GluPhe: 2.581 ± 0.414
4.813GluGly: 4.813 ± 0.47
0.919GluHis: 0.919 ± 0.234
3.806GluIle: 3.806 ± 0.334
2.931GluLys: 2.931 ± 0.398
5.907GluLeu: 5.907 ± 0.87
1.531GluMet: 1.531 ± 0.308
1.794GluAsn: 1.794 ± 0.231
2.538GluPro: 2.538 ± 0.477
3.281GluGln: 3.281 ± 0.455
4.944GluArg: 4.944 ± 0.625
4.156GluSer: 4.156 ± 0.378
3.85GluThr: 3.85 ± 0.396
5.207GluVal: 5.207 ± 0.367
1.356GluTrp: 1.356 ± 0.298
2.494GluTyr: 2.494 ± 0.42
0.0GluXaa: 0.0 ± 0.0
Phe
3.369PheAla: 3.369 ± 0.355
0.394PheCys: 0.394 ± 0.142
2.756PheAsp: 2.756 ± 0.398
2.713PheGlu: 2.713 ± 0.414
0.963PhePhe: 0.963 ± 0.157
3.369PheGly: 3.369 ± 0.446
0.788PheHis: 0.788 ± 0.199
1.838PheIle: 1.838 ± 0.318
2.188PheLys: 2.188 ± 0.292
1.925PheLeu: 1.925 ± 0.278
0.919PheMet: 0.919 ± 0.189
1.706PheAsn: 1.706 ± 0.213
1.531PhePro: 1.531 ± 0.304
1.138PheGln: 1.138 ± 0.184
1.619PheArg: 1.619 ± 0.258
2.406PheSer: 2.406 ± 0.337
2.144PheThr: 2.144 ± 0.313
2.013PheVal: 2.013 ± 0.335
0.394PheTrp: 0.394 ± 0.107
0.963PheTyr: 0.963 ± 0.198
0.0PheXaa: 0.0 ± 0.0
Gly
6.782GlyAla: 6.782 ± 0.803
0.831GlyCys: 0.831 ± 0.223
4.681GlyAsp: 4.681 ± 0.433
4.506GlyGlu: 4.506 ± 0.43
2.713GlyPhe: 2.713 ± 0.289
8.182GlyGly: 8.182 ± 1.112
1.225GlyHis: 1.225 ± 0.254
4.244GlyIle: 4.244 ± 0.526
4.813GlyLys: 4.813 ± 0.518
4.988GlyLeu: 4.988 ± 0.619
1.838GlyMet: 1.838 ± 0.28
3.5GlyAsn: 3.5 ± 0.445
3.588GlyPro: 3.588 ± 0.375
2.144GlyGln: 2.144 ± 0.293
4.025GlyArg: 4.025 ± 0.539
5.513GlySer: 5.513 ± 0.632
5.6GlyThr: 5.6 ± 0.588
5.557GlyVal: 5.557 ± 0.425
1.575GlyTrp: 1.575 ± 0.322
2.494GlyTyr: 2.494 ± 0.391
0.0GlyXaa: 0.0 ± 0.0
His
1.225HisAla: 1.225 ± 0.281
0.219HisCys: 0.219 ± 0.099
0.788HisAsp: 0.788 ± 0.243
0.919HisGlu: 0.919 ± 0.255
0.394HisPhe: 0.394 ± 0.134
1.138HisGly: 1.138 ± 0.292
0.35HisHis: 0.35 ± 0.144
1.006HisIle: 1.006 ± 0.176
0.963HisLys: 0.963 ± 0.225
1.575HisLeu: 1.575 ± 0.282
0.438HisMet: 0.438 ± 0.131
0.525HisAsn: 0.525 ± 0.123
1.138HisPro: 1.138 ± 0.221
0.788HisGln: 0.788 ± 0.206
1.225HisArg: 1.225 ± 0.264
1.181HisSer: 1.181 ± 0.229
0.831HisThr: 0.831 ± 0.208
1.05HisVal: 1.05 ± 0.211
0.263HisTrp: 0.263 ± 0.101
0.569HisTyr: 0.569 ± 0.146
0.0HisXaa: 0.0 ± 0.0
Ile
5.294IleAla: 5.294 ± 0.713
0.306IleCys: 0.306 ± 0.097
4.069IleAsp: 4.069 ± 0.452
3.938IleGlu: 3.938 ± 0.458
1.838IlePhe: 1.838 ± 0.267
4.813IleGly: 4.813 ± 0.676
0.656IleHis: 0.656 ± 0.171
3.238IleIle: 3.238 ± 0.373
3.15IleLys: 3.15 ± 0.449
3.238IleLeu: 3.238 ± 0.352
0.919IleMet: 0.919 ± 0.197
2.144IleAsn: 2.144 ± 0.268
2.625IlePro: 2.625 ± 0.333
2.713IleGln: 2.713 ± 0.327
3.238IleArg: 3.238 ± 0.431
2.888IleSer: 2.888 ± 0.303
2.8IleThr: 2.8 ± 0.318
3.588IleVal: 3.588 ± 0.409
0.919IleTrp: 0.919 ± 0.19
1.05IleTyr: 1.05 ± 0.266
0.0IleXaa: 0.0 ± 0.0
Lys
5.119LysAla: 5.119 ± 0.689
0.306LysCys: 0.306 ± 0.131
3.194LysAsp: 3.194 ± 0.547
2.844LysGlu: 2.844 ± 0.427
2.013LysPhe: 2.013 ± 0.257
3.281LysGly: 3.281 ± 0.441
0.788LysHis: 0.788 ± 0.208
2.713LysIle: 2.713 ± 0.34
3.15LysLys: 3.15 ± 0.367
4.594LysLeu: 4.594 ± 0.446
1.269LysMet: 1.269 ± 0.329
2.494LysAsn: 2.494 ± 0.298
2.406LysPro: 2.406 ± 0.383
1.925LysGln: 1.925 ± 0.325
3.369LysArg: 3.369 ± 0.48
2.625LysSer: 2.625 ± 0.44
2.844LysThr: 2.844 ± 0.329
3.806LysVal: 3.806 ± 0.377
0.613LysTrp: 0.613 ± 0.163
1.575LysTyr: 1.575 ± 0.3
0.0LysXaa: 0.0 ± 0.0
Leu
7.088LeuAla: 7.088 ± 0.606
0.744LeuCys: 0.744 ± 0.212
4.594LeuAsp: 4.594 ± 0.423
6.082LeuGlu: 6.082 ± 0.67
2.756LeuPhe: 2.756 ± 0.315
5.075LeuGly: 5.075 ± 0.427
1.356LeuHis: 1.356 ± 0.271
3.456LeuIle: 3.456 ± 0.414
3.675LeuLys: 3.675 ± 0.429
5.513LeuLeu: 5.513 ± 0.416
2.056LeuMet: 2.056 ± 0.292
3.019LeuAsn: 3.019 ± 0.386
3.981LeuPro: 3.981 ± 0.388
3.15LeuGln: 3.15 ± 0.424
4.375LeuArg: 4.375 ± 0.453
3.981LeuSer: 3.981 ± 0.459
4.856LeuThr: 4.856 ± 0.468
4.638LeuVal: 4.638 ± 0.411
1.269LeuTrp: 1.269 ± 0.216
1.794LeuTyr: 1.794 ± 0.352
0.0LeuXaa: 0.0 ± 0.0
Met
2.406MetAla: 2.406 ± 0.343
0.219MetCys: 0.219 ± 0.103
1.181MetAsp: 1.181 ± 0.209
1.181MetGlu: 1.181 ± 0.202
0.613MetPhe: 0.613 ± 0.195
1.488MetGly: 1.488 ± 0.261
0.394MetHis: 0.394 ± 0.138
0.831MetIle: 0.831 ± 0.168
1.794MetLys: 1.794 ± 0.267
1.794MetLeu: 1.794 ± 0.274
0.656MetMet: 0.656 ± 0.195
1.313MetAsn: 1.313 ± 0.24
1.356MetPro: 1.356 ± 0.287
1.05MetGln: 1.05 ± 0.222
1.225MetArg: 1.225 ± 0.21
1.881MetSer: 1.881 ± 0.26
2.013MetThr: 2.013 ± 0.312
1.619MetVal: 1.619 ± 0.396
0.219MetTrp: 0.219 ± 0.078
0.963MetTyr: 0.963 ± 0.233
0.0MetXaa: 0.0 ± 0.0
Asn
3.981AsnAla: 3.981 ± 0.644
0.394AsnCys: 0.394 ± 0.16
2.406AsnAsp: 2.406 ± 0.399
1.925AsnGlu: 1.925 ± 0.291
1.225AsnPhe: 1.225 ± 0.251
3.981AsnGly: 3.981 ± 0.396
0.7AsnHis: 0.7 ± 0.209
2.406AsnIle: 2.406 ± 0.329
2.056AsnLys: 2.056 ± 0.32
3.063AsnLeu: 3.063 ± 0.372
1.138AsnMet: 1.138 ± 0.224
1.881AsnAsn: 1.881 ± 0.321
2.669AsnPro: 2.669 ± 0.294
1.794AsnGln: 1.794 ± 0.245
1.925AsnArg: 1.925 ± 0.346
2.494AsnSer: 2.494 ± 0.397
2.144AsnThr: 2.144 ± 0.32
2.713AsnVal: 2.713 ± 0.364
0.481AsnTrp: 0.481 ± 0.119
1.356AsnTyr: 1.356 ± 0.207
0.0AsnXaa: 0.0 ± 0.0
Pro
4.856ProAla: 4.856 ± 0.404
0.219ProCys: 0.219 ± 0.109
3.763ProAsp: 3.763 ± 0.397
4.331ProGlu: 4.331 ± 0.695
1.881ProPhe: 1.881 ± 0.317
4.638ProGly: 4.638 ± 0.475
0.831ProHis: 0.831 ± 0.252
2.275ProIle: 2.275 ± 0.321
2.756ProLys: 2.756 ± 0.426
2.975ProLeu: 2.975 ± 0.402
1.181ProMet: 1.181 ± 0.196
2.494ProAsn: 2.494 ± 0.384
2.538ProPro: 2.538 ± 0.415
1.794ProGln: 1.794 ± 0.375
1.444ProArg: 1.444 ± 0.298
3.456ProSer: 3.456 ± 0.505
3.369ProThr: 3.369 ± 0.492
4.419ProVal: 4.419 ± 0.4
1.094ProTrp: 1.094 ± 0.254
1.575ProTyr: 1.575 ± 0.263
0.0ProXaa: 0.0 ± 0.0
Gln
4.2GlnAla: 4.2 ± 0.62
0.219GlnCys: 0.219 ± 0.088
1.619GlnAsp: 1.619 ± 0.278
2.45GlnGlu: 2.45 ± 0.461
1.531GlnPhe: 1.531 ± 0.295
2.275GlnGly: 2.275 ± 0.307
0.656GlnHis: 0.656 ± 0.173
2.144GlnIle: 2.144 ± 0.402
1.794GlnLys: 1.794 ± 0.303
3.019GlnLeu: 3.019 ± 0.375
0.7GlnMet: 0.7 ± 0.148
1.313GlnAsn: 1.313 ± 0.353
1.794GlnPro: 1.794 ± 0.329
1.619GlnGln: 1.619 ± 0.306
2.625GlnArg: 2.625 ± 0.454
2.1GlnSer: 2.1 ± 0.325
1.881GlnThr: 1.881 ± 0.343
3.325GlnVal: 3.325 ± 0.282
0.744GlnTrp: 0.744 ± 0.198
1.313GlnTyr: 1.313 ± 0.259
0.0GlnXaa: 0.0 ± 0.0
Arg
4.9ArgAla: 4.9 ± 0.519
0.744ArgCys: 0.744 ± 0.194
2.931ArgAsp: 2.931 ± 0.546
4.156ArgGlu: 4.156 ± 0.437
2.1ArgPhe: 2.1 ± 0.33
3.719ArgGly: 3.719 ± 0.436
1.4ArgHis: 1.4 ± 0.281
3.063ArgIle: 3.063 ± 0.43
2.931ArgLys: 2.931 ± 0.422
5.075ArgLeu: 5.075 ± 0.556
1.488ArgMet: 1.488 ± 0.282
2.581ArgAsn: 2.581 ± 0.363
3.063ArgPro: 3.063 ± 0.421
2.363ArgGln: 2.363 ± 0.325
4.594ArgArg: 4.594 ± 0.644
2.363ArgSer: 2.363 ± 0.294
2.669ArgThr: 2.669 ± 0.331
4.55ArgVal: 4.55 ± 0.645
0.7ArgTrp: 0.7 ± 0.212
2.231ArgTyr: 2.231 ± 0.354
0.0ArgXaa: 0.0 ± 0.0
Ser
5.95SerAla: 5.95 ± 0.553
0.263SerCys: 0.263 ± 0.119
3.806SerAsp: 3.806 ± 0.4
3.238SerGlu: 3.238 ± 0.426
2.275SerPhe: 2.275 ± 0.367
4.9SerGly: 4.9 ± 0.599
1.094SerHis: 1.094 ± 0.217
4.069SerIle: 4.069 ± 0.456
3.019SerLys: 3.019 ± 0.364
4.2SerLeu: 4.2 ± 0.53
1.619SerMet: 1.619 ± 0.25
2.363SerAsn: 2.363 ± 0.37
3.238SerPro: 3.238 ± 0.368
1.663SerGln: 1.663 ± 0.304
3.413SerArg: 3.413 ± 0.347
4.9SerSer: 4.9 ± 0.504
3.675SerThr: 3.675 ± 0.358
4.681SerVal: 4.681 ± 0.511
1.05SerTrp: 1.05 ± 0.192
1.881SerTyr: 1.881 ± 0.278
0.0SerXaa: 0.0 ± 0.0
Thr
6.125ThrAla: 6.125 ± 0.8
0.569ThrCys: 0.569 ± 0.19
3.85ThrAsp: 3.85 ± 0.639
3.85ThrGlu: 3.85 ± 0.512
1.838ThrPhe: 1.838 ± 0.276
5.688ThrGly: 5.688 ± 0.5
0.788ThrHis: 0.788 ± 0.214
2.931ThrIle: 2.931 ± 0.369
2.494ThrLys: 2.494 ± 0.286
4.638ThrLeu: 4.638 ± 0.468
1.094ThrMet: 1.094 ± 0.266
2.363ThrAsn: 2.363 ± 0.314
4.156ThrPro: 4.156 ± 0.636
1.4ThrGln: 1.4 ± 0.231
2.8ThrArg: 2.8 ± 0.362
3.938ThrSer: 3.938 ± 0.51
4.944ThrThr: 4.944 ± 0.598
4.9ThrVal: 4.9 ± 0.528
1.269ThrTrp: 1.269 ± 0.229
1.663ThrTyr: 1.663 ± 0.302
0.0ThrXaa: 0.0 ± 0.0
Val
7.175ValAla: 7.175 ± 0.747
0.656ValCys: 0.656 ± 0.193
4.594ValAsp: 4.594 ± 0.477
5.688ValGlu: 5.688 ± 0.527
3.063ValPhe: 3.063 ± 0.398
4.856ValGly: 4.856 ± 0.719
1.444ValHis: 1.444 ± 0.233
3.763ValIle: 3.763 ± 0.507
3.5ValLys: 3.5 ± 0.37
4.813ValLeu: 4.813 ± 0.433
1.663ValMet: 1.663 ± 0.229
2.713ValAsn: 2.713 ± 0.351
3.675ValPro: 3.675 ± 0.401
2.275ValGln: 2.275 ± 0.326
4.113ValArg: 4.113 ± 0.437
5.075ValSer: 5.075 ± 0.582
4.463ValThr: 4.463 ± 0.529
4.813ValVal: 4.813 ± 0.474
1.313ValTrp: 1.313 ± 0.215
1.706ValTyr: 1.706 ± 0.275
0.0ValXaa: 0.0 ± 0.0
Trp
1.488TrpAla: 1.488 ± 0.495
0.438TrpCys: 0.438 ± 0.152
1.181TrpAsp: 1.181 ± 0.21
1.356TrpGlu: 1.356 ± 0.211
0.788TrpPhe: 0.788 ± 0.182
1.269TrpGly: 1.269 ± 0.251
0.35TrpHis: 0.35 ± 0.124
0.7TrpIle: 0.7 ± 0.176
1.006TrpLys: 1.006 ± 0.223
1.094TrpLeu: 1.094 ± 0.2
0.263TrpMet: 0.263 ± 0.086
0.656TrpAsn: 0.656 ± 0.237
0.788TrpPro: 0.788 ± 0.207
0.744TrpGln: 0.744 ± 0.245
1.313TrpArg: 1.313 ± 0.243
0.963TrpSer: 0.963 ± 0.199
1.225TrpThr: 1.225 ± 0.275
0.831TrpVal: 0.831 ± 0.209
0.175TrpTrp: 0.175 ± 0.089
0.481TrpTyr: 0.481 ± 0.135
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.581TyrAla: 2.581 ± 0.428
0.525TyrCys: 0.525 ± 0.155
1.794TyrAsp: 1.794 ± 0.341
1.838TyrGlu: 1.838 ± 0.353
1.225TyrPhe: 1.225 ± 0.249
2.669TyrGly: 2.669 ± 0.365
0.394TyrHis: 0.394 ± 0.13
1.4TyrIle: 1.4 ± 0.315
1.4TyrLys: 1.4 ± 0.206
2.231TyrLeu: 2.231 ± 0.283
0.788TyrMet: 0.788 ± 0.18
1.619TyrAsn: 1.619 ± 0.283
1.531TyrPro: 1.531 ± 0.246
1.05TyrGln: 1.05 ± 0.193
2.231TyrArg: 2.231 ± 0.382
1.356TyrSer: 1.356 ± 0.25
2.056TyrThr: 2.056 ± 0.331
2.013TyrVal: 2.013 ± 0.415
0.525TyrTrp: 0.525 ± 0.158
0.788TyrTyr: 0.788 ± 0.203
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 91 proteins (22857 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski