Amino acid dipepetide frequency for Mannheimia phage vB_MhS_535AP2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.768AlaAla: 5.768 ± 1.156
0.398AlaCys: 0.398 ± 0.16
5.105AlaAsp: 5.105 ± 0.552
7.028AlaGlu: 7.028 ± 0.735
2.785AlaPhe: 2.785 ± 0.526
4.111AlaGly: 4.111 ± 0.505
1.326AlaHis: 1.326 ± 0.293
5.967AlaIle: 5.967 ± 0.744
6.63AlaLys: 6.63 ± 0.852
5.702AlaLeu: 5.702 ± 0.737
2.32AlaMet: 2.32 ± 0.492
5.105AlaAsn: 5.105 ± 0.778
2.586AlaPro: 2.586 ± 0.505
4.243AlaGln: 4.243 ± 0.72
4.309AlaArg: 4.309 ± 0.698
3.779AlaSer: 3.779 ± 0.758
5.105AlaThr: 5.105 ± 0.695
6.1AlaVal: 6.1 ± 0.596
0.928AlaTrp: 0.928 ± 0.227
2.453AlaTyr: 2.453 ± 0.352
0.0AlaXaa: 0.0 ± 0.0
Cys
0.398CysAla: 0.398 ± 0.202
0.464CysCys: 0.464 ± 0.225
0.53CysAsp: 0.53 ± 0.188
0.597CysGlu: 0.597 ± 0.18
0.199CysPhe: 0.199 ± 0.121
1.193CysGly: 1.193 ± 0.389
0.331CysHis: 0.331 ± 0.158
0.597CysIle: 0.597 ± 0.195
0.796CysLys: 0.796 ± 0.273
0.928CysLeu: 0.928 ± 0.341
0.199CysMet: 0.199 ± 0.126
0.796CysAsn: 0.796 ± 0.272
0.53CysPro: 0.53 ± 0.209
0.331CysGln: 0.331 ± 0.176
0.199CysArg: 0.199 ± 0.125
0.398CysSer: 0.398 ± 0.152
0.464CysThr: 0.464 ± 0.179
0.464CysVal: 0.464 ± 0.206
0.265CysTrp: 0.265 ± 0.131
0.398CysTyr: 0.398 ± 0.161
0.0CysXaa: 0.0 ± 0.0
Asp
3.448AspAla: 3.448 ± 0.574
0.597AspCys: 0.597 ± 0.221
2.917AspAsp: 2.917 ± 0.514
3.978AspGlu: 3.978 ± 0.646
2.387AspPhe: 2.387 ± 0.538
5.171AspGly: 5.171 ± 0.639
0.597AspHis: 0.597 ± 0.185
2.917AspIle: 2.917 ± 0.459
4.177AspLys: 4.177 ± 0.412
5.105AspLeu: 5.105 ± 0.526
0.862AspMet: 0.862 ± 0.245
2.851AspAsn: 2.851 ± 0.453
1.657AspPro: 1.657 ± 0.38
1.26AspGln: 1.26 ± 0.25
1.923AspArg: 1.923 ± 0.354
3.448AspSer: 3.448 ± 0.547
2.586AspThr: 2.586 ± 0.437
3.713AspVal: 3.713 ± 0.48
0.862AspTrp: 0.862 ± 0.272
1.923AspTyr: 1.923 ± 0.415
0.0AspXaa: 0.0 ± 0.0
Glu
5.967GluAla: 5.967 ± 0.885
0.663GluCys: 0.663 ± 0.233
3.05GluAsp: 3.05 ± 0.445
4.707GluGlu: 4.707 ± 0.664
3.646GluPhe: 3.646 ± 0.523
2.387GluGly: 2.387 ± 0.468
0.464GluHis: 0.464 ± 0.163
5.901GluIle: 5.901 ± 0.574
4.442GluLys: 4.442 ± 0.585
6.763GluLeu: 6.763 ± 0.919
2.122GluMet: 2.122 ± 0.417
3.315GluAsn: 3.315 ± 0.421
2.718GluPro: 2.718 ± 0.542
4.972GluGln: 4.972 ± 0.804
4.044GluArg: 4.044 ± 0.574
3.978GluSer: 3.978 ± 0.415
5.304GluThr: 5.304 ± 0.766
3.58GluVal: 3.58 ± 0.481
1.79GluTrp: 1.79 ± 0.327
2.122GluTyr: 2.122 ± 0.397
0.0GluXaa: 0.0 ± 0.0
Phe
3.713PheAla: 3.713 ± 0.396
0.398PheCys: 0.398 ± 0.152
2.851PheAsp: 2.851 ± 0.422
3.05PheGlu: 3.05 ± 0.475
1.657PhePhe: 1.657 ± 0.448
2.851PheGly: 2.851 ± 0.478
0.994PheHis: 0.994 ± 0.213
1.923PheIle: 1.923 ± 0.444
2.188PheLys: 2.188 ± 0.367
2.055PheLeu: 2.055 ± 0.461
0.862PheMet: 0.862 ± 0.248
2.652PheAsn: 2.652 ± 0.404
0.994PhePro: 0.994 ± 0.308
0.928PheGln: 0.928 ± 0.257
1.79PheArg: 1.79 ± 0.348
2.718PheSer: 2.718 ± 0.525
2.586PheThr: 2.586 ± 0.426
2.122PheVal: 2.122 ± 0.374
0.398PheTrp: 0.398 ± 0.152
1.392PheTyr: 1.392 ± 0.384
0.0PheXaa: 0.0 ± 0.0
Gly
3.978GlyAla: 3.978 ± 0.69
0.862GlyCys: 0.862 ± 0.285
3.249GlyAsp: 3.249 ± 0.398
4.641GlyGlu: 4.641 ± 0.768
2.785GlyPhe: 2.785 ± 0.49
4.044GlyGly: 4.044 ± 0.596
0.663GlyHis: 0.663 ± 0.204
3.845GlyIle: 3.845 ± 0.526
5.238GlyLys: 5.238 ± 0.81
6.298GlyLeu: 6.298 ± 0.568
1.591GlyMet: 1.591 ± 0.299
3.845GlyAsn: 3.845 ± 0.516
0.729GlyPro: 0.729 ± 0.207
2.32GlyGln: 2.32 ± 0.428
3.381GlyArg: 3.381 ± 0.393
4.243GlySer: 4.243 ± 0.383
3.116GlyThr: 3.116 ± 0.859
4.575GlyVal: 4.575 ± 0.508
0.994GlyTrp: 0.994 ± 0.216
3.182GlyTyr: 3.182 ± 0.481
0.0GlyXaa: 0.0 ± 0.0
His
0.796HisAla: 0.796 ± 0.225
0.265HisCys: 0.265 ± 0.176
0.663HisAsp: 0.663 ± 0.228
0.597HisGlu: 0.597 ± 0.213
1.326HisPhe: 1.326 ± 0.184
0.928HisGly: 0.928 ± 0.218
0.597HisHis: 0.597 ± 0.214
1.061HisIle: 1.061 ± 0.255
1.26HisLys: 1.26 ± 0.327
1.591HisLeu: 1.591 ± 0.351
0.265HisMet: 0.265 ± 0.151
0.928HisAsn: 0.928 ± 0.252
0.862HisPro: 0.862 ± 0.294
0.994HisGln: 0.994 ± 0.215
0.729HisArg: 0.729 ± 0.207
1.26HisSer: 1.26 ± 0.359
0.729HisThr: 0.729 ± 0.276
0.331HisVal: 0.331 ± 0.185
0.199HisTrp: 0.199 ± 0.115
0.663HisTyr: 0.663 ± 0.179
0.0HisXaa: 0.0 ± 0.0
Ile
6.298IleAla: 6.298 ± 0.681
0.398IleCys: 0.398 ± 0.163
4.243IleAsp: 4.243 ± 0.51
5.635IleGlu: 5.635 ± 0.836
1.392IlePhe: 1.392 ± 0.286
4.177IleGly: 4.177 ± 0.548
0.928IleHis: 0.928 ± 0.21
4.044IleIle: 4.044 ± 0.524
5.768IleLys: 5.768 ± 0.839
5.967IleLeu: 5.967 ± 0.93
1.127IleMet: 1.127 ± 0.303
4.177IleAsn: 4.177 ± 0.606
1.989IlePro: 1.989 ± 0.368
2.718IleGln: 2.718 ± 0.387
3.779IleArg: 3.779 ± 0.442
5.171IleSer: 5.171 ± 0.552
4.575IleThr: 4.575 ± 0.845
3.912IleVal: 3.912 ± 0.618
0.994IleTrp: 0.994 ± 0.251
2.718IleTyr: 2.718 ± 0.406
0.0IleXaa: 0.0 ± 0.0
Lys
6.961LysAla: 6.961 ± 0.769
0.862LysCys: 0.862 ± 0.382
3.182LysAsp: 3.182 ± 0.685
4.774LysGlu: 4.774 ± 0.709
1.923LysPhe: 1.923 ± 0.435
4.508LysGly: 4.508 ± 0.649
1.326LysHis: 1.326 ± 0.345
5.238LysIle: 5.238 ± 0.548
4.575LysLys: 4.575 ± 0.53
5.967LysLeu: 5.967 ± 0.493
2.387LysMet: 2.387 ± 0.409
4.111LysAsn: 4.111 ± 0.553
2.122LysPro: 2.122 ± 0.524
4.111LysGln: 4.111 ± 0.66
4.044LysArg: 4.044 ± 0.807
4.641LysSer: 4.641 ± 0.668
4.044LysThr: 4.044 ± 0.482
4.442LysVal: 4.442 ± 0.523
0.729LysTrp: 0.729 ± 0.211
2.519LysTyr: 2.519 ± 0.498
0.0LysXaa: 0.0 ± 0.0
Leu
7.426LeuAla: 7.426 ± 0.68
0.994LeuCys: 0.994 ± 0.222
4.774LeuAsp: 4.774 ± 0.607
5.768LeuGlu: 5.768 ± 0.704
2.718LeuPhe: 2.718 ± 0.384
4.774LeuGly: 4.774 ± 0.585
1.193LeuHis: 1.193 ± 0.34
5.768LeuIle: 5.768 ± 0.655
5.304LeuLys: 5.304 ± 0.878
6.033LeuLeu: 6.033 ± 0.685
1.989LeuMet: 1.989 ± 0.364
6.696LeuAsn: 6.696 ± 0.602
3.116LeuPro: 3.116 ± 0.502
2.785LeuGln: 2.785 ± 0.432
3.448LeuArg: 3.448 ± 0.498
6.431LeuSer: 6.431 ± 0.656
5.437LeuThr: 5.437 ± 0.775
4.575LeuVal: 4.575 ± 0.558
1.193LeuTrp: 1.193 ± 0.339
2.652LeuTyr: 2.652 ± 0.354
0.0LeuXaa: 0.0 ± 0.0
Met
1.657MetAla: 1.657 ± 0.36
0.331MetCys: 0.331 ± 0.143
0.53MetAsp: 0.53 ± 0.231
1.326MetGlu: 1.326 ± 0.348
0.663MetPhe: 0.663 ± 0.208
1.79MetGly: 1.79 ± 0.296
0.199MetHis: 0.199 ± 0.091
1.459MetIle: 1.459 ± 0.363
1.856MetLys: 1.856 ± 0.343
1.989MetLeu: 1.989 ± 0.305
0.199MetMet: 0.199 ± 0.119
1.127MetAsn: 1.127 ± 0.291
1.061MetPro: 1.061 ± 0.297
1.127MetGln: 1.127 ± 0.344
1.193MetArg: 1.193 ± 0.266
2.188MetSer: 2.188 ± 0.431
1.79MetThr: 1.79 ± 0.435
1.061MetVal: 1.061 ± 0.234
0.199MetTrp: 0.199 ± 0.154
0.597MetTyr: 0.597 ± 0.186
0.0MetXaa: 0.0 ± 0.0
Asn
5.834AsnAla: 5.834 ± 0.942
0.199AsnCys: 0.199 ± 0.131
2.586AsnAsp: 2.586 ± 0.489
3.845AsnGlu: 3.845 ± 0.47
1.989AsnPhe: 1.989 ± 0.371
4.972AsnGly: 4.972 ± 0.537
1.26AsnHis: 1.26 ± 0.311
3.646AsnIle: 3.646 ± 0.538
4.111AsnLys: 4.111 ± 0.503
4.774AsnLeu: 4.774 ± 0.605
1.657AsnMet: 1.657 ± 0.301
3.448AsnAsn: 3.448 ± 0.454
1.724AsnPro: 1.724 ± 0.369
3.58AsnGln: 3.58 ± 0.462
3.448AsnArg: 3.448 ± 0.388
3.646AsnSer: 3.646 ± 0.441
1.856AsnThr: 1.856 ± 0.373
3.58AsnVal: 3.58 ± 0.799
0.398AsnTrp: 0.398 ± 0.201
1.856AsnTyr: 1.856 ± 0.373
0.0AsnXaa: 0.0 ± 0.0
Pro
1.989ProAla: 1.989 ± 0.346
0.398ProCys: 0.398 ± 0.225
1.127ProAsp: 1.127 ± 0.274
2.718ProGlu: 2.718 ± 0.508
1.193ProPhe: 1.193 ± 0.296
0.796ProGly: 0.796 ± 0.183
0.53ProHis: 0.53 ± 0.224
2.519ProIle: 2.519 ± 0.437
2.718ProLys: 2.718 ± 0.533
2.387ProLeu: 2.387 ± 0.424
0.928ProMet: 0.928 ± 0.213
2.188ProAsn: 2.188 ± 0.379
1.392ProPro: 1.392 ± 0.375
1.79ProGln: 1.79 ± 0.325
0.928ProArg: 0.928 ± 0.252
2.055ProSer: 2.055 ± 0.36
1.79ProThr: 1.79 ± 0.298
2.387ProVal: 2.387 ± 0.338
0.464ProTrp: 0.464 ± 0.156
0.862ProTyr: 0.862 ± 0.276
0.0ProXaa: 0.0 ± 0.0
Gln
5.437GlnAla: 5.437 ± 0.711
0.331GlnCys: 0.331 ± 0.192
2.453GlnAsp: 2.453 ± 0.274
2.983GlnGlu: 2.983 ± 0.488
1.459GlnPhe: 1.459 ± 0.294
3.646GlnGly: 3.646 ± 0.538
0.729GlnHis: 0.729 ± 0.218
4.906GlnIle: 4.906 ± 0.623
2.586GlnLys: 2.586 ± 0.508
3.58GlnLeu: 3.58 ± 0.444
1.061GlnMet: 1.061 ± 0.395
2.586GlnAsn: 2.586 ± 0.397
1.127GlnPro: 1.127 ± 0.262
2.785GlnGln: 2.785 ± 0.347
2.851GlnArg: 2.851 ± 0.456
3.315GlnSer: 3.315 ± 0.567
2.785GlnThr: 2.785 ± 0.475
2.32GlnVal: 2.32 ± 0.32
0.729GlnTrp: 0.729 ± 0.279
1.79GlnTyr: 1.79 ± 0.291
0.0GlnXaa: 0.0 ± 0.0
Arg
3.249ArgAla: 3.249 ± 0.405
0.398ArgCys: 0.398 ± 0.161
2.785ArgAsp: 2.785 ± 0.447
3.58ArgGlu: 3.58 ± 0.391
2.387ArgPhe: 2.387 ± 0.43
2.122ArgGly: 2.122 ± 0.46
1.392ArgHis: 1.392 ± 0.295
3.713ArgIle: 3.713 ± 0.516
4.309ArgLys: 4.309 ± 0.681
5.503ArgLeu: 5.503 ± 0.655
1.193ArgMet: 1.193 ± 0.219
2.917ArgAsn: 2.917 ± 0.417
1.392ArgPro: 1.392 ± 0.31
2.122ArgGln: 2.122 ± 0.36
2.188ArgArg: 2.188 ± 0.349
1.989ArgSer: 1.989 ± 0.377
3.249ArgThr: 3.249 ± 0.609
1.989ArgVal: 1.989 ± 0.401
0.597ArgTrp: 0.597 ± 0.194
1.724ArgTyr: 1.724 ± 0.296
0.0ArgXaa: 0.0 ± 0.0
Ser
6.033SerAla: 6.033 ± 0.885
0.796SerCys: 0.796 ± 0.301
3.646SerAsp: 3.646 ± 0.519
4.707SerGlu: 4.707 ± 0.489
1.724SerPhe: 1.724 ± 0.357
4.575SerGly: 4.575 ± 0.464
1.127SerHis: 1.127 ± 0.267
4.508SerIle: 4.508 ± 0.701
3.978SerLys: 3.978 ± 0.427
4.707SerLeu: 4.707 ± 0.445
1.127SerMet: 1.127 ± 0.285
3.315SerAsn: 3.315 ± 0.568
1.657SerPro: 1.657 ± 0.363
3.978SerGln: 3.978 ± 0.619
3.116SerArg: 3.116 ± 0.369
4.177SerSer: 4.177 ± 0.614
3.448SerThr: 3.448 ± 0.506
4.508SerVal: 4.508 ± 0.567
0.53SerTrp: 0.53 ± 0.209
2.652SerTyr: 2.652 ± 0.483
0.0SerXaa: 0.0 ± 0.0
Thr
5.702ThrAla: 5.702 ± 1.401
0.398ThrCys: 0.398 ± 0.168
3.514ThrAsp: 3.514 ± 0.453
4.508ThrGlu: 4.508 ± 0.795
2.519ThrPhe: 2.519 ± 0.35
4.243ThrGly: 4.243 ± 0.925
1.061ThrHis: 1.061 ± 0.233
4.376ThrIle: 4.376 ± 0.608
4.575ThrLys: 4.575 ± 0.465
4.84ThrLeu: 4.84 ± 0.481
0.862ThrMet: 0.862 ± 0.26
3.05ThrAsn: 3.05 ± 0.483
2.188ThrPro: 2.188 ± 0.335
3.646ThrGln: 3.646 ± 0.657
1.856ThrArg: 1.856 ± 0.455
2.851ThrSer: 2.851 ± 0.516
2.718ThrThr: 2.718 ± 0.488
3.381ThrVal: 3.381 ± 0.483
0.663ThrTrp: 0.663 ± 0.213
1.856ThrTyr: 1.856 ± 0.624
0.0ThrXaa: 0.0 ± 0.0
Val
4.575ValAla: 4.575 ± 0.733
0.796ValCys: 0.796 ± 0.265
2.718ValAsp: 2.718 ± 0.455
4.707ValGlu: 4.707 ± 0.642
2.586ValPhe: 2.586 ± 0.419
3.713ValGly: 3.713 ± 0.428
0.663ValHis: 0.663 ± 0.259
4.641ValIle: 4.641 ± 0.598
4.84ValLys: 4.84 ± 0.622
3.779ValLeu: 3.779 ± 0.55
1.061ValMet: 1.061 ± 0.226
3.646ValAsn: 3.646 ± 0.607
1.79ValPro: 1.79 ± 0.327
2.586ValGln: 2.586 ± 0.369
2.718ValArg: 2.718 ± 0.36
4.575ValSer: 4.575 ± 0.611
4.177ValThr: 4.177 ± 0.92
4.177ValVal: 4.177 ± 0.423
0.796ValTrp: 0.796 ± 0.264
1.392ValTyr: 1.392 ± 0.288
0.0ValXaa: 0.0 ± 0.0
Trp
0.862TrpAla: 0.862 ± 0.21
0.0TrpCys: 0.0 ± 0.0
0.597TrpAsp: 0.597 ± 0.16
0.862TrpGlu: 0.862 ± 0.224
0.796TrpPhe: 0.796 ± 0.248
0.663TrpGly: 0.663 ± 0.212
0.265TrpHis: 0.265 ± 0.146
0.862TrpIle: 0.862 ± 0.273
0.729TrpLys: 0.729 ± 0.244
2.055TrpLeu: 2.055 ± 0.355
0.133TrpMet: 0.133 ± 0.094
0.53TrpAsn: 0.53 ± 0.169
0.066TrpPro: 0.066 ± 0.08
0.994TrpGln: 0.994 ± 0.228
0.862TrpArg: 0.862 ± 0.226
0.729TrpSer: 0.729 ± 0.191
0.994TrpThr: 0.994 ± 0.419
1.127TrpVal: 1.127 ± 0.318
0.265TrpTrp: 0.265 ± 0.163
0.133TrpTyr: 0.133 ± 0.1
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.657TyrAla: 1.657 ± 0.324
0.53TyrCys: 0.53 ± 0.137
1.856TyrAsp: 1.856 ± 0.411
1.989TyrGlu: 1.989 ± 0.429
1.989TyrPhe: 1.989 ± 0.404
2.917TyrGly: 2.917 ± 0.39
0.398TyrHis: 0.398 ± 0.153
2.122TyrIle: 2.122 ± 0.39
2.387TyrLys: 2.387 ± 0.427
3.05TyrLeu: 3.05 ± 0.424
0.331TyrMet: 0.331 ± 0.138
1.127TyrAsn: 1.127 ± 0.321
1.459TyrPro: 1.459 ± 0.357
2.188TyrGln: 2.188 ± 0.359
1.989TyrArg: 1.989 ± 0.404
2.718TyrSer: 2.718 ± 0.475
2.055TyrThr: 2.055 ± 0.429
1.591TyrVal: 1.591 ± 0.289
0.53TyrTrp: 0.53 ± 0.169
0.928TyrTyr: 0.928 ± 0.248
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 79 proteins (15084 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski