Amino acid dipepetide frequency for Shewanella phage X14

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.403AlaAla: 7.403 ± 1.078
0.457AlaCys: 0.457 ± 0.31
5.392AlaAsp: 5.392 ± 0.771
9.413AlaGlu: 9.413 ± 1.339
2.193AlaPhe: 2.193 ± 0.431
5.666AlaGly: 5.666 ± 0.766
1.462AlaHis: 1.462 ± 0.243
5.209AlaIle: 5.209 ± 0.677
7.585AlaLys: 7.585 ± 0.83
6.58AlaLeu: 6.58 ± 0.698
3.199AlaMet: 3.199 ± 0.534
4.387AlaAsn: 4.387 ± 0.385
3.564AlaPro: 3.564 ± 0.894
4.387AlaGln: 4.387 ± 0.786
3.473AlaArg: 3.473 ± 0.569
4.661AlaSer: 4.661 ± 0.986
4.935AlaThr: 4.935 ± 0.721
5.849AlaVal: 5.849 ± 0.692
1.188AlaTrp: 1.188 ± 0.377
2.559AlaTyr: 2.559 ± 0.481
0.0AlaXaa: 0.0 ± 0.0
Cys
0.64CysAla: 0.64 ± 0.279
0.274CysCys: 0.274 ± 0.186
0.823CysAsp: 0.823 ± 0.312
0.366CysGlu: 0.366 ± 0.209
0.457CysPhe: 0.457 ± 0.249
0.548CysGly: 0.548 ± 0.277
0.091CysHis: 0.091 ± 0.107
0.823CysIle: 0.823 ± 0.432
0.457CysLys: 0.457 ± 0.287
0.731CysLeu: 0.731 ± 0.32
0.091CysMet: 0.091 ± 0.092
0.366CysAsn: 0.366 ± 0.249
0.64CysPro: 0.64 ± 0.304
0.548CysGln: 0.548 ± 0.257
0.183CysArg: 0.183 ± 0.129
0.457CysSer: 0.457 ± 0.224
0.548CysThr: 0.548 ± 0.254
0.274CysVal: 0.274 ± 0.242
0.0CysTrp: 0.0 ± 0.0
0.731CysTyr: 0.731 ± 0.296
0.0CysXaa: 0.0 ± 0.0
Asp
5.118AspAla: 5.118 ± 0.469
0.274AspCys: 0.274 ± 0.184
2.742AspAsp: 2.742 ± 0.499
4.57AspGlu: 4.57 ± 0.547
2.468AspPhe: 2.468 ± 0.475
4.478AspGly: 4.478 ± 0.642
1.462AspHis: 1.462 ± 0.277
3.564AspIle: 3.564 ± 0.691
3.747AspLys: 3.747 ± 0.624
5.758AspLeu: 5.758 ± 0.663
1.736AspMet: 1.736 ± 0.405
2.193AspAsn: 2.193 ± 0.614
2.559AspPro: 2.559 ± 0.566
3.473AspGln: 3.473 ± 0.496
2.285AspArg: 2.285 ± 0.361
4.387AspSer: 4.387 ± 0.565
2.285AspThr: 2.285 ± 0.523
4.113AspVal: 4.113 ± 0.711
0.457AspTrp: 0.457 ± 0.187
2.376AspTyr: 2.376 ± 0.461
0.0AspXaa: 0.0 ± 0.0
Glu
8.956GluAla: 8.956 ± 1.075
0.731GluCys: 0.731 ± 0.35
4.021GluAsp: 4.021 ± 0.581
6.58GluGlu: 6.58 ± 1.644
3.564GluPhe: 3.564 ± 0.472
5.209GluGly: 5.209 ± 0.828
1.462GluHis: 1.462 ± 0.446
5.027GluIle: 5.027 ± 0.591
6.854GluLys: 6.854 ± 0.725
7.128GluLeu: 7.128 ± 0.835
1.919GluMet: 1.919 ± 0.426
3.107GluAsn: 3.107 ± 0.498
3.656GluPro: 3.656 ± 0.708
6.763GluGln: 6.763 ± 1.036
5.301GluArg: 5.301 ± 0.772
3.656GluSer: 3.656 ± 0.566
3.93GluThr: 3.93 ± 0.653
4.57GluVal: 4.57 ± 0.562
1.097GluTrp: 1.097 ± 0.202
3.29GluTyr: 3.29 ± 0.471
0.0GluXaa: 0.0 ± 0.0
Phe
2.559PheAla: 2.559 ± 0.387
0.457PheCys: 0.457 ± 0.273
2.102PheAsp: 2.102 ± 0.549
2.65PheGlu: 2.65 ± 0.494
1.736PhePhe: 1.736 ± 0.429
2.559PheGly: 2.559 ± 0.573
0.914PheHis: 0.914 ± 0.329
1.919PheIle: 1.919 ± 0.391
3.656PheLys: 3.656 ± 0.623
2.468PheLeu: 2.468 ± 0.418
0.64PheMet: 0.64 ± 0.255
2.742PheAsn: 2.742 ± 0.546
1.279PhePro: 1.279 ± 0.294
1.645PheGln: 1.645 ± 0.326
1.645PheArg: 1.645 ± 0.46
1.736PheSer: 1.736 ± 0.476
2.468PheThr: 2.468 ± 0.367
2.011PheVal: 2.011 ± 0.398
0.274PheTrp: 0.274 ± 0.133
1.462PheTyr: 1.462 ± 0.41
0.0PheXaa: 0.0 ± 0.0
Gly
5.666GlyAla: 5.666 ± 1.001
0.366GlyCys: 0.366 ± 0.202
4.295GlyAsp: 4.295 ± 0.459
4.752GlyGlu: 4.752 ± 0.54
2.65GlyPhe: 2.65 ± 0.516
5.666GlyGly: 5.666 ± 0.915
1.371GlyHis: 1.371 ± 0.294
4.387GlyIle: 4.387 ± 0.704
6.946GlyLys: 6.946 ± 1.145
4.387GlyLeu: 4.387 ± 0.713
1.919GlyMet: 1.919 ± 0.361
2.376GlyAsn: 2.376 ± 0.49
1.279GlyPro: 1.279 ± 0.329
2.742GlyGln: 2.742 ± 0.601
3.381GlyArg: 3.381 ± 0.804
3.564GlySer: 3.564 ± 0.63
3.29GlyThr: 3.29 ± 0.549
4.295GlyVal: 4.295 ± 0.659
1.097GlyTrp: 1.097 ± 0.268
3.016GlyTyr: 3.016 ± 0.38
0.0GlyXaa: 0.0 ± 0.0
His
1.097HisAla: 1.097 ± 0.326
0.274HisCys: 0.274 ± 0.18
1.097HisAsp: 1.097 ± 0.422
1.097HisGlu: 1.097 ± 0.275
0.548HisPhe: 0.548 ± 0.265
1.097HisGly: 1.097 ± 0.354
0.366HisHis: 0.366 ± 0.199
0.823HisIle: 0.823 ± 0.246
1.188HisLys: 1.188 ± 0.366
1.279HisLeu: 1.279 ± 0.302
0.366HisMet: 0.366 ± 0.154
0.548HisAsn: 0.548 ± 0.181
0.274HisPro: 0.274 ± 0.27
0.548HisGln: 0.548 ± 0.286
1.188HisArg: 1.188 ± 0.258
1.371HisSer: 1.371 ± 0.53
0.64HisThr: 0.64 ± 0.2
0.64HisVal: 0.64 ± 0.212
0.366HisTrp: 0.366 ± 0.187
0.731HisTyr: 0.731 ± 0.261
0.0HisXaa: 0.0 ± 0.0
Ile
6.58IleAla: 6.58 ± 1.022
0.548IleCys: 0.548 ± 0.279
3.656IleAsp: 3.656 ± 0.474
5.666IleGlu: 5.666 ± 0.697
1.919IlePhe: 1.919 ± 0.443
2.559IleGly: 2.559 ± 0.444
0.457IleHis: 0.457 ± 0.216
2.65IleIle: 2.65 ± 0.697
5.027IleLys: 5.027 ± 0.758
2.468IleLeu: 2.468 ± 0.456
1.279IleMet: 1.279 ± 0.253
2.559IleAsn: 2.559 ± 0.509
2.742IlePro: 2.742 ± 0.584
2.193IleGln: 2.193 ± 0.329
2.376IleArg: 2.376 ± 0.327
3.656IleSer: 3.656 ± 0.546
3.747IleThr: 3.747 ± 0.498
2.742IleVal: 2.742 ± 0.706
0.457IleTrp: 0.457 ± 0.181
1.005IleTyr: 1.005 ± 0.383
0.0IleXaa: 0.0 ± 0.0
Lys
8.317LysAla: 8.317 ± 0.932
0.64LysCys: 0.64 ± 0.37
4.113LysAsp: 4.113 ± 0.702
7.951LysGlu: 7.951 ± 0.735
3.016LysPhe: 3.016 ± 0.407
5.118LysGly: 5.118 ± 0.776
1.371LysHis: 1.371 ± 0.319
4.752LysIle: 4.752 ± 0.716
6.854LysLys: 6.854 ± 1.067
6.672LysLeu: 6.672 ± 0.703
1.736LysMet: 1.736 ± 0.37
3.381LysAsn: 3.381 ± 0.555
3.564LysPro: 3.564 ± 0.915
5.027LysGln: 5.027 ± 1.024
4.021LysArg: 4.021 ± 0.628
4.113LysSer: 4.113 ± 0.5
5.483LysThr: 5.483 ± 0.75
3.473LysVal: 3.473 ± 0.469
0.823LysTrp: 0.823 ± 0.322
1.828LysTyr: 1.828 ± 0.33
0.0LysXaa: 0.0 ± 0.0
Leu
6.672LeuAla: 6.672 ± 0.708
0.823LeuCys: 0.823 ± 0.348
4.295LeuAsp: 4.295 ± 0.546
6.854LeuGlu: 6.854 ± 0.93
2.468LeuPhe: 2.468 ± 0.612
4.57LeuGly: 4.57 ± 0.597
1.005LeuHis: 1.005 ± 0.253
3.93LeuIle: 3.93 ± 0.747
5.209LeuLys: 5.209 ± 0.593
5.392LeuLeu: 5.392 ± 0.567
2.925LeuMet: 2.925 ± 0.564
4.021LeuAsn: 4.021 ± 0.559
3.199LeuPro: 3.199 ± 0.547
2.742LeuGln: 2.742 ± 0.374
2.833LeuArg: 2.833 ± 0.469
4.57LeuSer: 4.57 ± 0.698
4.661LeuThr: 4.661 ± 0.682
5.301LeuVal: 5.301 ± 0.716
1.462LeuTrp: 1.462 ± 0.42
2.011LeuTyr: 2.011 ± 0.336
0.0LeuXaa: 0.0 ± 0.0
Met
3.473MetAla: 3.473 ± 0.435
0.091MetCys: 0.091 ± 0.092
2.193MetAsp: 2.193 ± 0.32
1.371MetGlu: 1.371 ± 0.256
0.731MetPhe: 0.731 ± 0.255
1.736MetGly: 1.736 ± 0.326
0.366MetHis: 0.366 ± 0.173
1.097MetIle: 1.097 ± 0.287
2.468MetLys: 2.468 ± 0.542
1.462MetLeu: 1.462 ± 0.298
0.366MetMet: 0.366 ± 0.203
1.736MetAsn: 1.736 ± 0.4
0.823MetPro: 0.823 ± 0.293
1.188MetGln: 1.188 ± 0.308
1.188MetArg: 1.188 ± 0.333
2.102MetSer: 2.102 ± 0.386
1.554MetThr: 1.554 ± 0.359
2.011MetVal: 2.011 ± 0.416
0.274MetTrp: 0.274 ± 0.171
1.645MetTyr: 1.645 ± 0.369
0.0MetXaa: 0.0 ± 0.0
Asn
3.564AsnAla: 3.564 ± 0.588
0.731AsnCys: 0.731 ± 0.254
2.65AsnAsp: 2.65 ± 0.444
3.107AsnGlu: 3.107 ± 0.428
1.097AsnPhe: 1.097 ± 0.333
2.925AsnGly: 2.925 ± 0.715
0.823AsnHis: 0.823 ± 0.321
3.29AsnIle: 3.29 ± 0.682
3.656AsnLys: 3.656 ± 0.554
3.016AsnLeu: 3.016 ± 0.554
1.188AsnMet: 1.188 ± 0.243
3.199AsnAsn: 3.199 ± 0.711
2.193AsnPro: 2.193 ± 0.405
3.199AsnGln: 3.199 ± 0.754
2.102AsnArg: 2.102 ± 0.38
1.736AsnSer: 1.736 ± 0.472
2.102AsnThr: 2.102 ± 0.564
2.102AsnVal: 2.102 ± 0.39
1.097AsnTrp: 1.097 ± 0.352
1.279AsnTyr: 1.279 ± 0.331
0.0AsnXaa: 0.0 ± 0.0
Pro
3.381ProAla: 3.381 ± 0.461
0.457ProCys: 0.457 ± 0.373
2.376ProAsp: 2.376 ± 0.35
5.483ProGlu: 5.483 ± 1.178
1.462ProPhe: 1.462 ± 0.395
2.285ProGly: 2.285 ± 0.351
0.457ProHis: 0.457 ± 0.258
2.193ProIle: 2.193 ± 0.358
3.473ProLys: 3.473 ± 0.561
2.468ProLeu: 2.468 ± 0.449
1.005ProMet: 1.005 ± 0.236
1.005ProAsn: 1.005 ± 0.28
0.914ProPro: 0.914 ± 0.267
1.828ProGln: 1.828 ± 0.435
1.645ProArg: 1.645 ± 0.344
1.279ProSer: 1.279 ± 0.331
1.828ProThr: 1.828 ± 0.277
3.747ProVal: 3.747 ± 0.556
0.366ProTrp: 0.366 ± 0.228
0.64ProTyr: 0.64 ± 0.222
0.0ProXaa: 0.0 ± 0.0
Gln
6.123GlnAla: 6.123 ± 0.941
0.091GlnCys: 0.091 ± 0.09
2.65GlnAsp: 2.65 ± 0.52
4.935GlnGlu: 4.935 ± 0.721
2.193GlnPhe: 2.193 ± 0.526
2.65GlnGly: 2.65 ± 0.455
0.64GlnHis: 0.64 ± 0.289
1.371GlnIle: 1.371 ± 0.295
2.833GlnLys: 2.833 ± 0.605
5.118GlnLeu: 5.118 ± 0.712
2.193GlnMet: 2.193 ± 0.338
2.468GlnAsn: 2.468 ± 0.539
1.645GlnPro: 1.645 ± 0.267
4.57GlnGln: 4.57 ± 0.857
3.199GlnArg: 3.199 ± 0.585
3.016GlnSer: 3.016 ± 0.549
1.919GlnThr: 1.919 ± 0.392
2.65GlnVal: 2.65 ± 0.412
0.457GlnTrp: 0.457 ± 0.279
1.554GlnTyr: 1.554 ± 0.346
0.0GlnXaa: 0.0 ± 0.0
Arg
3.838ArgAla: 3.838 ± 0.373
0.64ArgCys: 0.64 ± 0.383
2.742ArgAsp: 2.742 ± 0.475
4.935ArgGlu: 4.935 ± 0.757
1.554ArgPhe: 1.554 ± 0.442
2.102ArgGly: 2.102 ± 0.548
0.823ArgHis: 0.823 ± 0.299
3.381ArgIle: 3.381 ± 0.486
4.204ArgLys: 4.204 ± 0.586
3.381ArgLeu: 3.381 ± 0.561
1.462ArgMet: 1.462 ± 0.32
1.554ArgAsn: 1.554 ± 0.333
1.919ArgPro: 1.919 ± 0.316
2.011ArgGln: 2.011 ± 0.405
2.376ArgArg: 2.376 ± 0.497
2.833ArgSer: 2.833 ± 0.632
1.919ArgThr: 1.919 ± 0.351
2.925ArgVal: 2.925 ± 0.406
0.091ArgTrp: 0.091 ± 0.099
1.371ArgTyr: 1.371 ± 0.323
0.0ArgXaa: 0.0 ± 0.0
Ser
4.387SerAla: 4.387 ± 0.764
0.457SerCys: 0.457 ± 0.23
4.387SerAsp: 4.387 ± 0.589
5.666SerGlu: 5.666 ± 0.822
2.833SerPhe: 2.833 ± 0.429
5.301SerGly: 5.301 ± 0.734
0.366SerHis: 0.366 ± 0.149
2.376SerIle: 2.376 ± 0.524
5.118SerLys: 5.118 ± 0.813
4.478SerLeu: 4.478 ± 0.53
1.554SerMet: 1.554 ± 0.316
2.011SerAsn: 2.011 ± 0.539
1.645SerPro: 1.645 ± 0.404
2.468SerGln: 2.468 ± 0.756
2.193SerArg: 2.193 ± 0.564
2.559SerSer: 2.559 ± 0.442
2.468SerThr: 2.468 ± 0.376
3.838SerVal: 3.838 ± 0.594
0.366SerTrp: 0.366 ± 0.148
2.011SerTyr: 2.011 ± 0.388
0.0SerXaa: 0.0 ± 0.0
Thr
3.838ThrAla: 3.838 ± 0.449
0.457ThrCys: 0.457 ± 0.194
3.656ThrAsp: 3.656 ± 0.588
3.93ThrGlu: 3.93 ± 0.536
2.193ThrPhe: 2.193 ± 0.374
5.94ThrGly: 5.94 ± 0.731
0.731ThrHis: 0.731 ± 0.207
2.742ThrIle: 2.742 ± 0.475
3.381ThrLys: 3.381 ± 0.486
4.204ThrLeu: 4.204 ± 0.578
1.279ThrMet: 1.279 ± 0.234
2.65ThrAsn: 2.65 ± 0.522
2.468ThrPro: 2.468 ± 0.34
1.736ThrGln: 1.736 ± 0.343
1.736ThrArg: 1.736 ± 0.405
2.925ThrSer: 2.925 ± 0.531
2.559ThrThr: 2.559 ± 0.736
3.381ThrVal: 3.381 ± 0.628
0.457ThrTrp: 0.457 ± 0.242
2.011ThrTyr: 2.011 ± 0.634
0.0ThrXaa: 0.0 ± 0.0
Val
4.844ValAla: 4.844 ± 0.684
0.366ValCys: 0.366 ± 0.206
3.199ValAsp: 3.199 ± 0.501
4.113ValGlu: 4.113 ± 0.462
2.193ValPhe: 2.193 ± 0.38
4.387ValGly: 4.387 ± 0.673
0.274ValHis: 0.274 ± 0.173
2.559ValIle: 2.559 ± 0.568
6.58ValLys: 6.58 ± 1.346
4.204ValLeu: 4.204 ± 0.558
2.102ValMet: 2.102 ± 0.363
2.925ValAsn: 2.925 ± 0.677
2.376ValPro: 2.376 ± 0.451
2.376ValGln: 2.376 ± 0.384
2.742ValArg: 2.742 ± 0.637
4.113ValSer: 4.113 ± 0.511
3.564ValThr: 3.564 ± 0.45
4.57ValVal: 4.57 ± 0.618
1.005ValTrp: 1.005 ± 0.238
1.828ValTyr: 1.828 ± 0.475
0.0ValXaa: 0.0 ± 0.0
Trp
0.914TrpAla: 0.914 ± 0.248
0.366TrpCys: 0.366 ± 0.201
0.823TrpAsp: 0.823 ± 0.261
0.731TrpGlu: 0.731 ± 0.254
0.548TrpPhe: 0.548 ± 0.28
0.548TrpGly: 0.548 ± 0.195
0.457TrpHis: 0.457 ± 0.184
0.457TrpIle: 0.457 ± 0.215
0.914TrpLys: 0.914 ± 0.242
1.462TrpLeu: 1.462 ± 0.454
0.274TrpMet: 0.274 ± 0.156
0.64TrpAsn: 0.64 ± 0.25
0.548TrpPro: 0.548 ± 0.282
0.548TrpGln: 0.548 ± 0.194
0.64TrpArg: 0.64 ± 0.278
0.64TrpSer: 0.64 ± 0.307
0.731TrpThr: 0.731 ± 0.244
0.457TrpVal: 0.457 ± 0.166
0.183TrpTrp: 0.183 ± 0.096
0.183TrpTyr: 0.183 ± 0.144
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.193TyrAla: 2.193 ± 0.396
0.548TyrCys: 0.548 ± 0.261
2.833TyrAsp: 2.833 ± 0.372
2.65TyrGlu: 2.65 ± 0.389
1.188TyrPhe: 1.188 ± 0.344
2.102TyrGly: 2.102 ± 0.43
0.64TyrHis: 0.64 ± 0.279
1.736TyrIle: 1.736 ± 0.38
2.102TyrLys: 2.102 ± 0.432
2.285TyrLeu: 2.285 ± 0.465
0.548TyrMet: 0.548 ± 0.268
1.188TyrAsn: 1.188 ± 0.31
1.005TyrPro: 1.005 ± 0.306
2.285TyrGln: 2.285 ± 0.643
1.554TyrArg: 1.554 ± 0.418
3.107TyrSer: 3.107 ± 0.478
1.645TyrThr: 1.645 ± 0.459
1.371TyrVal: 1.371 ± 0.304
0.548TyrTrp: 0.548 ± 0.293
0.914TyrTyr: 0.914 ± 0.279
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 39 proteins (10943 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski