Amino acid dipepetide frequency for Cyanobacteria bacterium PMG_004

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.369AlaAla: 3.369 ± 0.453
0.632AlaCys: 0.632 ± 0.186
2.316AlaAsp: 2.316 ± 0.278
2.913AlaGlu: 2.913 ± 0.31
3.053AlaPhe: 3.053 ± 0.342
3.299AlaGly: 3.299 ± 0.453
0.983AlaHis: 0.983 ± 0.187
2.667AlaIle: 2.667 ± 0.317
3.088AlaLys: 3.088 ± 0.307
6.387AlaLeu: 6.387 ± 0.585
1.263AlaMet: 1.263 ± 0.238
2.878AlaAsn: 2.878 ± 0.334
2.457AlaPro: 2.457 ± 0.293
2.527AlaGln: 2.527 ± 0.348
2.913AlaArg: 2.913 ± 0.361
4.457AlaSer: 4.457 ± 0.363
2.632AlaThr: 2.632 ± 0.383
3.755AlaVal: 3.755 ± 0.49
0.632AlaTrp: 0.632 ± 0.169
1.895AlaTyr: 1.895 ± 0.245
0.0AlaXaa: 0.0 ± 0.0
Cys
0.597CysAla: 0.597 ± 0.136
0.281CysCys: 0.281 ± 0.114
0.491CysAsp: 0.491 ± 0.148
0.807CysGlu: 0.807 ± 0.157
1.369CysPhe: 1.369 ± 0.217
0.597CysGly: 0.597 ± 0.151
0.175CysHis: 0.175 ± 0.073
0.772CysIle: 0.772 ± 0.18
0.421CysLys: 0.421 ± 0.125
1.509CysLeu: 1.509 ± 0.194
0.281CysMet: 0.281 ± 0.101
0.632CysAsn: 0.632 ± 0.157
0.456CysPro: 0.456 ± 0.124
1.193CysGln: 1.193 ± 0.249
0.386CysArg: 0.386 ± 0.105
0.702CysSer: 0.702 ± 0.147
0.351CysThr: 0.351 ± 0.119
0.772CysVal: 0.772 ± 0.175
0.105CysTrp: 0.105 ± 0.062
0.737CysTyr: 0.737 ± 0.176
0.0CysXaa: 0.0 ± 0.0
Asp
1.825AspAla: 1.825 ± 0.271
0.351AspCys: 0.351 ± 0.095
1.299AspAsp: 1.299 ± 0.275
1.474AspGlu: 1.474 ± 0.257
2.422AspPhe: 2.422 ± 0.294
1.72AspGly: 1.72 ± 0.249
0.667AspHis: 0.667 ± 0.159
3.65AspIle: 3.65 ± 0.432
3.264AspLys: 3.264 ± 0.405
5.58AspLeu: 5.58 ± 0.423
0.667AspMet: 0.667 ± 0.168
1.649AspAsn: 1.649 ± 0.237
1.86AspPro: 1.86 ± 0.327
1.86AspGln: 1.86 ± 0.25
2.141AspArg: 2.141 ± 0.299
2.176AspSer: 2.176 ± 0.259
2.071AspThr: 2.071 ± 0.305
4.843AspVal: 4.843 ± 0.633
0.316AspTrp: 0.316 ± 0.107
1.72AspTyr: 1.72 ± 0.285
0.0AspXaa: 0.0 ± 0.0
Glu
3.615GluAla: 3.615 ± 0.371
0.456GluCys: 0.456 ± 0.15
2.0GluAsp: 2.0 ± 0.306
3.123GluGlu: 3.123 ± 0.397
1.755GluPhe: 1.755 ± 0.269
3.58GluGly: 3.58 ± 0.331
0.386GluHis: 0.386 ± 0.114
4.141GluIle: 4.141 ± 0.313
5.194GluLys: 5.194 ± 0.542
4.352GluLeu: 4.352 ± 0.384
2.211GluMet: 2.211 ± 0.28
3.159GluAsn: 3.159 ± 0.311
1.685GluPro: 1.685 ± 0.266
1.93GluGln: 1.93 ± 0.236
2.316GluArg: 2.316 ± 0.269
3.474GluSer: 3.474 ± 0.453
3.299GluThr: 3.299 ± 0.302
2.141GluVal: 2.141 ± 0.258
0.632GluTrp: 0.632 ± 0.164
1.755GluTyr: 1.755 ± 0.206
0.0GluXaa: 0.0 ± 0.0
Phe
3.825PheAla: 3.825 ± 0.391
0.877PheCys: 0.877 ± 0.189
2.632PheAsp: 2.632 ± 0.304
2.562PheGlu: 2.562 ± 0.341
5.124PhePhe: 5.124 ± 0.518
3.896PheGly: 3.896 ± 0.44
0.948PheHis: 0.948 ± 0.176
3.299PheIle: 3.299 ± 0.468
2.667PheLys: 2.667 ± 0.276
8.704PheLeu: 8.704 ± 0.757
1.439PheMet: 1.439 ± 0.227
2.878PheAsn: 2.878 ± 0.409
2.036PhePro: 2.036 ± 0.293
3.088PheGln: 3.088 ± 0.374
2.211PheArg: 2.211 ± 0.307
4.878PheSer: 4.878 ± 0.735
3.229PheThr: 3.229 ± 0.408
5.264PheVal: 5.264 ± 0.435
0.562PheTrp: 0.562 ± 0.144
2.597PheTyr: 2.597 ± 0.321
0.0PheXaa: 0.0 ± 0.0
Gly
3.334GlyAla: 3.334 ± 0.377
0.912GlyCys: 0.912 ± 0.188
3.474GlyAsp: 3.474 ± 0.448
2.597GlyGlu: 2.597 ± 0.331
3.966GlyPhe: 3.966 ± 0.377
3.65GlyGly: 3.65 ± 0.509
0.912GlyHis: 0.912 ± 0.199
5.124GlyIle: 5.124 ± 0.485
3.966GlyLys: 3.966 ± 0.395
6.668GlyLeu: 6.668 ± 0.679
1.72GlyMet: 1.72 ± 0.252
2.281GlyAsn: 2.281 ± 0.282
1.79GlyPro: 1.79 ± 0.352
2.492GlyGln: 2.492 ± 0.311
2.773GlyArg: 2.773 ± 0.442
3.053GlySer: 3.053 ± 0.3
2.527GlyThr: 2.527 ± 0.282
4.738GlyVal: 4.738 ± 0.396
1.088GlyTrp: 1.088 ± 0.198
1.579GlyTyr: 1.579 ± 0.25
0.0GlyXaa: 0.0 ± 0.0
His
0.737HisAla: 0.737 ± 0.177
0.211HisCys: 0.211 ± 0.087
0.597HisAsp: 0.597 ± 0.154
0.526HisGlu: 0.526 ± 0.141
1.299HisPhe: 1.299 ± 0.245
1.158HisGly: 1.158 ± 0.215
0.246HisHis: 0.246 ± 0.082
1.053HisIle: 1.053 ± 0.214
0.737HisLys: 0.737 ± 0.209
1.895HisLeu: 1.895 ± 0.322
0.386HisMet: 0.386 ± 0.112
0.702HisAsn: 0.702 ± 0.151
0.948HisPro: 0.948 ± 0.166
0.421HisGln: 0.421 ± 0.093
0.842HisArg: 0.842 ± 0.177
0.983HisSer: 0.983 ± 0.196
0.948HisThr: 0.948 ± 0.185
0.842HisVal: 0.842 ± 0.229
0.175HisTrp: 0.175 ± 0.085
0.526HisTyr: 0.526 ± 0.155
0.0HisXaa: 0.0 ± 0.0
Ile
4.211IleAla: 4.211 ± 0.363
1.685IleCys: 1.685 ± 0.307
3.053IleAsp: 3.053 ± 0.338
4.036IleGlu: 4.036 ± 0.437
4.562IlePhe: 4.562 ± 0.56
3.65IleGly: 3.65 ± 0.39
1.123IleHis: 1.123 ± 0.217
3.825IleIle: 3.825 ± 0.489
4.984IleLys: 4.984 ± 0.475
7.405IleLeu: 7.405 ± 0.678
1.193IleMet: 1.193 ± 0.223
3.123IleAsn: 3.123 ± 0.357
3.439IlePro: 3.439 ± 0.306
2.281IleGln: 2.281 ± 0.271
3.018IleArg: 3.018 ± 0.324
6.984IleSer: 6.984 ± 0.597
4.703IleThr: 4.703 ± 0.376
3.369IleVal: 3.369 ± 0.36
0.667IleTrp: 0.667 ± 0.203
2.351IleTyr: 2.351 ± 0.283
0.0IleXaa: 0.0 ± 0.0
Lys
4.036LysAla: 4.036 ± 0.395
0.632LysCys: 0.632 ± 0.153
3.51LysAsp: 3.51 ± 0.414
5.966LysGlu: 5.966 ± 0.987
2.457LysPhe: 2.457 ± 0.255
4.913LysGly: 4.913 ± 0.422
1.053LysHis: 1.053 ± 0.174
6.142LysIle: 6.142 ± 0.49
8.668LysLys: 8.668 ± 1.011
5.791LysLeu: 5.791 ± 0.551
1.86LysMet: 1.86 ± 0.262
6.247LysAsn: 6.247 ± 0.535
2.667LysPro: 2.667 ± 0.399
4.247LysGln: 4.247 ± 0.484
3.053LysArg: 3.053 ± 0.339
5.756LysSer: 5.756 ± 0.515
5.089LysThr: 5.089 ± 0.582
4.703LysVal: 4.703 ± 0.409
0.491LysTrp: 0.491 ± 0.12
1.825LysTyr: 1.825 ± 0.235
0.0LysXaa: 0.0 ± 0.0
Leu
5.931LeuAla: 5.931 ± 0.524
1.158LeuCys: 1.158 ± 0.202
4.247LeuAsp: 4.247 ± 0.37
6.493LeuGlu: 6.493 ± 0.499
6.387LeuPhe: 6.387 ± 0.736
6.949LeuGly: 6.949 ± 0.648
1.614LeuHis: 1.614 ± 0.253
7.23LeuIle: 7.23 ± 0.685
6.493LeuLys: 6.493 ± 0.486
13.652LeuLeu: 13.652 ± 0.875
2.211LeuMet: 2.211 ± 0.354
4.948LeuAsn: 4.948 ± 0.403
5.089LeuPro: 5.089 ± 0.588
3.86LeuGln: 3.86 ± 0.358
4.562LeuArg: 4.562 ± 0.39
8.142LeuSer: 8.142 ± 0.529
9.792LeuThr: 9.792 ± 1.0
6.633LeuVal: 6.633 ± 0.521
1.193LeuTrp: 1.193 ± 0.208
2.983LeuTyr: 2.983 ± 0.354
0.0LeuXaa: 0.0 ± 0.0
Met
1.755MetAla: 1.755 ± 0.31
0.105MetCys: 0.105 ± 0.061
0.526MetAsp: 0.526 ± 0.132
0.983MetGlu: 0.983 ± 0.242
1.018MetPhe: 1.018 ± 0.207
1.299MetGly: 1.299 ± 0.212
0.421MetHis: 0.421 ± 0.153
1.579MetIle: 1.579 ± 0.282
1.299MetLys: 1.299 ± 0.2
3.194MetLeu: 3.194 ± 0.349
0.491MetMet: 0.491 ± 0.121
0.702MetAsn: 0.702 ± 0.165
1.018MetPro: 1.018 ± 0.215
0.351MetGln: 0.351 ± 0.114
0.983MetArg: 0.983 ± 0.208
2.632MetSer: 2.632 ± 0.279
1.579MetThr: 1.579 ± 0.188
0.983MetVal: 0.983 ± 0.176
0.14MetTrp: 0.14 ± 0.076
0.526MetTyr: 0.526 ± 0.141
0.0MetXaa: 0.0 ± 0.0
Asn
1.649AsnAla: 1.649 ± 0.27
0.526AsnCys: 0.526 ± 0.154
1.755AsnAsp: 1.755 ± 0.226
2.316AsnGlu: 2.316 ± 0.342
2.913AsnPhe: 2.913 ± 0.283
1.895AsnGly: 1.895 ± 0.289
0.632AsnHis: 0.632 ± 0.141
5.685AsnIle: 5.685 ± 0.553
5.159AsnLys: 5.159 ± 0.513
5.966AsnLeu: 5.966 ± 0.593
0.772AsnMet: 0.772 ± 0.16
2.667AsnAsn: 2.667 ± 0.318
2.808AsnPro: 2.808 ± 0.3
2.702AsnGln: 2.702 ± 0.387
2.457AsnArg: 2.457 ± 0.351
3.369AsnSer: 3.369 ± 0.402
2.632AsnThr: 2.632 ± 0.269
3.931AsnVal: 3.931 ± 0.389
0.421AsnTrp: 0.421 ± 0.092
2.106AsnTyr: 2.106 ± 0.327
0.0AsnXaa: 0.0 ± 0.0
Pro
1.685ProAla: 1.685 ± 0.292
0.281ProCys: 0.281 ± 0.093
1.79ProAsp: 1.79 ± 0.291
2.351ProGlu: 2.351 ± 0.246
2.948ProPhe: 2.948 ± 0.307
2.246ProGly: 2.246 ± 0.287
0.948ProHis: 0.948 ± 0.147
2.913ProIle: 2.913 ± 0.386
4.422ProLys: 4.422 ± 0.421
5.124ProLeu: 5.124 ± 0.431
0.562ProMet: 0.562 ± 0.153
2.316ProAsn: 2.316 ± 0.316
1.474ProPro: 1.474 ± 0.227
1.369ProGln: 1.369 ± 0.257
1.544ProArg: 1.544 ± 0.243
2.948ProSer: 2.948 ± 0.29
2.457ProThr: 2.457 ± 0.285
2.492ProVal: 2.492 ± 0.358
0.632ProTrp: 0.632 ± 0.176
1.123ProTyr: 1.123 ± 0.199
0.0ProXaa: 0.0 ± 0.0
Gln
2.667GlnAla: 2.667 ± 0.321
0.281GlnCys: 0.281 ± 0.098
1.755GlnAsp: 1.755 ± 0.21
2.773GlnGlu: 2.773 ± 0.362
1.579GlnPhe: 1.579 ± 0.237
2.141GlnGly: 2.141 ± 0.292
0.456GlnHis: 0.456 ± 0.105
3.229GlnIle: 3.229 ± 0.431
4.176GlnLys: 4.176 ± 0.506
3.194GlnLeu: 3.194 ± 0.374
0.948GlnMet: 0.948 ± 0.195
2.246GlnAsn: 2.246 ± 0.362
1.369GlnPro: 1.369 ± 0.241
1.579GlnGln: 1.579 ± 0.303
2.176GlnArg: 2.176 ± 0.293
3.088GlnSer: 3.088 ± 0.313
2.948GlnThr: 2.948 ± 0.346
2.702GlnVal: 2.702 ± 0.366
0.281GlnTrp: 0.281 ± 0.092
1.123GlnTyr: 1.123 ± 0.216
0.0GlnXaa: 0.0 ± 0.0
Arg
2.597ArgAla: 2.597 ± 0.316
0.632ArgCys: 0.632 ± 0.15
2.702ArgAsp: 2.702 ± 0.29
2.281ArgGlu: 2.281 ± 0.434
3.439ArgPhe: 3.439 ± 0.367
2.667ArgGly: 2.667 ± 0.415
0.667ArgHis: 0.667 ± 0.157
2.808ArgIle: 2.808 ± 0.414
3.474ArgLys: 3.474 ± 0.351
3.896ArgLeu: 3.896 ± 0.345
0.772ArgMet: 0.772 ± 0.153
2.036ArgAsn: 2.036 ± 0.342
1.544ArgPro: 1.544 ± 0.289
1.895ArgGln: 1.895 ± 0.292
3.088ArgArg: 3.088 ± 0.378
3.86ArgSer: 3.86 ± 0.36
1.93ArgThr: 1.93 ± 0.225
3.018ArgVal: 3.018 ± 0.386
0.456ArgTrp: 0.456 ± 0.128
1.334ArgTyr: 1.334 ± 0.253
0.0ArgXaa: 0.0 ± 0.0
Ser
3.545SerAla: 3.545 ± 0.333
1.088SerCys: 1.088 ± 0.168
3.229SerAsp: 3.229 ± 0.572
3.229SerGlu: 3.229 ± 0.334
5.931SerPhe: 5.931 ± 0.553
4.352SerGly: 4.352 ± 0.318
1.334SerHis: 1.334 ± 0.264
5.405SerIle: 5.405 ± 0.419
6.212SerLys: 6.212 ± 0.63
8.984SerLeu: 8.984 ± 0.472
1.263SerMet: 1.263 ± 0.229
5.089SerAsn: 5.089 ± 0.48
3.79SerPro: 3.79 ± 0.412
3.755SerGln: 3.755 ± 0.348
2.843SerArg: 2.843 ± 0.307
5.58SerSer: 5.58 ± 0.566
3.545SerThr: 3.545 ± 0.318
5.58SerVal: 5.58 ± 0.461
0.912SerTrp: 0.912 ± 0.158
2.211SerTyr: 2.211 ± 0.298
0.0SerXaa: 0.0 ± 0.0
Thr
3.088ThrAla: 3.088 ± 0.349
0.562ThrCys: 0.562 ± 0.135
1.86ThrAsp: 1.86 ± 0.278
2.246ThrGlu: 2.246 ± 0.251
4.984ThrPhe: 4.984 ± 0.442
3.474ThrGly: 3.474 ± 0.339
0.807ThrHis: 0.807 ± 0.166
3.72ThrIle: 3.72 ± 0.387
5.405ThrLys: 5.405 ± 0.733
5.475ThrLeu: 5.475 ± 0.4
1.018ThrMet: 1.018 ± 0.203
4.317ThrAsn: 4.317 ± 0.538
2.948ThrPro: 2.948 ± 0.301
1.825ThrGln: 1.825 ± 0.31
2.0ThrArg: 2.0 ± 0.234
7.44ThrSer: 7.44 ± 0.875
3.404ThrThr: 3.404 ± 0.381
2.913ThrVal: 2.913 ± 0.372
0.386ThrTrp: 0.386 ± 0.114
1.299ThrTyr: 1.299 ± 0.23
0.0ThrXaa: 0.0 ± 0.0
Val
3.404ValAla: 3.404 ± 0.378
1.439ValCys: 1.439 ± 0.272
2.351ValAsp: 2.351 ± 0.379
3.018ValGlu: 3.018 ± 0.351
4.913ValPhe: 4.913 ± 0.438
4.633ValGly: 4.633 ± 0.546
0.877ValHis: 0.877 ± 0.18
3.65ValIle: 3.65 ± 0.381
6.282ValLys: 6.282 ± 0.541
6.808ValLeu: 6.808 ± 0.493
1.614ValMet: 1.614 ± 0.182
2.492ValAsn: 2.492 ± 0.366
2.492ValPro: 2.492 ± 0.302
1.334ValGln: 1.334 ± 0.236
4.176ValArg: 4.176 ± 0.405
5.299ValSer: 5.299 ± 0.524
3.825ValThr: 3.825 ± 0.326
3.58ValVal: 3.58 ± 0.442
0.807ValTrp: 0.807 ± 0.181
2.071ValTyr: 2.071 ± 0.279
0.0ValXaa: 0.0 ± 0.0
Trp
0.702TrpAla: 0.702 ± 0.187
0.105TrpCys: 0.105 ± 0.061
0.316TrpAsp: 0.316 ± 0.128
0.456TrpGlu: 0.456 ± 0.167
0.772TrpPhe: 0.772 ± 0.166
0.632TrpGly: 0.632 ± 0.164
0.281TrpHis: 0.281 ± 0.109
1.088TrpIle: 1.088 ± 0.218
0.421TrpLys: 0.421 ± 0.116
1.649TrpLeu: 1.649 ± 0.283
0.211TrpMet: 0.211 ± 0.092
0.386TrpAsn: 0.386 ± 0.089
0.316TrpPro: 0.316 ± 0.114
0.456TrpGln: 0.456 ± 0.125
0.281TrpArg: 0.281 ± 0.106
0.737TrpSer: 0.737 ± 0.165
0.456TrpThr: 0.456 ± 0.134
0.877TrpVal: 0.877 ± 0.15
0.14TrpTrp: 0.14 ± 0.061
0.281TrpTyr: 0.281 ± 0.094
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.544TyrAla: 1.544 ± 0.259
0.421TyrCys: 0.421 ± 0.136
1.649TyrAsp: 1.649 ± 0.261
1.053TyrGlu: 1.053 ± 0.218
1.825TyrPhe: 1.825 ± 0.241
1.895TyrGly: 1.895 ± 0.309
0.702TyrHis: 0.702 ± 0.166
1.685TyrIle: 1.685 ± 0.262
2.808TyrLys: 2.808 ± 0.41
3.334TyrLeu: 3.334 ± 0.323
0.667TyrMet: 0.667 ± 0.16
1.614TyrAsn: 1.614 ± 0.269
1.439TyrPro: 1.439 ± 0.22
1.544TyrGln: 1.544 ± 0.201
1.334TyrArg: 1.334 ± 0.231
2.316TyrSer: 2.316 ± 0.3
1.755TyrThr: 1.755 ± 0.241
1.93TyrVal: 1.93 ± 0.293
0.526TyrTrp: 0.526 ± 0.179
1.474TyrTyr: 1.474 ± 0.274
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 142 proteins (28495 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski