Amino acid dipepetide frequency for Ralstonia phage RSB3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.541AlaAla: 17.541 ± 1.704
1.355AlaCys: 1.355 ± 0.305
5.947AlaAsp: 5.947 ± 0.603
6.474AlaGlu: 6.474 ± 0.654
2.71AlaPhe: 2.71 ± 0.426
7.604AlaGly: 7.604 ± 0.868
2.334AlaHis: 2.334 ± 0.442
5.646AlaIle: 5.646 ± 0.675
6.023AlaLys: 6.023 ± 0.695
10.54AlaLeu: 10.54 ± 1.017
3.839AlaMet: 3.839 ± 0.486
5.044AlaAsn: 5.044 ± 0.587
5.496AlaPro: 5.496 ± 1.455
6.474AlaGln: 6.474 ± 0.771
5.571AlaArg: 5.571 ± 0.641
6.098AlaSer: 6.098 ± 0.932
6.7AlaThr: 6.7 ± 0.623
7.227AlaVal: 7.227 ± 0.94
1.43AlaTrp: 1.43 ± 0.282
4.065AlaTyr: 4.065 ± 0.571
0.0AlaXaa: 0.0 ± 0.0
Cys
0.828CysAla: 0.828 ± 0.237
0.151CysCys: 0.151 ± 0.096
0.979CysAsp: 0.979 ± 0.283
0.301CysGlu: 0.301 ± 0.167
0.376CysPhe: 0.376 ± 0.184
0.828CysGly: 0.828 ± 0.299
0.301CysHis: 0.301 ± 0.203
0.376CysIle: 0.376 ± 0.156
0.376CysLys: 0.376 ± 0.172
0.376CysLeu: 0.376 ± 0.158
0.452CysMet: 0.452 ± 0.221
0.301CysAsn: 0.301 ± 0.145
0.452CysPro: 0.452 ± 0.181
0.301CysGln: 0.301 ± 0.141
0.828CysArg: 0.828 ± 0.349
0.452CysSer: 0.452 ± 0.222
0.602CysThr: 0.602 ± 0.22
0.753CysVal: 0.753 ± 0.291
0.151CysTrp: 0.151 ± 0.107
0.452CysTyr: 0.452 ± 0.184
0.0CysXaa: 0.0 ± 0.0
Asp
6.926AspAla: 6.926 ± 0.786
0.828AspCys: 0.828 ± 0.254
3.915AspAsp: 3.915 ± 0.502
3.689AspGlu: 3.689 ± 0.652
2.108AspPhe: 2.108 ± 0.449
4.291AspGly: 4.291 ± 0.642
1.28AspHis: 1.28 ± 0.341
3.313AspIle: 3.313 ± 0.365
2.861AspLys: 2.861 ± 0.407
4.818AspLeu: 4.818 ± 0.651
1.656AspMet: 1.656 ± 0.347
1.732AspAsn: 1.732 ± 0.475
2.861AspPro: 2.861 ± 0.476
1.506AspGln: 1.506 ± 0.281
3.162AspArg: 3.162 ± 0.409
3.538AspSer: 3.538 ± 0.645
3.915AspThr: 3.915 ± 0.432
3.162AspVal: 3.162 ± 0.371
0.753AspTrp: 0.753 ± 0.209
1.882AspTyr: 1.882 ± 0.457
0.0AspXaa: 0.0 ± 0.0
Glu
6.474GluAla: 6.474 ± 0.767
0.753GluCys: 0.753 ± 0.277
3.162GluAsp: 3.162 ± 0.529
3.538GluGlu: 3.538 ± 0.525
2.334GluPhe: 2.334 ± 0.545
4.818GluGly: 4.818 ± 0.578
1.882GluHis: 1.882 ± 0.405
2.259GluIle: 2.259 ± 0.44
3.087GluLys: 3.087 ± 0.485
5.496GluLeu: 5.496 ± 0.639
2.108GluMet: 2.108 ± 0.365
1.732GluAsn: 1.732 ± 0.266
1.807GluPro: 1.807 ± 0.42
3.011GluGln: 3.011 ± 0.594
3.915GluArg: 3.915 ± 0.572
2.484GluSer: 2.484 ± 0.55
2.56GluThr: 2.56 ± 0.34
4.291GluVal: 4.291 ± 0.678
0.753GluTrp: 0.753 ± 0.202
1.957GluTyr: 1.957 ± 0.438
0.0GluXaa: 0.0 ± 0.0
Phe
2.259PheAla: 2.259 ± 0.345
0.301PheCys: 0.301 ± 0.15
2.033PheAsp: 2.033 ± 0.33
1.882PheGlu: 1.882 ± 0.326
1.129PhePhe: 1.129 ± 0.242
2.409PheGly: 2.409 ± 0.383
0.527PheHis: 0.527 ± 0.213
1.205PheIle: 1.205 ± 0.344
1.882PheLys: 1.882 ± 0.467
2.334PheLeu: 2.334 ± 0.267
0.602PheMet: 0.602 ± 0.177
2.108PheAsn: 2.108 ± 0.437
1.882PhePro: 1.882 ± 0.444
1.28PheGln: 1.28 ± 0.325
1.28PheArg: 1.28 ± 0.278
2.635PheSer: 2.635 ± 0.349
2.183PheThr: 2.183 ± 0.399
2.409PheVal: 2.409 ± 0.446
0.376PheTrp: 0.376 ± 0.149
1.28PheTyr: 1.28 ± 0.379
0.0PheXaa: 0.0 ± 0.0
Gly
8.884GlyAla: 8.884 ± 0.816
0.376GlyCys: 0.376 ± 0.155
3.764GlyAsp: 3.764 ± 0.679
4.668GlyGlu: 4.668 ± 0.588
2.635GlyPhe: 2.635 ± 0.54
6.098GlyGly: 6.098 ± 0.706
1.355GlyHis: 1.355 ± 0.293
4.366GlyIle: 4.366 ± 0.567
5.722GlyLys: 5.722 ± 0.715
5.872GlyLeu: 5.872 ± 0.79
2.409GlyMet: 2.409 ± 0.335
3.463GlyAsn: 3.463 ± 0.597
1.732GlyPro: 1.732 ± 0.339
3.162GlyGln: 3.162 ± 0.527
3.237GlyArg: 3.237 ± 0.519
3.839GlySer: 3.839 ± 0.679
5.872GlyThr: 5.872 ± 0.833
5.42GlyVal: 5.42 ± 0.689
1.506GlyTrp: 1.506 ± 0.313
2.033GlyTyr: 2.033 ± 0.47
0.0GlyXaa: 0.0 ± 0.0
His
1.656HisAla: 1.656 ± 0.322
0.075HisCys: 0.075 ± 0.068
1.355HisAsp: 1.355 ± 0.243
1.28HisGlu: 1.28 ± 0.307
0.753HisPhe: 0.753 ± 0.255
1.882HisGly: 1.882 ± 0.499
0.678HisHis: 0.678 ± 0.184
0.903HisIle: 0.903 ± 0.255
1.355HisLys: 1.355 ± 0.324
1.581HisLeu: 1.581 ± 0.34
0.602HisMet: 0.602 ± 0.228
1.355HisAsn: 1.355 ± 0.357
0.903HisPro: 0.903 ± 0.246
1.054HisGln: 1.054 ± 0.263
1.355HisArg: 1.355 ± 0.363
0.527HisSer: 0.527 ± 0.227
1.205HisThr: 1.205 ± 0.301
1.28HisVal: 1.28 ± 0.344
0.151HisTrp: 0.151 ± 0.104
1.054HisTyr: 1.054 ± 0.317
0.0HisXaa: 0.0 ± 0.0
Ile
5.044IleAla: 5.044 ± 0.539
0.376IleCys: 0.376 ± 0.19
3.087IleAsp: 3.087 ± 0.445
3.388IleGlu: 3.388 ± 0.45
0.828IlePhe: 0.828 ± 0.226
3.162IleGly: 3.162 ± 0.524
1.506IleHis: 1.506 ± 0.317
1.355IleIle: 1.355 ± 0.287
3.538IleLys: 3.538 ± 0.668
2.484IleLeu: 2.484 ± 0.568
0.828IleMet: 0.828 ± 0.218
2.033IleAsn: 2.033 ± 0.325
1.43IlePro: 1.43 ± 0.325
1.732IleGln: 1.732 ± 0.294
2.484IleArg: 2.484 ± 0.435
3.011IleSer: 3.011 ± 0.323
2.108IleThr: 2.108 ± 0.484
3.087IleVal: 3.087 ± 0.52
0.527IleTrp: 0.527 ± 0.241
1.054IleTyr: 1.054 ± 0.254
0.0IleXaa: 0.0 ± 0.0
Lys
6.098LysAla: 6.098 ± 0.763
0.376LysCys: 0.376 ± 0.145
2.861LysAsp: 2.861 ± 0.397
2.861LysGlu: 2.861 ± 0.64
1.355LysPhe: 1.355 ± 0.268
4.216LysGly: 4.216 ± 0.633
0.828LysHis: 0.828 ± 0.225
1.732LysIle: 1.732 ± 0.331
2.334LysLys: 2.334 ± 0.581
5.646LysLeu: 5.646 ± 0.688
1.129LysMet: 1.129 ± 0.375
2.334LysAsn: 2.334 ± 0.368
2.334LysPro: 2.334 ± 0.394
3.463LysGln: 3.463 ± 0.707
3.087LysArg: 3.087 ± 0.528
2.259LysSer: 2.259 ± 0.44
2.936LysThr: 2.936 ± 0.448
3.915LysVal: 3.915 ± 0.533
0.602LysTrp: 0.602 ± 0.184
1.506LysTyr: 1.506 ± 0.239
0.0LysXaa: 0.0 ± 0.0
Leu
10.389LeuAla: 10.389 ± 1.165
0.602LeuCys: 0.602 ± 0.217
5.119LeuAsp: 5.119 ± 0.697
4.291LeuGlu: 4.291 ± 0.512
2.71LeuPhe: 2.71 ± 0.582
5.42LeuGly: 5.42 ± 0.647
2.033LeuHis: 2.033 ± 0.404
3.614LeuIle: 3.614 ± 0.528
3.237LeuLys: 3.237 ± 0.368
6.625LeuLeu: 6.625 ± 0.759
3.011LeuMet: 3.011 ± 0.477
3.689LeuAsn: 3.689 ± 0.557
3.313LeuPro: 3.313 ± 0.52
2.484LeuGln: 2.484 ± 0.459
5.722LeuArg: 5.722 ± 0.819
5.119LeuSer: 5.119 ± 0.599
5.646LeuThr: 5.646 ± 0.608
6.098LeuVal: 6.098 ± 0.819
1.129LeuTrp: 1.129 ± 0.299
1.957LeuTyr: 1.957 ± 0.448
0.0LeuXaa: 0.0 ± 0.0
Met
2.635MetAla: 2.635 ± 0.385
0.301MetCys: 0.301 ± 0.17
1.656MetAsp: 1.656 ± 0.368
1.43MetGlu: 1.43 ± 0.305
0.979MetPhe: 0.979 ± 0.296
2.033MetGly: 2.033 ± 0.468
0.903MetHis: 0.903 ± 0.211
0.903MetIle: 0.903 ± 0.314
1.581MetLys: 1.581 ± 0.355
2.71MetLeu: 2.71 ± 0.428
0.903MetMet: 0.903 ± 0.251
0.903MetAsn: 0.903 ± 0.307
2.259MetPro: 2.259 ± 0.337
1.205MetGln: 1.205 ± 0.307
2.183MetArg: 2.183 ± 0.435
2.108MetSer: 2.108 ± 0.387
1.581MetThr: 1.581 ± 0.305
1.732MetVal: 1.732 ± 0.425
0.301MetTrp: 0.301 ± 0.102
1.129MetTyr: 1.129 ± 0.302
0.0MetXaa: 0.0 ± 0.0
Asn
4.893AsnAla: 4.893 ± 0.668
0.602AsnCys: 0.602 ± 0.231
2.334AsnAsp: 2.334 ± 0.392
2.259AsnGlu: 2.259 ± 0.583
1.129AsnPhe: 1.129 ± 0.262
3.614AsnGly: 3.614 ± 0.455
0.602AsnHis: 0.602 ± 0.195
1.957AsnIle: 1.957 ± 0.289
1.656AsnLys: 1.656 ± 0.303
3.011AsnLeu: 3.011 ± 0.495
1.129AsnMet: 1.129 ± 0.304
1.656AsnAsn: 1.656 ± 0.49
2.484AsnPro: 2.484 ± 0.341
1.581AsnGln: 1.581 ± 0.316
2.334AsnArg: 2.334 ± 0.397
2.484AsnSer: 2.484 ± 0.614
3.162AsnThr: 3.162 ± 0.466
3.237AsnVal: 3.237 ± 0.562
0.452AsnTrp: 0.452 ± 0.202
1.129AsnTyr: 1.129 ± 0.273
0.0AsnXaa: 0.0 ± 0.0
Pro
5.646ProAla: 5.646 ± 1.241
0.226ProCys: 0.226 ± 0.116
2.409ProAsp: 2.409 ± 0.481
3.463ProGlu: 3.463 ± 0.6
1.581ProPhe: 1.581 ± 0.305
2.786ProGly: 2.786 ± 0.42
0.226ProHis: 0.226 ± 0.134
1.28ProIle: 1.28 ± 0.216
2.259ProLys: 2.259 ± 0.463
3.011ProLeu: 3.011 ± 0.444
0.903ProMet: 0.903 ± 0.182
1.506ProAsn: 1.506 ± 0.317
1.581ProPro: 1.581 ± 0.445
1.656ProGln: 1.656 ± 0.317
2.183ProArg: 2.183 ± 0.376
3.011ProSer: 3.011 ± 0.466
2.71ProThr: 2.71 ± 0.382
3.162ProVal: 3.162 ± 0.588
0.376ProTrp: 0.376 ± 0.173
2.108ProTyr: 2.108 ± 0.392
0.0ProXaa: 0.0 ± 0.0
Gln
6.625GlnAla: 6.625 ± 1.101
0.301GlnCys: 0.301 ± 0.2
2.183GlnAsp: 2.183 ± 0.401
3.011GlnGlu: 3.011 ± 0.47
1.506GlnPhe: 1.506 ± 0.27
3.463GlnGly: 3.463 ± 0.429
0.979GlnHis: 0.979 ± 0.232
1.205GlnIle: 1.205 ± 0.222
2.183GlnLys: 2.183 ± 0.268
3.388GlnLeu: 3.388 ± 0.488
1.205GlnMet: 1.205 ± 0.201
1.506GlnAsn: 1.506 ± 0.328
1.205GlnPro: 1.205 ± 0.365
3.162GlnGln: 3.162 ± 0.592
3.011GlnArg: 3.011 ± 0.57
3.162GlnSer: 3.162 ± 0.678
2.183GlnThr: 2.183 ± 0.419
2.861GlnVal: 2.861 ± 0.432
0.753GlnTrp: 0.753 ± 0.263
1.732GlnTyr: 1.732 ± 0.391
0.0GlnXaa: 0.0 ± 0.0
Arg
5.947ArgAla: 5.947 ± 0.717
0.602ArgCys: 0.602 ± 0.233
3.237ArgAsp: 3.237 ± 0.618
3.614ArgGlu: 3.614 ± 0.534
2.033ArgPhe: 2.033 ± 0.368
4.065ArgGly: 4.065 ± 0.683
1.28ArgHis: 1.28 ± 0.342
3.764ArgIle: 3.764 ± 0.617
3.915ArgLys: 3.915 ± 0.74
4.969ArgLeu: 4.969 ± 0.721
1.957ArgMet: 1.957 ± 0.35
2.635ArgAsn: 2.635 ± 0.443
1.656ArgPro: 1.656 ± 0.328
2.334ArgGln: 2.334 ± 0.444
3.839ArgArg: 3.839 ± 0.676
2.861ArgSer: 2.861 ± 0.36
2.71ArgThr: 2.71 ± 0.361
2.635ArgVal: 2.635 ± 0.374
0.527ArgTrp: 0.527 ± 0.205
1.807ArgTyr: 1.807 ± 0.377
0.0ArgXaa: 0.0 ± 0.0
Ser
5.571SerAla: 5.571 ± 0.749
0.602SerCys: 0.602 ± 0.207
3.237SerAsp: 3.237 ± 0.499
2.334SerGlu: 2.334 ± 0.351
2.183SerPhe: 2.183 ± 0.383
5.119SerGly: 5.119 ± 0.594
0.753SerHis: 0.753 ± 0.221
2.786SerIle: 2.786 ± 0.441
2.71SerLys: 2.71 ± 0.439
4.065SerLeu: 4.065 ± 0.54
1.732SerMet: 1.732 ± 0.406
3.087SerAsn: 3.087 ± 0.573
2.334SerPro: 2.334 ± 0.534
3.162SerGln: 3.162 ± 0.485
2.409SerArg: 2.409 ± 0.451
3.388SerSer: 3.388 ± 0.856
4.442SerThr: 4.442 ± 0.707
3.614SerVal: 3.614 ± 0.632
1.28SerTrp: 1.28 ± 0.281
2.56SerTyr: 2.56 ± 0.481
0.0SerXaa: 0.0 ± 0.0
Thr
7.528ThrAla: 7.528 ± 0.659
0.527ThrCys: 0.527 ± 0.191
3.162ThrAsp: 3.162 ± 0.498
2.71ThrGlu: 2.71 ± 0.397
2.334ThrPhe: 2.334 ± 0.395
5.195ThrGly: 5.195 ± 0.518
1.054ThrHis: 1.054 ± 0.265
2.259ThrIle: 2.259 ± 0.426
2.484ThrLys: 2.484 ± 0.58
5.345ThrLeu: 5.345 ± 0.56
1.656ThrMet: 1.656 ± 0.434
2.334ThrAsn: 2.334 ± 0.5
3.011ThrPro: 3.011 ± 0.62
1.807ThrGln: 1.807 ± 0.493
3.162ThrArg: 3.162 ± 0.447
3.915ThrSer: 3.915 ± 0.743
3.463ThrThr: 3.463 ± 0.537
5.42ThrVal: 5.42 ± 0.782
1.43ThrTrp: 1.43 ± 0.425
1.882ThrTyr: 1.882 ± 0.388
0.0ThrXaa: 0.0 ± 0.0
Val
7.83ValAla: 7.83 ± 1.011
0.979ValCys: 0.979 ± 0.239
4.592ValAsp: 4.592 ± 0.569
3.689ValGlu: 3.689 ± 0.56
1.506ValPhe: 1.506 ± 0.413
5.345ValGly: 5.345 ± 0.758
1.807ValHis: 1.807 ± 0.354
2.71ValIle: 2.71 ± 0.409
3.087ValLys: 3.087 ± 0.515
4.893ValLeu: 4.893 ± 0.557
1.656ValMet: 1.656 ± 0.408
2.409ValAsn: 2.409 ± 0.437
3.237ValPro: 3.237 ± 0.34
3.689ValGln: 3.689 ± 0.616
4.065ValArg: 4.065 ± 0.544
4.065ValSer: 4.065 ± 0.634
4.291ValThr: 4.291 ± 0.761
4.893ValVal: 4.893 ± 0.606
1.355ValTrp: 1.355 ± 0.326
1.957ValTyr: 1.957 ± 0.329
0.0ValXaa: 0.0 ± 0.0
Trp
1.807TrpAla: 1.807 ± 0.465
0.075TrpCys: 0.075 ± 0.068
0.828TrpAsp: 0.828 ± 0.318
1.054TrpGlu: 1.054 ± 0.3
0.527TrpPhe: 0.527 ± 0.199
1.129TrpGly: 1.129 ± 0.356
0.226TrpHis: 0.226 ± 0.12
0.301TrpIle: 0.301 ± 0.152
0.678TrpLys: 0.678 ± 0.24
1.355TrpLeu: 1.355 ± 0.45
0.753TrpMet: 0.753 ± 0.25
0.678TrpAsn: 0.678 ± 0.189
0.452TrpPro: 0.452 ± 0.212
0.753TrpGln: 0.753 ± 0.191
0.678TrpArg: 0.678 ± 0.235
0.828TrpSer: 0.828 ± 0.23
0.979TrpThr: 0.979 ± 0.252
0.753TrpVal: 0.753 ± 0.219
0.602TrpTrp: 0.602 ± 0.187
0.376TrpTyr: 0.376 ± 0.171
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.538TyrAla: 3.538 ± 0.527
0.226TyrCys: 0.226 ± 0.131
2.334TyrAsp: 2.334 ± 0.427
2.484TyrGlu: 2.484 ± 0.523
1.28TyrPhe: 1.28 ± 0.339
2.936TyrGly: 2.936 ± 0.488
0.452TyrHis: 0.452 ± 0.183
1.129TyrIle: 1.129 ± 0.315
0.903TyrLys: 0.903 ± 0.199
3.689TyrLeu: 3.689 ± 0.478
0.828TyrMet: 0.828 ± 0.25
1.205TyrAsn: 1.205 ± 0.235
1.732TyrPro: 1.732 ± 0.451
1.807TyrGln: 1.807 ± 0.351
1.882TyrArg: 1.882 ± 0.457
1.581TyrSer: 1.581 ± 0.371
1.43TyrThr: 1.43 ± 0.339
2.108TyrVal: 2.108 ± 0.489
0.376TyrTrp: 0.376 ± 0.162
0.979TyrTyr: 0.979 ± 0.229
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (13284 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski