Amino acid dipepetide frequency for Ralstonia phage RS-PII-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.904AlaAla: 17.904 ± 2.183
1.246AlaCys: 1.246 ± 0.327
7.707AlaAsp: 7.707 ± 0.708
8.563AlaGlu: 8.563 ± 0.912
3.503AlaPhe: 3.503 ± 0.471
10.276AlaGly: 10.276 ± 1.064
1.868AlaHis: 1.868 ± 0.373
5.06AlaIle: 5.06 ± 0.719
5.371AlaLys: 5.371 ± 1.15
12.144AlaLeu: 12.144 ± 1.304
4.826AlaMet: 4.826 ± 0.833
3.192AlaAsn: 3.192 ± 0.481
4.048AlaPro: 4.048 ± 0.92
6.85AlaGln: 6.85 ± 0.871
7.551AlaArg: 7.551 ± 0.839
6.928AlaSer: 6.928 ± 1.034
6.383AlaThr: 6.383 ± 1.199
6.305AlaVal: 6.305 ± 0.657
1.868AlaTrp: 1.868 ± 0.381
3.036AlaTyr: 3.036 ± 0.36
0.0AlaXaa: 0.0 ± 0.0
Cys
1.012CysAla: 1.012 ± 0.344
0.078CysCys: 0.078 ± 0.076
0.545CysAsp: 0.545 ± 0.215
0.467CysGlu: 0.467 ± 0.164
0.389CysPhe: 0.389 ± 0.157
0.701CysGly: 0.701 ± 0.263
0.234CysHis: 0.234 ± 0.154
0.389CysIle: 0.389 ± 0.166
0.234CysLys: 0.234 ± 0.184
1.09CysLeu: 1.09 ± 0.336
0.156CysMet: 0.156 ± 0.113
0.389CysAsn: 0.389 ± 0.176
0.156CysPro: 0.156 ± 0.114
0.311CysGln: 0.311 ± 0.196
0.311CysArg: 0.311 ± 0.138
0.234CysSer: 0.234 ± 0.124
0.311CysThr: 0.311 ± 0.127
1.168CysVal: 1.168 ± 0.354
0.078CysTrp: 0.078 ± 0.085
0.078CysTyr: 0.078 ± 0.085
0.0CysXaa: 0.0 ± 0.0
Asp
7.395AspAla: 7.395 ± 0.915
0.545AspCys: 0.545 ± 0.237
4.359AspAsp: 4.359 ± 0.647
3.27AspGlu: 3.27 ± 0.409
2.569AspPhe: 2.569 ± 0.388
5.994AspGly: 5.994 ± 0.847
1.012AspHis: 1.012 ± 0.288
3.192AspIle: 3.192 ± 0.578
2.569AspLys: 2.569 ± 0.654
4.281AspLeu: 4.281 ± 0.565
2.024AspMet: 2.024 ± 0.31
1.323AspAsn: 1.323 ± 0.371
4.437AspPro: 4.437 ± 0.538
1.323AspGln: 1.323 ± 0.334
2.802AspArg: 2.802 ± 0.478
3.892AspSer: 3.892 ± 0.571
3.425AspThr: 3.425 ± 0.488
3.581AspVal: 3.581 ± 0.712
1.557AspTrp: 1.557 ± 0.39
2.102AspTyr: 2.102 ± 0.397
0.0AspXaa: 0.0 ± 0.0
Glu
9.186GluAla: 9.186 ± 1.008
0.545GluCys: 0.545 ± 0.179
3.659GluAsp: 3.659 ± 0.594
3.036GluGlu: 3.036 ± 0.466
1.557GluPhe: 1.557 ± 0.331
2.958GluGly: 2.958 ± 0.456
1.946GluHis: 1.946 ± 0.422
1.946GluIle: 1.946 ± 0.482
2.88GluLys: 2.88 ± 0.489
4.982GluLeu: 4.982 ± 0.775
0.856GluMet: 0.856 ± 0.221
2.413GluAsn: 2.413 ± 0.505
2.491GluPro: 2.491 ± 0.48
4.126GluGln: 4.126 ± 0.568
4.126GluArg: 4.126 ± 0.521
2.647GluSer: 2.647 ± 0.458
3.036GluThr: 3.036 ± 0.433
4.048GluVal: 4.048 ± 0.532
1.09GluTrp: 1.09 ± 0.344
1.635GluTyr: 1.635 ± 0.444
0.0GluXaa: 0.0 ± 0.0
Phe
3.27PheAla: 3.27 ± 0.438
0.156PheCys: 0.156 ± 0.14
2.569PheAsp: 2.569 ± 0.332
1.946PheGlu: 1.946 ± 0.587
1.168PhePhe: 1.168 ± 0.407
2.88PheGly: 2.88 ± 0.516
1.09PheHis: 1.09 ± 0.457
1.479PheIle: 1.479 ± 0.32
1.635PheLys: 1.635 ± 0.251
2.413PheLeu: 2.413 ± 0.476
0.778PheMet: 0.778 ± 0.243
2.024PheAsn: 2.024 ± 0.473
1.479PhePro: 1.479 ± 0.311
1.401PheGln: 1.401 ± 0.298
2.024PheArg: 2.024 ± 0.412
1.479PheSer: 1.479 ± 0.465
1.557PheThr: 1.557 ± 0.429
2.102PheVal: 2.102 ± 0.576
0.467PheTrp: 0.467 ± 0.173
1.012PheTyr: 1.012 ± 0.294
0.0PheXaa: 0.0 ± 0.0
Gly
8.719GlyAla: 8.719 ± 1.03
0.467GlyCys: 0.467 ± 0.212
5.06GlyAsp: 5.06 ± 0.716
3.581GlyGlu: 3.581 ± 0.554
3.036GlyPhe: 3.036 ± 0.353
7.395GlyGly: 7.395 ± 0.961
1.635GlyHis: 1.635 ± 0.433
4.281GlyIle: 4.281 ± 0.62
4.126GlyLys: 4.126 ± 0.516
6.305GlyLeu: 6.305 ± 0.685
2.18GlyMet: 2.18 ± 0.639
3.036GlyAsn: 3.036 ± 0.57
2.258GlyPro: 2.258 ± 0.629
3.581GlyGln: 3.581 ± 0.572
4.204GlyArg: 4.204 ± 0.685
5.138GlySer: 5.138 ± 0.895
5.06GlyThr: 5.06 ± 0.721
5.449GlyVal: 5.449 ± 0.631
1.479GlyTrp: 1.479 ± 0.32
2.725GlyTyr: 2.725 ± 0.513
0.0GlyXaa: 0.0 ± 0.0
His
2.102HisAla: 2.102 ± 0.42
0.156HisCys: 0.156 ± 0.138
1.479HisAsp: 1.479 ± 0.343
1.401HisGlu: 1.401 ± 0.338
0.701HisPhe: 0.701 ± 0.224
1.946HisGly: 1.946 ± 0.466
0.623HisHis: 0.623 ± 0.232
1.09HisIle: 1.09 ± 0.331
0.778HisLys: 0.778 ± 0.382
2.024HisLeu: 2.024 ± 0.437
0.623HisMet: 0.623 ± 0.266
0.778HisAsn: 0.778 ± 0.205
0.856HisPro: 0.856 ± 0.27
0.545HisGln: 0.545 ± 0.268
1.09HisArg: 1.09 ± 0.339
0.934HisSer: 0.934 ± 0.499
1.246HisThr: 1.246 ± 0.25
1.09HisVal: 1.09 ± 0.472
0.078HisTrp: 0.078 ± 0.078
0.778HisTyr: 0.778 ± 0.274
0.0HisXaa: 0.0 ± 0.0
Ile
4.515IleAla: 4.515 ± 0.43
0.467IleCys: 0.467 ± 0.254
2.491IleAsp: 2.491 ± 0.506
3.581IleGlu: 3.581 ± 0.571
0.934IlePhe: 0.934 ± 0.271
3.503IleGly: 3.503 ± 0.401
1.401IleHis: 1.401 ± 0.212
2.024IleIle: 2.024 ± 0.292
2.258IleLys: 2.258 ± 0.454
2.88IleLeu: 2.88 ± 0.47
0.778IleMet: 0.778 ± 0.284
1.713IleAsn: 1.713 ± 0.441
2.102IlePro: 2.102 ± 0.332
1.479IleGln: 1.479 ± 0.407
3.347IleArg: 3.347 ± 0.556
1.557IleSer: 1.557 ± 0.281
2.88IleThr: 2.88 ± 0.494
3.114IleVal: 3.114 ± 0.514
0.311IleTrp: 0.311 ± 0.175
0.701IleTyr: 0.701 ± 0.23
0.0IleXaa: 0.0 ± 0.0
Lys
7.006LysAla: 7.006 ± 1.055
0.156LysCys: 0.156 ± 0.102
3.27LysAsp: 3.27 ± 0.648
2.88LysGlu: 2.88 ± 0.448
1.713LysPhe: 1.713 ± 0.26
3.581LysGly: 3.581 ± 0.61
0.701LysHis: 0.701 ± 0.25
1.246LysIle: 1.246 ± 0.316
3.036LysLys: 3.036 ± 0.561
4.982LysLeu: 4.982 ± 0.625
0.545LysMet: 0.545 ± 0.21
2.024LysAsn: 2.024 ± 0.441
2.18LysPro: 2.18 ± 0.543
1.946LysGln: 1.946 ± 0.381
3.192LysArg: 3.192 ± 0.529
1.946LysSer: 1.946 ± 0.463
2.258LysThr: 2.258 ± 0.51
3.737LysVal: 3.737 ± 0.522
1.012LysTrp: 1.012 ± 0.274
1.323LysTyr: 1.323 ± 0.306
0.0LysXaa: 0.0 ± 0.0
Leu
11.677LeuAla: 11.677 ± 0.998
0.856LeuCys: 0.856 ± 0.282
5.683LeuAsp: 5.683 ± 0.716
4.982LeuGlu: 4.982 ± 0.777
1.79LeuPhe: 1.79 ± 0.414
5.605LeuGly: 5.605 ± 0.793
1.635LeuHis: 1.635 ± 0.346
2.802LeuIle: 2.802 ± 0.445
5.293LeuLys: 5.293 ± 0.705
6.15LeuLeu: 6.15 ± 1.033
1.713LeuMet: 1.713 ± 0.29
3.737LeuAsn: 3.737 ± 0.46
3.814LeuPro: 3.814 ± 0.517
3.581LeuGln: 3.581 ± 0.644
5.06LeuArg: 5.06 ± 0.63
4.749LeuSer: 4.749 ± 0.53
6.305LeuThr: 6.305 ± 0.68
4.904LeuVal: 4.904 ± 0.646
1.012LeuTrp: 1.012 ± 0.265
2.258LeuTyr: 2.258 ± 0.366
0.0LeuXaa: 0.0 ± 0.0
Met
2.958MetAla: 2.958 ± 0.424
0.078MetCys: 0.078 ± 0.085
1.635MetAsp: 1.635 ± 0.493
1.246MetGlu: 1.246 ± 0.452
0.778MetPhe: 0.778 ± 0.222
1.868MetGly: 1.868 ± 0.451
0.389MetHis: 0.389 ± 0.197
0.311MetIle: 0.311 ± 0.16
1.09MetLys: 1.09 ± 0.268
2.725MetLeu: 2.725 ± 0.457
0.467MetMet: 0.467 ± 0.16
1.323MetAsn: 1.323 ± 0.525
1.012MetPro: 1.012 ± 0.281
0.934MetGln: 0.934 ± 0.251
2.258MetArg: 2.258 ± 0.367
2.18MetSer: 2.18 ± 0.446
2.18MetThr: 2.18 ± 0.442
1.246MetVal: 1.246 ± 0.36
0.545MetTrp: 0.545 ± 0.217
0.701MetTyr: 0.701 ± 0.258
0.0MetXaa: 0.0 ± 0.0
Asn
4.982AsnAla: 4.982 ± 0.734
0.389AsnCys: 0.389 ± 0.193
1.946AsnAsp: 1.946 ± 0.355
2.102AsnGlu: 2.102 ± 0.341
1.557AsnPhe: 1.557 ± 0.343
3.114AsnGly: 3.114 ± 0.652
0.545AsnHis: 0.545 ± 0.239
1.868AsnIle: 1.868 ± 0.374
1.479AsnLys: 1.479 ± 0.29
3.036AsnLeu: 3.036 ± 0.378
1.012AsnMet: 1.012 ± 0.285
1.401AsnAsn: 1.401 ± 0.388
2.647AsnPro: 2.647 ± 0.406
1.168AsnGln: 1.168 ± 0.279
2.024AsnArg: 2.024 ± 0.373
1.946AsnSer: 1.946 ± 0.351
2.647AsnThr: 2.647 ± 0.599
2.725AsnVal: 2.725 ± 0.589
0.701AsnTrp: 0.701 ± 0.202
1.635AsnTyr: 1.635 ± 0.389
0.0AsnXaa: 0.0 ± 0.0
Pro
6.305ProAla: 6.305 ± 0.983
0.156ProCys: 0.156 ± 0.104
3.814ProAsp: 3.814 ± 0.572
3.347ProGlu: 3.347 ± 0.645
1.635ProPhe: 1.635 ± 0.433
3.814ProGly: 3.814 ± 0.619
0.623ProHis: 0.623 ± 0.265
1.323ProIle: 1.323 ± 0.385
2.413ProLys: 2.413 ± 0.603
3.192ProLeu: 3.192 ± 0.7
1.012ProMet: 1.012 ± 0.356
1.479ProAsn: 1.479 ± 0.335
1.479ProPro: 1.479 ± 0.479
1.479ProGln: 1.479 ± 0.374
1.557ProArg: 1.557 ± 0.368
2.335ProSer: 2.335 ± 0.42
2.958ProThr: 2.958 ± 0.489
3.737ProVal: 3.737 ± 0.649
0.467ProTrp: 0.467 ± 0.171
1.323ProTyr: 1.323 ± 0.474
0.0ProXaa: 0.0 ± 0.0
Gln
5.994GlnAla: 5.994 ± 0.691
0.623GlnCys: 0.623 ± 0.21
1.635GlnAsp: 1.635 ± 0.327
1.479GlnGlu: 1.479 ± 0.265
1.168GlnPhe: 1.168 ± 0.386
3.425GlnGly: 3.425 ± 0.507
1.246GlnHis: 1.246 ± 0.308
2.18GlnIle: 2.18 ± 0.384
1.946GlnLys: 1.946 ± 0.365
3.27GlnLeu: 3.27 ± 0.66
1.557GlnMet: 1.557 ± 0.306
0.934GlnAsn: 0.934 ± 0.251
1.401GlnPro: 1.401 ± 0.313
2.725GlnGln: 2.725 ± 0.674
3.581GlnArg: 3.581 ± 0.549
2.569GlnSer: 2.569 ± 0.546
2.413GlnThr: 2.413 ± 0.469
3.036GlnVal: 3.036 ± 0.677
0.701GlnTrp: 0.701 ± 0.252
1.946GlnTyr: 1.946 ± 0.454
0.0GlnXaa: 0.0 ± 0.0
Arg
6.617ArgAla: 6.617 ± 0.788
0.389ArgCys: 0.389 ± 0.184
3.659ArgAsp: 3.659 ± 0.714
3.97ArgGlu: 3.97 ± 0.731
2.958ArgPhe: 2.958 ± 0.455
4.126ArgGly: 4.126 ± 0.572
1.012ArgHis: 1.012 ± 0.321
2.491ArgIle: 2.491 ± 0.557
2.958ArgLys: 2.958 ± 0.446
5.449ArgLeu: 5.449 ± 0.572
2.491ArgMet: 2.491 ± 0.64
2.647ArgAsn: 2.647 ± 0.541
2.18ArgPro: 2.18 ± 0.482
2.102ArgGln: 2.102 ± 0.449
4.671ArgArg: 4.671 ± 0.9
2.802ArgSer: 2.802 ± 0.573
3.581ArgThr: 3.581 ± 0.514
4.126ArgVal: 4.126 ± 0.662
1.09ArgTrp: 1.09 ± 0.273
2.258ArgTyr: 2.258 ± 0.508
0.0ArgXaa: 0.0 ± 0.0
Ser
5.916SerAla: 5.916 ± 0.901
0.467SerCys: 0.467 ± 0.199
2.569SerAsp: 2.569 ± 0.606
2.958SerGlu: 2.958 ± 0.337
1.246SerPhe: 1.246 ± 0.315
5.138SerGly: 5.138 ± 0.744
1.09SerHis: 1.09 ± 0.422
2.647SerIle: 2.647 ± 0.478
2.647SerLys: 2.647 ± 0.486
4.126SerLeu: 4.126 ± 0.489
1.012SerMet: 1.012 ± 0.261
2.491SerAsn: 2.491 ± 0.438
3.347SerPro: 3.347 ± 0.306
2.18SerGln: 2.18 ± 0.369
3.425SerArg: 3.425 ± 0.501
2.725SerSer: 2.725 ± 0.532
3.503SerThr: 3.503 ± 0.589
3.659SerVal: 3.659 ± 0.79
0.856SerTrp: 0.856 ± 0.247
2.024SerTyr: 2.024 ± 0.684
0.0SerXaa: 0.0 ± 0.0
Thr
7.24ThrAla: 7.24 ± 0.888
0.311ThrCys: 0.311 ± 0.16
2.491ThrAsp: 2.491 ± 0.383
3.503ThrGlu: 3.503 ± 0.431
2.102ThrPhe: 2.102 ± 0.366
5.293ThrGly: 5.293 ± 0.69
1.246ThrHis: 1.246 ± 0.289
2.958ThrIle: 2.958 ± 0.608
3.114ThrLys: 3.114 ± 0.619
5.138ThrLeu: 5.138 ± 0.619
1.246ThrMet: 1.246 ± 0.417
2.102ThrAsn: 2.102 ± 0.604
3.114ThrPro: 3.114 ± 0.467
2.18ThrGln: 2.18 ± 0.269
3.27ThrArg: 3.27 ± 0.37
3.892ThrSer: 3.892 ± 0.789
3.814ThrThr: 3.814 ± 0.453
3.581ThrVal: 3.581 ± 0.616
0.856ThrTrp: 0.856 ± 0.304
1.246ThrTyr: 1.246 ± 0.34
0.0ThrXaa: 0.0 ± 0.0
Val
7.006ValAla: 7.006 ± 0.785
0.778ValCys: 0.778 ± 0.294
4.126ValAsp: 4.126 ± 0.579
3.581ValGlu: 3.581 ± 0.582
2.024ValPhe: 2.024 ± 0.484
4.826ValGly: 4.826 ± 0.825
1.168ValHis: 1.168 ± 0.309
3.425ValIle: 3.425 ± 0.486
3.114ValLys: 3.114 ± 0.586
4.515ValLeu: 4.515 ± 0.791
1.557ValMet: 1.557 ± 0.417
3.892ValAsn: 3.892 ± 0.691
3.503ValPro: 3.503 ± 0.514
3.425ValGln: 3.425 ± 0.695
4.671ValArg: 4.671 ± 0.47
3.503ValSer: 3.503 ± 0.977
3.27ValThr: 3.27 ± 0.518
5.06ValVal: 5.06 ± 0.906
0.934ValTrp: 0.934 ± 0.201
1.868ValTyr: 1.868 ± 0.322
0.0ValXaa: 0.0 ± 0.0
Trp
2.102TrpAla: 2.102 ± 0.394
0.078TrpCys: 0.078 ± 0.071
0.934TrpAsp: 0.934 ± 0.301
1.323TrpGlu: 1.323 ± 0.339
0.778TrpPhe: 0.778 ± 0.263
0.856TrpGly: 0.856 ± 0.289
0.311TrpHis: 0.311 ± 0.139
0.467TrpIle: 0.467 ± 0.168
0.545TrpLys: 0.545 ± 0.222
1.946TrpLeu: 1.946 ± 0.574
0.311TrpMet: 0.311 ± 0.148
0.623TrpAsn: 0.623 ± 0.236
0.467TrpPro: 0.467 ± 0.247
1.168TrpGln: 1.168 ± 0.283
0.623TrpArg: 0.623 ± 0.245
0.623TrpSer: 0.623 ± 0.226
0.623TrpThr: 0.623 ± 0.221
1.09TrpVal: 1.09 ± 0.425
0.234TrpTrp: 0.234 ± 0.152
0.545TrpTyr: 0.545 ± 0.194
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.802TyrAla: 2.802 ± 0.46
0.389TyrCys: 0.389 ± 0.179
2.024TyrAsp: 2.024 ± 0.499
1.946TyrGlu: 1.946 ± 0.376
1.323TyrPhe: 1.323 ± 0.464
2.413TyrGly: 2.413 ± 0.444
0.545TyrHis: 0.545 ± 0.162
0.856TyrIle: 0.856 ± 0.311
1.246TyrLys: 1.246 ± 0.337
2.958TyrLeu: 2.958 ± 0.412
0.701TyrMet: 0.701 ± 0.265
1.401TyrAsn: 1.401 ± 0.291
1.479TyrPro: 1.479 ± 0.451
1.323TyrGln: 1.323 ± 0.338
1.79TyrArg: 1.79 ± 0.339
1.946TyrSer: 1.946 ± 0.491
1.246TyrThr: 1.246 ± 0.363
2.491TyrVal: 2.491 ± 0.482
0.311TyrTrp: 0.311 ± 0.223
0.389TyrTyr: 0.389 ± 0.172
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (12847 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski