Amino acid dipepetide frequency for Ralstonia phage RS603

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.102AlaAla: 18.102 ± 3.239
1.766AlaCys: 1.766 ± 0.76
5.298AlaAsp: 5.298 ± 1.476
7.064AlaGlu: 7.064 ± 3.136
4.415AlaPhe: 4.415 ± 2.025
9.713AlaGly: 9.713 ± 1.917
2.208AlaHis: 2.208 ± 0.752
6.181AlaIle: 6.181 ± 1.804
6.623AlaLys: 6.623 ± 3.848
14.128AlaLeu: 14.128 ± 2.522
4.857AlaMet: 4.857 ± 1.298
2.649AlaAsn: 2.649 ± 1.236
3.974AlaPro: 3.974 ± 0.96
7.064AlaGln: 7.064 ± 1.319
7.947AlaArg: 7.947 ± 2.42
5.74AlaSer: 5.74 ± 1.544
7.064AlaThr: 7.064 ± 2.424
10.155AlaVal: 10.155 ± 2.18
4.415AlaTrp: 4.415 ± 1.146
2.649AlaTyr: 2.649 ± 0.91
0.0AlaXaa: 0.0 ± 0.0
Cys
1.325CysAla: 1.325 ± 0.634
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.883CysGlu: 0.883 ± 0.758
0.442CysPhe: 0.442 ± 0.372
0.442CysGly: 0.442 ± 0.372
0.883CysHis: 0.883 ± 0.436
1.325CysIle: 1.325 ± 0.595
0.442CysLys: 0.442 ± 0.372
0.883CysLeu: 0.883 ± 0.632
0.442CysMet: 0.442 ± 0.525
0.0CysAsn: 0.0 ± 0.0
0.442CysPro: 0.442 ± 0.354
0.442CysGln: 0.442 ± 0.388
0.442CysArg: 0.442 ± 0.379
1.325CysSer: 1.325 ± 0.827
0.442CysThr: 0.442 ± 0.354
1.325CysVal: 1.325 ± 0.621
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
5.74AspAla: 5.74 ± 1.194
0.0AspCys: 0.0 ± 0.0
1.766AspAsp: 1.766 ± 0.55
3.532AspGlu: 3.532 ± 1.056
1.766AspPhe: 1.766 ± 0.904
5.298AspGly: 5.298 ± 1.294
0.883AspHis: 0.883 ± 0.777
2.649AspIle: 2.649 ± 1.029
1.325AspLys: 1.325 ± 0.704
4.415AspLeu: 4.415 ± 1.13
1.325AspMet: 1.325 ± 0.541
1.325AspAsn: 1.325 ± 0.484
1.325AspPro: 1.325 ± 0.621
2.649AspGln: 2.649 ± 0.968
3.532AspArg: 3.532 ± 1.329
2.649AspSer: 2.649 ± 1.284
2.649AspThr: 2.649 ± 1.051
3.532AspVal: 3.532 ± 1.023
0.883AspTrp: 0.883 ± 0.615
1.325AspTyr: 1.325 ± 0.821
0.0AspXaa: 0.0 ± 0.0
Glu
8.83GluAla: 8.83 ± 3.889
1.325GluCys: 1.325 ± 0.541
1.766GluAsp: 1.766 ± 0.789
3.091GluGlu: 3.091 ± 1.324
4.857GluPhe: 4.857 ± 1.7
2.649GluGly: 2.649 ± 1.141
0.883GluHis: 0.883 ± 0.491
1.766GluIle: 1.766 ± 0.848
1.766GluLys: 1.766 ± 0.835
5.298GluLeu: 5.298 ± 2.098
0.442GluMet: 0.442 ± 0.538
2.649GluAsn: 2.649 ± 1.191
1.325GluPro: 1.325 ± 0.64
2.208GluGln: 2.208 ± 0.969
3.091GluArg: 3.091 ± 1.162
1.325GluSer: 1.325 ± 0.682
1.766GluThr: 1.766 ± 0.982
2.649GluVal: 2.649 ± 1.153
0.883GluTrp: 0.883 ± 0.384
1.325GluTyr: 1.325 ± 0.894
0.0GluXaa: 0.0 ± 0.0
Phe
3.091PheAla: 3.091 ± 1.262
0.0PheCys: 0.0 ± 0.0
1.766PheAsp: 1.766 ± 0.791
1.325PheGlu: 1.325 ± 0.497
3.974PhePhe: 3.974 ± 2.325
3.532PheGly: 3.532 ± 1.348
0.442PheHis: 0.442 ± 0.372
1.325PheIle: 1.325 ± 1.339
0.442PheLys: 0.442 ± 0.505
2.208PheLeu: 2.208 ± 0.809
0.883PheMet: 0.883 ± 0.6
2.649PheAsn: 2.649 ± 1.203
0.442PhePro: 0.442 ± 0.354
0.883PheGln: 0.883 ± 0.837
2.649PheArg: 2.649 ± 0.897
2.208PheSer: 2.208 ± 0.725
1.766PheThr: 1.766 ± 1.37
2.649PheVal: 2.649 ± 0.702
0.883PheTrp: 0.883 ± 0.491
0.883PheTyr: 0.883 ± 0.708
0.0PheXaa: 0.0 ± 0.0
Gly
8.83GlyAla: 8.83 ± 1.557
0.883GlyCys: 0.883 ± 0.708
4.415GlyAsp: 4.415 ± 1.382
1.766GlyGlu: 1.766 ± 1.097
3.532GlyPhe: 3.532 ± 1.739
10.596GlyGly: 10.596 ± 1.869
1.325GlyHis: 1.325 ± 0.598
3.091GlyIle: 3.091 ± 1.661
2.649GlyLys: 2.649 ± 1.473
5.74GlyLeu: 5.74 ± 2.746
3.091GlyMet: 3.091 ± 1.021
4.415GlyAsn: 4.415 ± 2.986
2.208GlyPro: 2.208 ± 0.619
3.091GlyGln: 3.091 ± 0.668
6.623GlyArg: 6.623 ± 1.591
3.974GlySer: 3.974 ± 1.766
4.415GlyThr: 4.415 ± 1.428
9.713GlyVal: 9.713 ± 2.451
1.766GlyTrp: 1.766 ± 0.76
2.649GlyTyr: 2.649 ± 0.802
0.0GlyXaa: 0.0 ± 0.0
His
0.883HisAla: 0.883 ± 0.466
0.0HisCys: 0.0 ± 0.0
2.208HisAsp: 2.208 ± 1.017
1.766HisGlu: 1.766 ± 0.971
0.883HisPhe: 0.883 ± 0.538
1.766HisGly: 1.766 ± 0.786
0.0HisHis: 0.0 ± 0.0
0.442HisIle: 0.442 ± 0.372
0.442HisLys: 0.442 ± 0.388
0.442HisLeu: 0.442 ± 0.43
0.883HisMet: 0.883 ± 0.743
0.883HisAsn: 0.883 ± 0.623
0.883HisPro: 0.883 ± 0.384
0.442HisGln: 0.442 ± 0.388
2.208HisArg: 2.208 ± 1.246
0.442HisSer: 0.442 ± 0.372
0.0HisThr: 0.0 ± 0.0
1.325HisVal: 1.325 ± 0.804
0.0HisTrp: 0.0 ± 0.0
0.883HisTyr: 0.883 ± 0.436
0.0HisXaa: 0.0 ± 0.0
Ile
8.389IleAla: 8.389 ± 2.261
0.0IleCys: 0.0 ± 0.0
1.766IleAsp: 1.766 ± 0.807
2.649IleGlu: 2.649 ± 1.608
0.442IlePhe: 0.442 ± 0.498
3.974IleGly: 3.974 ± 1.536
0.0IleHis: 0.0 ± 0.0
0.442IleIle: 0.442 ± 0.505
1.766IleLys: 1.766 ± 0.694
2.649IleLeu: 2.649 ± 0.943
0.442IleMet: 0.442 ± 0.352
2.208IleAsn: 2.208 ± 0.976
1.325IlePro: 1.325 ± 0.861
2.208IleGln: 2.208 ± 1.02
1.766IleArg: 1.766 ± 0.757
1.766IleSer: 1.766 ± 0.945
1.325IleThr: 1.325 ± 0.534
2.649IleVal: 2.649 ± 1.132
0.442IleTrp: 0.442 ± 0.43
0.442IleTyr: 0.442 ± 0.538
0.0IleXaa: 0.0 ± 0.0
Lys
4.857LysAla: 4.857 ± 1.566
0.0LysCys: 0.0 ± 0.0
2.208LysAsp: 2.208 ± 0.863
2.208LysGlu: 2.208 ± 1.125
0.883LysPhe: 0.883 ± 0.384
3.091LysGly: 3.091 ± 1.901
0.0LysHis: 0.0 ± 0.0
1.766LysIle: 1.766 ± 1.219
2.208LysLys: 2.208 ± 1.111
3.532LysLeu: 3.532 ± 1.732
0.442LysMet: 0.442 ± 0.379
0.442LysAsn: 0.442 ± 0.538
1.325LysPro: 1.325 ± 1.115
2.208LysGln: 2.208 ± 0.748
3.091LysArg: 3.091 ± 1.066
2.208LysSer: 2.208 ± 0.75
3.532LysThr: 3.532 ± 1.066
4.857LysVal: 4.857 ± 1.601
1.325LysTrp: 1.325 ± 0.872
1.766LysTyr: 1.766 ± 0.828
0.0LysXaa: 0.0 ± 0.0
Leu
8.83LeuAla: 8.83 ± 1.491
0.883LeuCys: 0.883 ± 0.678
6.181LeuAsp: 6.181 ± 1.356
3.974LeuGlu: 3.974 ± 2.235
3.091LeuPhe: 3.091 ± 1.749
4.415LeuGly: 4.415 ± 2.017
3.091LeuHis: 3.091 ± 1.374
3.091LeuIle: 3.091 ± 1.002
3.974LeuLys: 3.974 ± 1.488
7.506LeuLeu: 7.506 ± 3.081
2.208LeuMet: 2.208 ± 0.672
2.208LeuAsn: 2.208 ± 0.809
4.857LeuPro: 4.857 ± 1.23
3.974LeuGln: 3.974 ± 1.625
5.74LeuArg: 5.74 ± 1.938
3.974LeuSer: 3.974 ± 1.459
4.857LeuThr: 4.857 ± 1.373
7.064LeuVal: 7.064 ± 1.918
3.091LeuTrp: 3.091 ± 1.061
0.442LeuTyr: 0.442 ± 0.354
0.0LeuXaa: 0.0 ± 0.0
Met
4.415MetAla: 4.415 ± 1.436
0.883MetCys: 0.883 ± 0.466
0.883MetAsp: 0.883 ± 0.756
0.883MetGlu: 0.883 ± 0.638
0.442MetPhe: 0.442 ± 0.498
0.883MetGly: 0.883 ± 0.749
0.883MetHis: 0.883 ± 0.634
1.325MetIle: 1.325 ± 0.703
1.325MetLys: 1.325 ± 0.92
2.649MetLeu: 2.649 ± 0.963
0.883MetMet: 0.883 ± 0.615
0.442MetAsn: 0.442 ± 0.388
0.883MetPro: 0.883 ± 0.384
0.883MetGln: 0.883 ± 0.742
2.649MetArg: 2.649 ± 1.343
2.208MetSer: 2.208 ± 1.199
3.532MetThr: 3.532 ± 1.499
1.325MetVal: 1.325 ± 0.704
0.0MetTrp: 0.0 ± 0.0
0.442MetTyr: 0.442 ± 0.354
0.0MetXaa: 0.0 ± 0.0
Asn
3.974AsnAla: 3.974 ± 1.25
0.442AsnCys: 0.442 ± 0.372
1.766AsnAsp: 1.766 ± 0.583
1.325AsnGlu: 1.325 ± 0.799
0.883AsnPhe: 0.883 ± 0.384
4.857AsnGly: 4.857 ± 1.974
0.0AsnHis: 0.0 ± 0.0
1.325AsnIle: 1.325 ± 0.779
1.766AsnLys: 1.766 ± 0.886
2.649AsnLeu: 2.649 ± 1.434
0.0AsnMet: 0.0 ± 0.0
0.442AsnAsn: 0.442 ± 0.388
3.532AsnPro: 3.532 ± 1.567
0.883AsnGln: 0.883 ± 0.384
1.325AsnArg: 1.325 ± 1.165
1.766AsnSer: 1.766 ± 0.764
1.766AsnThr: 1.766 ± 0.649
4.415AsnVal: 4.415 ± 0.892
0.442AsnTrp: 0.442 ± 0.354
1.325AsnTyr: 1.325 ± 0.634
0.0AsnXaa: 0.0 ± 0.0
Pro
5.74ProAla: 5.74 ± 1.32
0.883ProCys: 0.883 ± 0.615
1.325ProAsp: 1.325 ± 0.629
2.649ProGlu: 2.649 ± 1.496
0.0ProPhe: 0.0 ± 0.0
2.649ProGly: 2.649 ± 0.662
0.442ProHis: 0.442 ± 0.388
0.883ProIle: 0.883 ± 0.384
1.325ProLys: 1.325 ± 0.629
1.325ProLeu: 1.325 ± 0.833
1.766ProMet: 1.766 ± 0.761
1.325ProAsn: 1.325 ± 0.828
1.766ProPro: 1.766 ± 0.647
2.208ProGln: 2.208 ± 1.001
1.766ProArg: 1.766 ± 0.436
4.415ProSer: 4.415 ± 1.626
1.766ProThr: 1.766 ± 0.869
4.857ProVal: 4.857 ± 2.568
0.442ProTrp: 0.442 ± 0.354
0.883ProTyr: 0.883 ± 0.53
0.0ProXaa: 0.0 ± 0.0
Gln
5.298GlnAla: 5.298 ± 1.855
0.883GlnCys: 0.883 ± 0.695
2.649GlnAsp: 2.649 ± 1.02
0.883GlnGlu: 0.883 ± 0.758
0.883GlnPhe: 0.883 ± 0.758
3.091GlnGly: 3.091 ± 1.003
0.0GlnHis: 0.0 ± 0.0
2.649GlnIle: 2.649 ± 1.192
3.974GlnLys: 3.974 ± 1.31
3.091GlnLeu: 3.091 ± 1.503
0.883GlnMet: 0.883 ± 0.591
1.766GlnAsn: 1.766 ± 0.6
1.766GlnPro: 1.766 ± 0.743
2.649GlnGln: 2.649 ± 1.838
3.091GlnArg: 3.091 ± 1.101
3.532GlnSer: 3.532 ± 0.935
2.208GlnThr: 2.208 ± 1.031
1.325GlnVal: 1.325 ± 0.518
1.766GlnTrp: 1.766 ± 1.228
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
9.272ArgAla: 9.272 ± 2.633
1.325ArgCys: 1.325 ± 0.729
4.415ArgAsp: 4.415 ± 1.59
4.857ArgGlu: 4.857 ± 1.982
1.325ArgPhe: 1.325 ± 0.908
4.857ArgGly: 4.857 ± 1.769
1.766ArgHis: 1.766 ± 0.862
1.325ArgIle: 1.325 ± 0.523
2.649ArgLys: 2.649 ± 0.891
7.064ArgLeu: 7.064 ± 1.653
1.325ArgMet: 1.325 ± 0.742
0.442ArgAsn: 0.442 ± 0.372
2.208ArgPro: 2.208 ± 1.155
2.208ArgGln: 2.208 ± 1.126
3.974ArgArg: 3.974 ± 1.936
2.649ArgSer: 2.649 ± 0.721
2.649ArgThr: 2.649 ± 0.9
5.298ArgVal: 5.298 ± 1.311
1.766ArgTrp: 1.766 ± 1.112
1.766ArgTyr: 1.766 ± 0.763
0.0ArgXaa: 0.0 ± 0.0
Ser
8.83SerAla: 8.83 ± 1.823
0.883SerCys: 0.883 ± 0.614
2.649SerAsp: 2.649 ± 1.089
2.208SerGlu: 2.208 ± 1.032
1.325SerPhe: 1.325 ± 0.827
7.947SerGly: 7.947 ± 2.042
0.883SerHis: 0.883 ± 0.777
0.883SerIle: 0.883 ± 0.581
1.766SerLys: 1.766 ± 0.91
4.857SerLeu: 4.857 ± 2.237
3.532SerMet: 3.532 ± 1.033
3.091SerAsn: 3.091 ± 0.932
1.325SerPro: 1.325 ± 0.8
1.325SerGln: 1.325 ± 0.484
3.532SerArg: 3.532 ± 1.566
5.298SerSer: 5.298 ± 1.545
3.532SerThr: 3.532 ± 2.415
4.857SerVal: 4.857 ± 1.629
0.883SerTrp: 0.883 ± 0.537
1.325SerTyr: 1.325 ± 0.729
0.0SerXaa: 0.0 ± 0.0
Thr
5.298ThrAla: 5.298 ± 1.163
0.442ThrCys: 0.442 ± 0.379
2.208ThrAsp: 2.208 ± 1.189
4.415ThrGlu: 4.415 ± 0.856
1.325ThrPhe: 1.325 ± 0.779
6.181ThrGly: 6.181 ± 2.183
0.442ThrHis: 0.442 ± 0.372
1.325ThrIle: 1.325 ± 0.595
2.649ThrLys: 2.649 ± 1.005
4.857ThrLeu: 4.857 ± 1.69
1.766ThrMet: 1.766 ± 0.503
2.208ThrAsn: 2.208 ± 1.01
3.091ThrPro: 3.091 ± 1.254
2.208ThrGln: 2.208 ± 0.906
1.766ThrArg: 1.766 ± 0.743
3.532ThrSer: 3.532 ± 1.67
11.479ThrThr: 11.479 ± 6.019
4.857ThrVal: 4.857 ± 1.831
0.442ThrTrp: 0.442 ± 0.388
1.766ThrTyr: 1.766 ± 0.924
0.0ThrXaa: 0.0 ± 0.0
Val
15.011ValAla: 15.011 ± 2.557
0.442ValCys: 0.442 ± 0.354
3.532ValAsp: 3.532 ± 1.179
2.208ValGlu: 2.208 ± 1.536
2.649ValPhe: 2.649 ± 1.822
6.181ValGly: 6.181 ± 1.522
1.766ValHis: 1.766 ± 1.262
3.091ValIle: 3.091 ± 1.006
2.649ValLys: 2.649 ± 1.299
5.74ValLeu: 5.74 ± 1.237
1.325ValMet: 1.325 ± 0.663
3.091ValAsn: 3.091 ± 0.9
3.532ValPro: 3.532 ± 0.919
3.091ValGln: 3.091 ± 0.96
3.974ValArg: 3.974 ± 1.285
9.272ValSer: 9.272 ± 1.838
5.298ValThr: 5.298 ± 2.131
10.155ValVal: 10.155 ± 1.888
2.208ValTrp: 2.208 ± 1.246
1.325ValTyr: 1.325 ± 0.406
0.0ValXaa: 0.0 ± 0.0
Trp
2.208TrpAla: 2.208 ± 0.964
0.0TrpCys: 0.0 ± 0.0
0.442TrpAsp: 0.442 ± 0.372
0.883TrpGlu: 0.883 ± 0.62
0.0TrpPhe: 0.0 ± 0.0
1.325TrpGly: 1.325 ± 0.406
0.883TrpHis: 0.883 ± 0.491
0.442TrpIle: 0.442 ± 0.379
0.883TrpLys: 0.883 ± 0.538
3.091TrpLeu: 3.091 ± 1.171
0.883TrpMet: 0.883 ± 0.818
1.766TrpAsn: 1.766 ± 0.769
1.766TrpPro: 1.766 ± 0.946
0.883TrpGln: 0.883 ± 0.55
1.766TrpArg: 1.766 ± 0.879
0.883TrpSer: 0.883 ± 0.594
0.442TrpThr: 0.442 ± 0.388
1.325TrpVal: 1.325 ± 0.518
0.0TrpTrp: 0.0 ± 0.0
1.325TrpTyr: 1.325 ± 0.687
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.091TyrAla: 3.091 ± 0.796
0.442TyrCys: 0.442 ± 0.372
1.325TyrAsp: 1.325 ± 0.77
1.325TyrGlu: 1.325 ± 0.634
0.883TyrPhe: 0.883 ± 0.491
1.325TyrGly: 1.325 ± 0.406
0.0TyrHis: 0.0 ± 0.0
1.325TyrIle: 1.325 ± 1.115
1.325TyrLys: 1.325 ± 0.687
1.325TyrLeu: 1.325 ± 0.621
0.0TyrMet: 0.0 ± 0.0
0.883TyrAsn: 0.883 ± 0.708
0.442TyrPro: 0.442 ± 0.566
0.883TyrGln: 0.883 ± 0.384
2.208TyrArg: 2.208 ± 1.162
1.766TyrSer: 1.766 ± 0.891
1.766TyrThr: 1.766 ± 0.789
2.208TyrVal: 2.208 ± 0.708
0.0TyrTrp: 0.0 ± 0.0
0.442TyrTyr: 0.442 ± 0.372
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 13 proteins (2266 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski