Amino acid dipepetide frequency for Ralstonia phage RSM1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.273AlaAla: 15.273 ± 2.416
1.206AlaCys: 1.206 ± 0.973
6.029AlaAsp: 6.029 ± 1.525
5.627AlaGlu: 5.627 ± 2.122
4.019AlaPhe: 4.019 ± 1.179
8.039AlaGly: 8.039 ± 2.001
1.608AlaHis: 1.608 ± 0.669
6.431AlaIle: 6.431 ± 1.716
5.225AlaLys: 5.225 ± 1.554
12.46AlaLeu: 12.46 ± 1.816
6.029AlaMet: 6.029 ± 1.832
2.01AlaAsn: 2.01 ± 0.739
5.225AlaPro: 5.225 ± 2.439
6.833AlaGln: 6.833 ± 1.573
9.646AlaArg: 9.646 ± 1.806
5.627AlaSer: 5.627 ± 1.425
9.244AlaThr: 9.244 ± 2.685
13.264AlaVal: 13.264 ± 1.802
3.617AlaTrp: 3.617 ± 1.076
1.608AlaTyr: 1.608 ± 0.87
0.0AlaXaa: 0.0 ± 0.0
Cys
2.01CysAla: 2.01 ± 0.982
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.402CysGlu: 0.402 ± 0.323
0.402CysPhe: 0.402 ± 0.36
0.804CysGly: 0.804 ± 0.58
0.402CysHis: 0.402 ± 0.36
1.608CysIle: 1.608 ± 0.658
0.402CysLys: 0.402 ± 0.36
0.402CysLeu: 0.402 ± 0.528
0.402CysMet: 0.402 ± 0.31
0.804CysAsn: 0.804 ± 0.557
0.804CysPro: 0.804 ± 0.648
0.804CysGln: 0.804 ± 0.409
0.402CysArg: 0.402 ± 0.53
0.402CysSer: 0.402 ± 0.324
0.402CysThr: 0.402 ± 0.45
1.206CysVal: 1.206 ± 0.695
0.402CysTrp: 0.402 ± 0.323
0.402CysTyr: 0.402 ± 0.449
0.0CysXaa: 0.0 ± 0.0
Asp
5.627AspAla: 5.627 ± 1.195
0.0AspCys: 0.0 ± 0.0
1.608AspAsp: 1.608 ± 0.578
2.814AspGlu: 2.814 ± 0.848
1.206AspPhe: 1.206 ± 0.723
4.421AspGly: 4.421 ± 1.356
1.206AspHis: 1.206 ± 0.767
2.814AspIle: 2.814 ± 0.99
1.206AspLys: 1.206 ± 0.84
4.823AspLeu: 4.823 ± 1.396
0.804AspMet: 0.804 ± 0.43
0.804AspAsn: 0.804 ± 0.409
0.402AspPro: 0.402 ± 0.324
1.608AspGln: 1.608 ± 0.764
4.823AspArg: 4.823 ± 1.344
2.412AspSer: 2.412 ± 0.926
2.01AspThr: 2.01 ± 0.83
2.412AspVal: 2.412 ± 0.893
1.608AspTrp: 1.608 ± 0.749
1.608AspTyr: 1.608 ± 0.734
0.0AspXaa: 0.0 ± 0.0
Glu
5.627GluAla: 5.627 ± 1.581
0.402GluCys: 0.402 ± 0.36
2.412GluAsp: 2.412 ± 1.04
2.01GluGlu: 2.01 ± 0.795
4.019GluPhe: 4.019 ± 1.36
2.412GluGly: 2.412 ± 0.914
0.804GluHis: 0.804 ± 0.405
0.402GluIle: 0.402 ± 0.331
1.608GluLys: 1.608 ± 0.87
3.617GluLeu: 3.617 ± 1.555
0.0GluMet: 0.0 ± 0.0
1.608GluAsn: 1.608 ± 0.809
1.206GluPro: 1.206 ± 0.448
3.215GluGln: 3.215 ± 1.267
2.814GluArg: 2.814 ± 1.328
1.206GluSer: 1.206 ± 0.482
2.814GluThr: 2.814 ± 1.043
2.412GluVal: 2.412 ± 0.803
0.804GluTrp: 0.804 ± 0.382
0.804GluTyr: 0.804 ± 0.536
0.0GluXaa: 0.0 ± 0.0
Phe
2.814PheAla: 2.814 ± 1.261
0.0PheCys: 0.0 ± 0.0
1.608PheAsp: 1.608 ± 0.683
1.206PheGlu: 1.206 ± 0.482
4.019PhePhe: 4.019 ± 1.781
4.019PheGly: 4.019 ± 1.201
1.206PheHis: 1.206 ± 0.68
1.206PheIle: 1.206 ± 0.744
1.206PheLys: 1.206 ± 0.603
2.01PheLeu: 2.01 ± 0.847
1.206PheMet: 1.206 ± 0.594
2.412PheAsn: 2.412 ± 1.021
0.402PhePro: 0.402 ± 0.324
0.402PheGln: 0.402 ± 0.413
1.206PheArg: 1.206 ± 0.7
2.412PheSer: 2.412 ± 1.316
0.804PheThr: 0.804 ± 0.826
2.814PheVal: 2.814 ± 1.017
1.206PheTrp: 1.206 ± 0.562
0.402PheTyr: 0.402 ± 0.324
0.0PheXaa: 0.0 ± 0.0
Gly
10.852GlyAla: 10.852 ± 2.726
0.804GlyCys: 0.804 ± 0.516
3.617GlyAsp: 3.617 ± 0.84
1.206GlyGlu: 1.206 ± 0.717
4.019GlyPhe: 4.019 ± 1.716
8.842GlyGly: 8.842 ± 1.932
2.01GlyHis: 2.01 ± 0.721
2.01GlyIle: 2.01 ± 0.996
2.814GlyLys: 2.814 ± 1.172
4.823GlyLeu: 4.823 ± 2.352
2.01GlyMet: 2.01 ± 0.603
1.608GlyAsn: 1.608 ± 0.991
2.01GlyPro: 2.01 ± 0.813
2.01GlyGln: 2.01 ± 0.558
6.029GlyArg: 6.029 ± 1.384
6.029GlySer: 6.029 ± 2.633
5.627GlyThr: 5.627 ± 1.804
10.852GlyVal: 10.852 ± 2.534
2.01GlyTrp: 2.01 ± 0.847
2.814GlyTyr: 2.814 ± 0.968
0.0GlyXaa: 0.0 ± 0.0
His
1.206HisAla: 1.206 ± 0.702
0.402HisCys: 0.402 ± 0.323
1.206HisAsp: 1.206 ± 0.627
2.412HisGlu: 2.412 ± 0.955
0.804HisPhe: 0.804 ± 0.548
1.608HisGly: 1.608 ± 0.739
0.0HisHis: 0.0 ± 0.0
1.206HisIle: 1.206 ± 0.845
0.402HisLys: 0.402 ± 0.331
0.402HisLeu: 0.402 ± 0.418
0.804HisMet: 0.804 ± 0.721
0.804HisAsn: 0.804 ± 0.568
0.804HisPro: 0.804 ± 0.529
0.402HisGln: 0.402 ± 0.331
2.814HisArg: 2.814 ± 1.152
0.402HisSer: 0.402 ± 0.36
0.402HisThr: 0.402 ± 0.323
1.608HisVal: 1.608 ± 0.738
0.402HisTrp: 0.402 ± 0.323
1.206HisTyr: 1.206 ± 0.519
0.0HisXaa: 0.0 ± 0.0
Ile
6.833IleAla: 6.833 ± 2.223
0.0IleCys: 0.0 ± 0.0
1.608IleAsp: 1.608 ± 0.616
1.608IleGlu: 1.608 ± 0.798
1.608IlePhe: 1.608 ± 1.218
6.029IleGly: 6.029 ± 2.09
0.402IleHis: 0.402 ± 0.418
0.804IleIle: 0.804 ± 0.56
2.412IleLys: 2.412 ± 0.796
1.608IleLeu: 1.608 ± 0.923
0.402IleMet: 0.402 ± 0.323
2.412IleAsn: 2.412 ± 0.902
1.608IlePro: 1.608 ± 0.733
2.412IleGln: 2.412 ± 0.984
2.412IleArg: 2.412 ± 0.827
0.804IleSer: 0.804 ± 0.47
2.01IleThr: 2.01 ± 0.72
3.617IleVal: 3.617 ± 0.963
0.804IleTrp: 0.804 ± 0.837
0.402IleTyr: 0.402 ± 0.323
0.0IleXaa: 0.0 ± 0.0
Lys
4.421LysAla: 4.421 ± 1.082
0.402LysCys: 0.402 ± 0.323
2.814LysAsp: 2.814 ± 1.28
1.206LysGlu: 1.206 ± 0.448
0.402LysPhe: 0.402 ± 0.331
4.019LysGly: 4.019 ± 1.483
0.0LysHis: 0.0 ± 0.0
1.608LysIle: 1.608 ± 0.705
2.01LysLys: 2.01 ± 1.033
4.019LysLeu: 4.019 ± 1.341
0.402LysMet: 0.402 ± 0.41
0.402LysAsn: 0.402 ± 0.449
3.215LysPro: 3.215 ± 1.514
2.01LysGln: 2.01 ± 0.878
4.421LysArg: 4.421 ± 1.704
2.01LysSer: 2.01 ± 0.59
3.617LysThr: 3.617 ± 1.023
4.019LysVal: 4.019 ± 1.046
1.206LysTrp: 1.206 ± 0.753
1.608LysTyr: 1.608 ± 0.702
0.0LysXaa: 0.0 ± 0.0
Leu
8.039LeuAla: 8.039 ± 1.423
1.206LeuCys: 1.206 ± 0.84
5.627LeuAsp: 5.627 ± 1.643
2.412LeuGlu: 2.412 ± 0.959
2.814LeuPhe: 2.814 ± 1.251
4.019LeuGly: 4.019 ± 1.627
2.412LeuHis: 2.412 ± 1.1
2.412LeuIle: 2.412 ± 0.824
4.421LeuLys: 4.421 ± 1.127
7.637LeuLeu: 7.637 ± 3.507
1.206LeuMet: 1.206 ± 0.568
3.215LeuAsn: 3.215 ± 1.131
2.412LeuPro: 2.412 ± 0.936
3.215LeuGln: 3.215 ± 1.297
4.421LeuArg: 4.421 ± 2.197
4.019LeuSer: 4.019 ± 1.477
5.225LeuThr: 5.225 ± 1.456
6.029LeuVal: 6.029 ± 1.921
1.608LeuTrp: 1.608 ± 0.53
1.206LeuTyr: 1.206 ± 0.605
0.0LeuXaa: 0.0 ± 0.0
Met
4.019MetAla: 4.019 ± 0.734
0.804MetCys: 0.804 ± 0.409
0.402MetAsp: 0.402 ± 0.418
0.402MetGlu: 0.402 ± 0.323
0.402MetPhe: 0.402 ± 0.447
1.206MetGly: 1.206 ± 0.67
1.608MetHis: 1.608 ± 0.861
1.206MetIle: 1.206 ± 0.519
0.804MetLys: 0.804 ± 0.732
2.01MetLeu: 2.01 ± 0.901
0.402MetMet: 0.402 ± 0.324
1.206MetAsn: 1.206 ± 0.817
1.206MetPro: 1.206 ± 0.493
1.608MetGln: 1.608 ± 0.847
0.804MetArg: 0.804 ± 0.68
2.814MetSer: 2.814 ± 1.085
2.01MetThr: 2.01 ± 1.024
1.206MetVal: 1.206 ± 0.919
0.402MetTrp: 0.402 ± 0.41
0.804MetTyr: 0.804 ± 0.516
0.0MetXaa: 0.0 ± 0.0
Asn
4.421AsnAla: 4.421 ± 1.224
0.804AsnCys: 0.804 ± 0.43
2.01AsnAsp: 2.01 ± 0.959
0.402AsnGlu: 0.402 ± 0.331
0.402AsnPhe: 0.402 ± 0.331
2.412AsnGly: 2.412 ± 1.017
0.0AsnHis: 0.0 ± 0.0
0.804AsnIle: 0.804 ± 0.551
2.01AsnLys: 2.01 ± 0.882
1.206AsnLeu: 1.206 ± 0.608
0.0AsnMet: 0.0 ± 0.0
0.402AsnAsn: 0.402 ± 0.331
3.215AsnPro: 3.215 ± 1.422
1.608AsnGln: 1.608 ± 0.621
2.01AsnArg: 2.01 ± 0.997
1.206AsnSer: 1.206 ± 0.706
2.814AsnThr: 2.814 ± 0.831
2.814AsnVal: 2.814 ± 0.915
0.0AsnTrp: 0.0 ± 0.0
0.402AsnTyr: 0.402 ± 0.36
0.0AsnXaa: 0.0 ± 0.0
Pro
8.039ProAla: 8.039 ± 1.486
0.804ProCys: 0.804 ± 0.408
1.608ProAsp: 1.608 ± 0.604
2.412ProGlu: 2.412 ± 1.422
0.0ProPhe: 0.0 ± 0.0
3.215ProGly: 3.215 ± 0.775
0.804ProHis: 0.804 ± 0.529
1.608ProIle: 1.608 ± 0.621
0.804ProLys: 0.804 ± 0.408
1.206ProLeu: 1.206 ± 0.784
0.0ProMet: 0.0 ± 0.0
0.804ProAsn: 0.804 ± 0.461
2.01ProPro: 2.01 ± 0.784
2.814ProGln: 2.814 ± 0.943
2.814ProArg: 2.814 ± 1.452
4.421ProSer: 4.421 ± 1.791
3.617ProThr: 3.617 ± 1.937
4.421ProVal: 4.421 ± 1.743
0.402ProTrp: 0.402 ± 0.324
0.804ProTyr: 0.804 ± 0.487
0.0ProXaa: 0.0 ± 0.0
Gln
4.823GlnAla: 4.823 ± 1.499
0.804GlnCys: 0.804 ± 0.591
2.01GlnAsp: 2.01 ± 0.997
0.804GlnGlu: 0.804 ± 0.82
0.804GlnPhe: 0.804 ± 0.82
3.617GlnGly: 3.617 ± 1.151
0.0GlnHis: 0.0 ± 0.0
2.412GlnIle: 2.412 ± 0.859
4.019GlnLys: 4.019 ± 1.03
3.215GlnLeu: 3.215 ± 1.198
1.608GlnMet: 1.608 ± 0.769
0.804GlnAsn: 0.804 ± 0.405
2.814GlnPro: 2.814 ± 0.995
3.617GlnGln: 3.617 ± 1.57
5.225GlnArg: 5.225 ± 1.331
2.412GlnSer: 2.412 ± 0.88
3.617GlnThr: 3.617 ± 1.255
1.608GlnVal: 1.608 ± 0.621
2.01GlnTrp: 2.01 ± 0.845
0.804GlnTyr: 0.804 ± 0.408
0.0GlnXaa: 0.0 ± 0.0
Arg
5.627ArgAla: 5.627 ± 1.568
1.206ArgCys: 1.206 ± 0.725
4.421ArgAsp: 4.421 ± 1.654
3.215ArgGlu: 3.215 ± 1.379
2.01ArgPhe: 2.01 ± 1.056
4.019ArgGly: 4.019 ± 1.41
2.412ArgHis: 2.412 ± 1.057
2.814ArgIle: 2.814 ± 1.257
1.608ArgLys: 1.608 ± 0.562
5.627ArgLeu: 5.627 ± 1.456
2.814ArgMet: 2.814 ± 1.337
0.804ArgAsn: 0.804 ± 0.461
2.814ArgPro: 2.814 ± 1.568
4.019ArgGln: 4.019 ± 1.479
6.029ArgArg: 6.029 ± 2.988
5.225ArgSer: 5.225 ± 2.447
4.019ArgThr: 4.019 ± 1.703
8.039ArgVal: 8.039 ± 1.545
2.814ArgTrp: 2.814 ± 1.515
0.402ArgTyr: 0.402 ± 0.331
0.0ArgXaa: 0.0 ± 0.0
Ser
10.45SerAla: 10.45 ± 1.533
0.804SerCys: 0.804 ± 0.473
2.01SerAsp: 2.01 ± 0.665
2.01SerGlu: 2.01 ± 0.557
1.608SerPhe: 1.608 ± 0.793
8.441SerGly: 8.441 ± 2.191
0.804SerHis: 0.804 ± 0.661
2.01SerIle: 2.01 ± 0.998
2.814SerLys: 2.814 ± 0.933
3.617SerLeu: 3.617 ± 2.426
3.215SerMet: 3.215 ± 0.978
1.608SerAsn: 1.608 ± 0.673
2.01SerPro: 2.01 ± 0.951
2.412SerGln: 2.412 ± 0.746
4.019SerArg: 4.019 ± 1.791
4.019SerSer: 4.019 ± 1.107
3.617SerThr: 3.617 ± 2.044
5.225SerVal: 5.225 ± 2.014
0.804SerTrp: 0.804 ± 0.496
0.804SerTyr: 0.804 ± 0.721
0.0SerXaa: 0.0 ± 0.0
Thr
7.235ThrAla: 7.235 ± 1.445
0.804ThrCys: 0.804 ± 0.771
1.608ThrAsp: 1.608 ± 0.997
5.627ThrGlu: 5.627 ± 1.118
1.608ThrPhe: 1.608 ± 0.727
4.823ThrGly: 4.823 ± 1.785
1.206ThrHis: 1.206 ± 0.845
3.215ThrIle: 3.215 ± 0.853
3.215ThrLys: 3.215 ± 0.871
4.421ThrLeu: 4.421 ± 1.361
1.608ThrMet: 1.608 ± 0.501
1.608ThrAsn: 1.608 ± 0.922
3.617ThrPro: 3.617 ± 1.497
2.814ThrGln: 2.814 ± 1.104
3.215ThrArg: 3.215 ± 1.289
4.019ThrSer: 4.019 ± 1.609
10.852ThrThr: 10.852 ± 5.704
4.823ThrVal: 4.823 ± 1.185
0.804ThrTrp: 0.804 ± 0.518
1.206ThrTyr: 1.206 ± 0.754
0.0ThrXaa: 0.0 ± 0.0
Val
16.881ValAla: 16.881 ± 2.742
1.206ValCys: 1.206 ± 0.749
1.206ValAsp: 1.206 ± 0.496
3.215ValGlu: 3.215 ± 1.319
1.608ValPhe: 1.608 ± 0.784
6.833ValGly: 6.833 ± 1.787
1.206ValHis: 1.206 ± 0.992
2.412ValIle: 2.412 ± 0.91
3.215ValLys: 3.215 ± 0.998
7.235ValLeu: 7.235 ± 1.419
1.206ValMet: 1.206 ± 0.603
2.412ValAsn: 2.412 ± 0.906
4.421ValPro: 4.421 ± 1.202
3.617ValGln: 3.617 ± 1.09
4.823ValArg: 4.823 ± 1.769
10.048ValSer: 10.048 ± 1.516
3.215ValThr: 3.215 ± 1.035
11.254ValVal: 11.254 ± 2.683
3.215ValTrp: 3.215 ± 1.182
2.412ValTyr: 2.412 ± 0.603
0.0ValXaa: 0.0 ± 0.0
Trp
2.01TrpAla: 2.01 ± 0.621
0.0TrpCys: 0.0 ± 0.0
1.206TrpAsp: 1.206 ± 0.653
0.804TrpGlu: 0.804 ± 0.601
0.0TrpPhe: 0.0 ± 0.0
0.804TrpGly: 0.804 ± 0.405
0.804TrpHis: 0.804 ± 0.405
0.804TrpIle: 0.804 ± 0.496
1.608TrpLys: 1.608 ± 1.06
2.814TrpLeu: 2.814 ± 0.739
0.804TrpMet: 0.804 ± 0.61
2.01TrpAsn: 2.01 ± 0.78
2.01TrpPro: 2.01 ± 0.665
0.804TrpGln: 0.804 ± 0.551
1.608TrpArg: 1.608 ± 0.754
1.608TrpSer: 1.608 ± 0.967
1.608TrpThr: 1.608 ± 0.637
0.804TrpVal: 0.804 ± 0.382
0.0TrpTrp: 0.0 ± 0.0
2.412TrpTyr: 2.412 ± 0.746
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.814TyrAla: 2.814 ± 0.806
0.804TyrCys: 0.804 ± 0.546
1.206TyrAsp: 1.206 ± 0.67
0.402TyrGlu: 0.402 ± 0.449
0.804TyrPhe: 0.804 ± 0.405
2.412TyrGly: 2.412 ± 0.605
0.402TyrHis: 0.402 ± 0.323
2.01TyrIle: 2.01 ± 1.207
2.01TyrLys: 2.01 ± 0.62
0.402TyrLeu: 0.402 ± 0.331
0.0TyrMet: 0.0 ± 0.0
1.206TyrAsn: 1.206 ± 0.663
0.402TyrPro: 0.402 ± 0.413
0.804TyrGln: 0.804 ± 0.382
0.804TyrArg: 0.804 ± 0.43
1.206TyrSer: 1.206 ± 0.565
0.804TyrThr: 0.804 ± 0.461
3.215TyrVal: 3.215 ± 1.385
0.402TyrTrp: 0.402 ± 0.323
0.402TyrTyr: 0.402 ± 0.36
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 15 proteins (2489 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski