Amino acid dipeptide frequency for Ralstonia phage 1 NP-2014

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be expected at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
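As a minimal sketch of how such frequencies could be computed (not necessarily the procedure used to build this table), the Python snippet below counts overlapping dipeptides within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the toy sequences, and the decision not to count pairs that span two proteins are assumptions made for illustration only.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: dipeptides are overlapping windows of length 2 inside each
    protein; pairs spanning two different proteins are not counted.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400

# Hypothetical toy sequences (one-letter code), not taken from this proteome.
toy_proteins = ["MAALAA", "MKLAA"]
freqs = dipeptide_frequencies(toy_proteins)
for pair, value in sorted(freqs.items()):
    label = "over" if value > EXPECTED_PER_MILLE else "under"
    print(f"{pair}: {value:.1f} per mille ({label}-represented)")
```

With only a handful of residues every observed dipeptide is far above 2.5‰, so the toy output is only meant to show the mechanics; a real proteome is needed for meaningful over- and underrepresentation.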

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
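The conversion can be illustrated with a short Python sketch. The helper name `per_mille_to_fraction` is hypothetical; the value 17.506 is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


ala_ala = 17.506  # AlaAla frequency from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)
percent = fraction * 100  # the same value expressed in percent

print(f"fraction: {fraction:.6f}")
print(f"percent:  {percent:.4f}")
```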

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 17.506 ± 4.413
AlaCys: 2.135 ± 0.871
AlaAsp: 7.259 ± 2.305
AlaGlu: 6.405 ± 1.24
AlaPhe: 2.989 ± 1.613
AlaGly: 7.259 ± 1.61
AlaHis: 2.562 ± 1.275
AlaIle: 7.259 ± 1.716
AlaLys: 6.832 ± 1.802
AlaLeu: 10.248 ± 1.529
AlaMet: 5.978 ± 2.292
AlaAsn: 3.416 ± 1.362
AlaPro: 1.281 ± 0.529
AlaGln: 4.697 ± 1.413
AlaArg: 8.967 ± 2.291
AlaSer: 9.394 ± 2.021
AlaThr: 2.989 ± 1.107
AlaVal: 8.113 ± 1.704
AlaTrp: 2.135 ± 0.862
AlaTyr: 2.135 ± 0.951
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.708 ± 0.8
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.427 ± 0.507
CysPhe: 0.854 ± 0.628
CysGly: 2.989 ± 0.975
CysHis: 0.854 ± 0.445
CysIle: 1.281 ± 0.812
CysLys: 0.854 ± 0.445
CysLeu: 0.854 ± 0.557
CysMet: 0.854 ± 0.552
CysAsn: 1.281 ± 0.725
CysPro: 0.427 ± 0.369
CysGln: 0.854 ± 0.462
CysArg: 0.854 ± 0.636
CysSer: 0.427 ± 0.309
CysThr: 2.562 ± 1.257
CysVal: 1.281 ± 0.708
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.967 ± 1.968
AspCys: 0.427 ± 0.4
AspAsp: 2.135 ± 1.148
AspGlu: 4.697 ± 1.934
AspPhe: 1.708 ± 0.907
AspGly: 5.124 ± 1.475
AspHis: 0.427 ± 0.495
AspIle: 0.854 ± 0.8
AspLys: 2.135 ± 0.901
AspLeu: 5.978 ± 1.956
AspMet: 0.427 ± 0.4
AspAsn: 0.427 ± 0.495
AspPro: 2.562 ± 0.959
AspGln: 1.281 ± 0.676
AspArg: 4.697 ± 2.517
AspSer: 3.843 ± 0.927
AspThr: 1.708 ± 0.877
AspVal: 2.135 ± 0.767
AspTrp: 1.708 ± 0.783
AspTyr: 1.281 ± 0.718
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.405 ± 2.179
GluCys: 1.281 ± 0.606
GluAsp: 1.708 ± 0.721
GluGlu: 3.416 ± 1.429
GluPhe: 2.562 ± 1.229
GluGly: 2.989 ± 1.023
GluHis: 0.427 ± 0.538
GluIle: 0.854 ± 0.645
GluLys: 2.135 ± 1.308
GluLeu: 3.843 ± 1.181
GluMet: 1.281 ± 0.996
GluAsn: 0.854 ± 0.8
GluPro: 1.281 ± 1.088
GluGln: 3.843 ± 1.329
GluArg: 4.27 ± 1.377
GluSer: 1.708 ± 0.815
GluThr: 4.27 ± 1.082
GluVal: 1.708 ± 0.777
GluTrp: 1.281 ± 0.601
GluTyr: 2.989 ± 1.141
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.843 ± 1.675
PheCys: 0.854 ± 0.65
PheAsp: 2.562 ± 1.126
PheGlu: 0.854 ± 0.523
PhePhe: 1.281 ± 0.849
PheGly: 1.281 ± 1.0
PheHis: 0.854 ± 0.507
PheIle: 2.135 ± 0.98
PheLys: 0.427 ± 0.457
PheLeu: 3.843 ± 1.378
PheMet: 1.708 ± 0.764
PheAsn: 1.708 ± 0.739
PhePro: 0.854 ± 0.422
PheGln: 1.281 ± 0.681
PheArg: 2.989 ± 1.128
PheSer: 2.135 ± 0.652
PheThr: 1.281 ± 0.81
PheVal: 2.135 ± 1.067
PheTrp: 0.854 ± 0.66
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.405 ± 1.417
GlyCys: 0.0 ± 0.0
GlyAsp: 3.843 ± 1.041
GlyGlu: 1.708 ± 0.834
GlyPhe: 4.27 ± 1.751
GlyGly: 4.697 ± 1.804
GlyHis: 0.854 ± 0.585
GlyIle: 3.843 ± 1.193
GlyLys: 3.843 ± 1.546
GlyLeu: 5.978 ± 2.056
GlyMet: 4.27 ± 1.357
GlyAsn: 2.562 ± 0.879
GlyPro: 3.843 ± 1.156
GlyGln: 1.708 ± 0.599
GlyArg: 7.259 ± 1.814
GlySer: 5.551 ± 1.73
GlyThr: 5.124 ± 2.255
GlyVal: 4.697 ± 1.934
GlyTrp: 2.135 ± 1.001
GlyTyr: 2.989 ± 1.062
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.854 ± 0.654
HisCys: 0.0 ± 0.0
HisAsp: 1.708 ± 1.229
HisGlu: 0.427 ± 0.457
HisPhe: 1.281 ± 0.63
HisGly: 1.708 ± 0.926
HisHis: 0.427 ± 0.538
HisIle: 1.281 ± 0.468
HisLys: 0.427 ± 0.405
HisLeu: 2.135 ± 1.406
HisMet: 2.562 ± 1.043
HisAsn: 0.427 ± 0.507
HisPro: 0.0 ± 0.0
HisGln: 0.854 ± 0.847
HisArg: 0.854 ± 0.8
HisSer: 0.427 ± 0.369
HisThr: 0.0 ± 0.0
HisVal: 1.708 ± 0.556
HisTrp: 0.854 ± 1.077
HisTyr: 1.281 ± 0.641
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.686 ± 2.518
IleCys: 0.427 ± 0.369
IleAsp: 2.135 ± 0.786
IleGlu: 2.989 ± 0.933
IlePhe: 1.708 ± 0.813
IleGly: 2.989 ± 1.148
IleHis: 0.427 ± 0.538
IleIle: 2.135 ± 0.942
IleLys: 2.562 ± 0.973
IleLeu: 2.562 ± 0.768
IleMet: 0.427 ± 0.4
IleAsn: 1.281 ± 0.552
IlePro: 1.708 ± 0.97
IleGln: 0.854 ± 0.657
IleArg: 4.27 ± 1.023
IleSer: 2.135 ± 1.177
IleThr: 2.562 ± 0.812
IleVal: 4.697 ± 0.995
IleTrp: 0.854 ± 0.674
IleTyr: 0.854 ± 0.527
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.843 ± 1.229
LysCys: 0.427 ± 0.309
LysAsp: 2.562 ± 0.923
LysGlu: 0.427 ± 0.4
LysPhe: 0.427 ± 0.402
LysGly: 1.708 ± 0.609
LysHis: 1.281 ± 0.846
LysIle: 1.708 ± 0.892
LysLys: 1.281 ± 0.762
LysLeu: 8.54 ± 2.119
LysMet: 0.427 ± 0.417
LysAsn: 0.854 ± 0.59
LysPro: 2.989 ± 0.705
LysGln: 2.135 ± 1.321
LysArg: 4.27 ± 1.264
LysSer: 1.708 ± 0.609
LysThr: 1.708 ± 0.714
LysVal: 4.697 ± 1.615
LysTrp: 0.427 ± 0.463
LysTyr: 1.708 ± 0.791
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.102 ± 2.355
LeuCys: 1.708 ± 0.587
LeuAsp: 7.686 ± 2.294
LeuGlu: 2.989 ± 0.991
LeuPhe: 4.27 ± 1.583
LeuGly: 6.405 ± 1.521
LeuHis: 1.281 ± 0.981
LeuIle: 5.124 ± 1.743
LeuLys: 5.978 ± 1.65
LeuLeu: 8.967 ± 1.733
LeuMet: 0.854 ± 0.668
LeuAsn: 2.562 ± 1.109
LeuPro: 3.843 ± 1.719
LeuGln: 2.989 ± 1.256
LeuArg: 5.124 ± 1.323
LeuSer: 3.843 ± 1.019
LeuThr: 4.697 ± 1.65
LeuVal: 6.832 ± 1.606
LeuTrp: 2.989 ± 0.895
LeuTyr: 2.135 ± 1.008
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.416 ± 1.019
MetCys: 0.0 ± 0.0
MetAsp: 1.708 ± 1.024
MetGlu: 1.281 ± 0.695
MetPhe: 0.854 ± 1.068
MetGly: 2.135 ± 1.326
MetHis: 2.135 ± 0.839
MetIle: 0.854 ± 0.645
MetLys: 0.854 ± 0.573
MetLeu: 2.562 ± 0.985
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 2.562 ± 0.965
MetGln: 1.708 ± 0.984
MetArg: 0.854 ± 0.584
MetSer: 4.27 ± 1.378
MetThr: 0.854 ± 0.498
MetVal: 2.989 ± 0.783
MetTrp: 0.427 ± 0.463
MetTyr: 0.427 ± 0.424
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.843 ± 0.958
AsnCys: 0.427 ± 0.507
AsnAsp: 0.854 ± 0.739
AsnGlu: 1.281 ± 0.601
AsnPhe: 0.427 ± 0.369
AsnGly: 4.27 ± 1.279
AsnHis: 0.0 ± 0.0
AsnIle: 0.854 ± 0.472
AsnLys: 2.135 ± 0.828
AsnLeu: 2.562 ± 1.176
AsnMet: 0.854 ± 0.447
AsnAsn: 0.854 ± 0.445
AsnPro: 2.562 ± 1.504
AsnGln: 0.0 ± 0.0
AsnArg: 0.854 ± 0.53
AsnSer: 1.708 ± 0.918
AsnThr: 0.854 ± 0.557
AsnVal: 2.135 ± 1.291
AsnTrp: 0.854 ± 0.59
AsnTyr: 0.427 ± 0.4
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.27 ± 1.395
ProCys: 0.854 ± 0.549
ProAsp: 2.562 ± 0.881
ProGlu: 3.416 ± 1.915
ProPhe: 0.854 ± 0.65
ProGly: 3.843 ± 1.099
ProHis: 0.427 ± 0.309
ProIle: 2.135 ± 0.885
ProLys: 2.562 ± 0.798
ProLeu: 1.708 ± 0.963
ProMet: 0.427 ± 0.309
ProAsn: 1.281 ± 0.927
ProPro: 0.427 ± 0.538
ProGln: 2.562 ± 0.784
ProArg: 2.135 ± 0.751
ProSer: 4.27 ± 1.409
ProThr: 1.708 ± 0.97
ProVal: 3.416 ± 1.707
ProTrp: 0.854 ± 0.585
ProTyr: 1.281 ± 0.789
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.551 ± 1.65
GlnCys: 1.281 ± 0.635
GlnAsp: 0.854 ± 0.738
GlnGlu: 2.135 ± 1.361
GlnPhe: 1.281 ± 0.871
GlnGly: 1.708 ± 0.648
GlnHis: 0.854 ± 0.514
GlnIle: 1.281 ± 0.653
GlnLys: 2.562 ± 1.053
GlnLeu: 5.124 ± 1.922
GlnMet: 0.0 ± 0.0
GlnAsn: 1.281 ± 0.568
GlnPro: 2.989 ± 0.968
GlnGln: 2.562 ± 0.983
GlnArg: 3.416 ± 1.184
GlnSer: 0.854 ± 0.422
GlnThr: 1.708 ± 0.725
GlnVal: 2.989 ± 1.178
GlnTrp: 1.281 ± 0.468
GlnTyr: 0.854 ± 0.452
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.113 ± 2.477
ArgCys: 1.708 ± 0.865
ArgAsp: 2.562 ± 1.233
ArgGlu: 8.54 ± 2.956
ArgPhe: 1.708 ± 0.829
ArgGly: 5.978 ± 1.527
ArgHis: 2.135 ± 0.966
ArgIle: 3.416 ± 1.405
ArgLys: 3.416 ± 1.687
ArgLeu: 5.124 ± 1.648
ArgMet: 4.27 ± 1.931
ArgAsn: 2.135 ± 0.769
ArgPro: 2.562 ± 1.589
ArgGln: 2.562 ± 1.209
ArgArg: 5.978 ± 2.104
ArgSer: 4.27 ± 2.142
ArgThr: 2.562 ± 1.436
ArgVal: 4.697 ± 0.968
ArgTrp: 0.854 ± 0.575
ArgTyr: 0.854 ± 0.676
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.259 ± 2.12
SerCys: 0.854 ± 0.59
SerAsp: 3.416 ± 1.163
SerGlu: 1.708 ± 1.001
SerPhe: 0.854 ± 0.755
SerGly: 4.697 ± 1.628
SerHis: 0.854 ± 0.657
SerIle: 2.135 ± 0.863
SerLys: 0.0 ± 0.0
SerLeu: 5.978 ± 1.701
SerMet: 1.708 ± 1.064
SerAsn: 1.708 ± 0.589
SerPro: 3.416 ± 2.173
SerGln: 2.989 ± 1.222
SerArg: 3.843 ± 1.09
SerSer: 5.978 ± 3.267
SerThr: 4.27 ± 1.5
SerVal: 4.697 ± 1.551
SerTrp: 1.708 ± 0.97
SerTyr: 3.416 ± 1.092
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.843 ± 1.448
ThrCys: 0.854 ± 0.618
ThrAsp: 2.135 ± 0.617
ThrGlu: 2.562 ± 0.917
ThrPhe: 1.708 ± 0.939
ThrGly: 3.843 ± 1.122
ThrHis: 1.708 ± 0.684
ThrIle: 2.562 ± 1.116
ThrLys: 1.708 ± 0.642
ThrLeu: 4.27 ± 0.741
ThrMet: 1.281 ± 0.669
ThrAsn: 2.135 ± 0.947
ThrPro: 2.562 ± 0.744
ThrGln: 3.843 ± 1.774
ThrArg: 2.989 ± 1.278
ThrSer: 2.135 ± 0.867
ThrThr: 9.821 ± 3.692
ThrVal: 4.27 ± 1.285
ThrTrp: 0.427 ± 0.309
ThrTyr: 0.854 ± 0.496
ThrXaa: 0.0 ± 0.0
Val
ValAla: 11.529 ± 2.603
ValCys: 3.416 ± 1.242
ValAsp: 1.708 ± 0.924
ValGlu: 2.562 ± 1.246
ValPhe: 1.281 ± 0.628
ValGly: 6.832 ± 1.957
ValHis: 0.854 ± 0.522
ValIle: 2.989 ± 0.917
ValLys: 2.562 ± 0.961
ValLeu: 5.124 ± 2.203
ValMet: 1.281 ± 0.717
ValAsn: 2.135 ± 0.801
ValPro: 2.989 ± 0.983
ValGln: 2.135 ± 0.805
ValArg: 6.405 ± 1.032
ValSer: 4.697 ± 1.449
ValThr: 3.416 ± 1.218
ValVal: 8.113 ± 1.753
ValTrp: 1.281 ± 0.62
ValTyr: 2.562 ± 0.671
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.427 ± 0.417
TrpCys: 0.854 ± 0.532
TrpAsp: 1.708 ± 0.56
TrpGlu: 0.427 ± 0.417
TrpPhe: 1.708 ± 0.711
TrpGly: 2.562 ± 0.836
TrpHis: 0.854 ± 0.507
TrpIle: 0.427 ± 0.538
TrpLys: 0.427 ± 0.4
TrpLeu: 3.416 ± 0.921
TrpMet: 0.854 ± 0.574
TrpAsn: 0.427 ± 0.309
TrpPro: 1.281 ± 0.684
TrpGln: 1.281 ± 0.774
TrpArg: 1.281 ± 0.694
TrpSer: 0.427 ± 0.369
TrpThr: 1.708 ± 0.889
TrpVal: 0.0 ± 0.0
TrpTrp: 0.427 ± 0.369
TrpTyr: 1.281 ± 0.877
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.416 ± 0.766
TyrCys: 0.854 ± 0.645
TyrAsp: 3.416 ± 1.389
TyrGlu: 0.854 ± 0.645
TyrPhe: 0.854 ± 0.446
TyrGly: 2.562 ± 0.887
TyrHis: 0.0 ± 0.0
TyrIle: 2.135 ± 1.172
TyrLys: 0.427 ± 0.424
TyrLeu: 2.562 ± 1.251
TyrMet: 0.0 ± 0.0
TyrAsn: 0.427 ± 0.369
TyrPro: 0.854 ± 0.618
TyrGln: 0.427 ± 0.507
TyrArg: 2.135 ± 0.903
TyrSer: 1.708 ± 0.839
TyrThr: 1.708 ± 0.674
TyrVal: 2.562 ± 1.243
TyrTrp: 0.427 ± 0.309
TyrTyr: 0.427 ± 0.4
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 16 proteins (2343 amino acids)
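Each table entry ends with a dipeptide name, its frequency, and its error, for example `AlaAla: 17.506 ± 4.413`. The following Python sketch shows one way such entries could be parsed and compared against the uniform expectation of 2.5‰. The regular expression, the function names `parse_entry` and `classify`, and the decision to ignore the error when classifying are assumptions made for illustration; the four sample lines are copied from the table.

```python
import re

# Two three-letter residue codes, a colon, the frequency, "±", and the error.
ENTRY = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}): (\d+(?:\.\d+)?) ± (\d+(?:\.\d+)?)"
)

# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def parse_entry(line):
    """Return (first, second, frequency, error) or None if no entry is found."""
    match = ENTRY.search(line)
    if match is None:
        return None
    first, second, freq, err = match.groups()
    return first, second, float(freq), float(err)


def classify(freq):
    """Compare a per mille frequency with the uniform expectation."""
    if freq > EXPECTED_PER_MILLE:
        return "over"
    if freq < EXPECTED_PER_MILLE:
        return "under"
    return "as expected"


sample_lines = [
    "AlaAla: 17.506 ± 4.413",
    "AlaPro: 1.281 ± 0.529",
    "ThrThr: 9.821 ± 3.692",
    "CysCys: 0.0 ± 0.0",
]
for line in sample_lines:
    first, second, freq, err = parse_entry(line)
    print(f"{first}-{second}: {freq} per mille, {classify(freq)}")
```

A stricter comparison could also take the error into account, for instance by only flagging a dipeptide when the expectation lies outside the reported ± interval.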

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
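A protein-level bootstrap can be sketched in Python as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. This is a minimal sketch, assuming the reported error is the standard deviation across replicates; the source only states that bootstrapping (x100) was done at the protein level. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics


def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter code) in per mille."""
    total = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not taken from the phage.
toy_proteins = ["MAALAA", "MKLAA", "MGSTT"]
value = dipeptide_per_mille(toy_proteins, "AA")
error = bootstrap_error(toy_proteins, "AA")
print(f"AA: {value:.3f} ± {error:.3f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much the frequency depends on which proteins happen to be in the proteome.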

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski