Amino acid dipeptide frequency for Erwiniaceae bacterium PD-1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with every amino acid equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
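The counting and the comparison against the uniform expectation can be sketched as follows. This is a minimal illustration, not the Proteome-pI implementation: the function names `dipeptide_permille` and `classify` are hypothetical, sequences are assumed to be in one-letter code, pairs are assumed to be counted with overlap and never across protein boundaries, and any letter outside the 20 standard ones is assumed to map to X (Xaa).

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"
# Uniform model from the text: 1 / 400 = 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: non-standard or unknown residues are mapped to X (Xaa).
        clean = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # zip() pairs each residue with its successor inside one protein only.
        counts.update(a + b for a, b in zip(clean, clean[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


def classify(permille, expected=EXPECTED_PERMILLE):
    """Label a value relative to the expected per-mille frequency."""
    if permille > expected:
        return "over"
    if permille < expected:
        return "under"
    return "expected"
```

For example, `classify(11.167)` (the AlaAla value in the table) yields "over", while `classify(0.164)` (CysCys) yields "under". Comparing against a flat 2.5‰ ignores the very different abundances of the individual amino acids, so it is only the simple baseline described above.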

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
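The unit conversion is simple arithmetic; the short sketch below only spells it out. The helper names are hypothetical and introduced for illustration.

```python
def permille_to_fraction(permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return permille * 1e-3


def permille_to_percent(permille):
    """Convert a per-mille value to percent (divide by 10)."""
    return permille / 10


# AlaAla from the table: 11.167 per mille
ala_ala_fraction = permille_to_fraction(11.167)
ala_ala_percent = permille_to_percent(11.167)
```

On the same scale, the uniform expectation of 0.25% corresponds to 2.5‰.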

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.167 ± 0.116
AlaCys: 1.055 ± 0.028
AlaAsp: 5.069 ± 0.064
AlaGlu: 6.065 ± 0.071
AlaPhe: 3.632 ± 0.043
AlaGly: 7.852 ± 0.107
AlaHis: 2.02 ± 0.04
AlaIle: 5.893 ± 0.062
AlaLys: 3.603 ± 0.067
AlaLeu: 12.762 ± 0.13
AlaMet: 2.968 ± 0.045
AlaAsn: 3.16 ± 0.053
AlaPro: 3.831 ± 0.058
AlaGln: 4.971 ± 0.084
AlaArg: 5.979 ± 0.085
AlaSer: 5.776 ± 0.071
AlaThr: 4.96 ± 0.066
AlaVal: 6.83 ± 0.072
AlaTrp: 1.764 ± 0.036
AlaTyr: 1.973 ± 0.038
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.898 ± 0.024
CysCys: 0.164 ± 0.012
CysAsp: 0.55 ± 0.02
CysGlu: 0.518 ± 0.018
CysPhe: 0.407 ± 0.017
CysGly: 1.0 ± 0.025
CysHis: 0.3 ± 0.016
CysIle: 0.507 ± 0.022
CysLys: 0.275 ± 0.014
CysLeu: 1.016 ± 0.027
CysMet: 0.219 ± 0.013
CysAsn: 0.273 ± 0.013
CysPro: 0.433 ± 0.017
CysGln: 0.491 ± 0.02
CysArg: 0.62 ± 0.021
CysSer: 0.627 ± 0.022
CysThr: 0.408 ± 0.016
CysVal: 0.728 ± 0.026
CysTrp: 0.172 ± 0.012
CysTyr: 0.318 ± 0.015
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.245 ± 0.07
AspCys: 0.462 ± 0.02
AspAsp: 2.844 ± 0.058
AspGlu: 3.541 ± 0.055
AspPhe: 2.262 ± 0.04
AspGly: 3.741 ± 0.071
AspHis: 1.008 ± 0.026
AspIle: 3.25 ± 0.049
AspLys: 2.385 ± 0.051
AspLeu: 4.621 ± 0.064
AspMet: 1.272 ± 0.028
AspAsn: 2.155 ± 0.045
AspPro: 2.223 ± 0.041
AspGln: 1.659 ± 0.039
AspArg: 2.946 ± 0.051
AspSer: 2.785 ± 0.046
AspThr: 2.476 ± 0.043
AspVal: 3.529 ± 0.061
AspTrp: 0.792 ± 0.028
AspTyr: 1.849 ± 0.038
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.489 ± 0.062
GluCys: 0.445 ± 0.016
GluAsp: 2.177 ± 0.047
GluGlu: 3.258 ± 0.058
GluPhe: 1.676 ± 0.039
GluGly: 3.531 ± 0.052
GluHis: 1.331 ± 0.032
GluIle: 3.233 ± 0.055
GluLys: 3.068 ± 0.057
GluLeu: 5.53 ± 0.066
GluMet: 1.748 ± 0.041
GluAsn: 2.308 ± 0.046
GluPro: 2.02 ± 0.035
GluGln: 3.488 ± 0.055
GluArg: 3.745 ± 0.062
GluSer: 2.968 ± 0.048
GluThr: 3.016 ± 0.044
GluVal: 3.828 ± 0.06
GluTrp: 0.775 ± 0.023
GluTyr: 1.376 ± 0.032
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.697 ± 0.055
PheCys: 0.519 ± 0.021
PheAsp: 2.27 ± 0.042
PheGlu: 1.668 ± 0.043
PhePhe: 1.673 ± 0.043
PheGly: 3.188 ± 0.054
PheHis: 0.824 ± 0.024
PheIle: 2.553 ± 0.049
PheLys: 1.172 ± 0.029
PheLeu: 3.445 ± 0.058
PheMet: 0.943 ± 0.028
PheAsn: 1.743 ± 0.037
PhePro: 1.563 ± 0.036
PheGln: 1.196 ± 0.031
PheArg: 1.947 ± 0.037
PheSer: 3.194 ± 0.046
PheThr: 2.34 ± 0.046
PheVal: 2.297 ± 0.05
PheTrp: 0.617 ± 0.023
PheTyr: 1.233 ± 0.028
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.585 ± 0.08
GlyCys: 0.968 ± 0.025
GlyAsp: 3.673 ± 0.06
GlyGlu: 4.644 ± 0.068
GlyPhe: 3.277 ± 0.055
GlyGly: 5.531 ± 0.084
GlyHis: 1.676 ± 0.037
GlyIle: 4.922 ± 0.061
GlyLys: 3.819 ± 0.059
GlyLeu: 7.65 ± 0.084
GlyMet: 2.273 ± 0.047
GlyAsn: 2.811 ± 0.067
GlyPro: 2.058 ± 0.037
GlyGln: 3.048 ± 0.051
GlyArg: 3.976 ± 0.058
GlySer: 4.378 ± 0.069
GlyThr: 3.724 ± 0.072
GlyVal: 5.729 ± 0.078
GlyTrp: 1.385 ± 0.033
GlyTyr: 2.608 ± 0.047
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.1 ± 0.04
HisCys: 0.336 ± 0.016
HisAsp: 1.204 ± 0.033
HisGlu: 1.005 ± 0.028
HisPhe: 1.086 ± 0.029
HisGly: 1.775 ± 0.039
HisHis: 0.855 ± 0.027
HisIle: 1.28 ± 0.033
HisLys: 0.647 ± 0.022
HisLeu: 2.422 ± 0.043
HisMet: 0.522 ± 0.018
HisAsn: 0.811 ± 0.023
HisPro: 1.417 ± 0.033
HisGln: 1.449 ± 0.037
HisArg: 1.326 ± 0.031
HisSer: 1.298 ± 0.032
HisThr: 1.105 ± 0.031
HisVal: 1.247 ± 0.029
HisTrp: 0.452 ± 0.019
HisTyr: 0.954 ± 0.027
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.436 ± 0.072
IleCys: 0.591 ± 0.02
IleAsp: 3.544 ± 0.053
IleGlu: 3.414 ± 0.053
IlePhe: 1.943 ± 0.039
IleGly: 4.795 ± 0.072
IleHis: 1.156 ± 0.028
IleIle: 3.287 ± 0.06
IleLys: 2.282 ± 0.052
IleLeu: 4.879 ± 0.07
IleMet: 1.202 ± 0.033
IleAsn: 2.556 ± 0.048
IlePro: 2.54 ± 0.039
IleGln: 1.767 ± 0.032
IleArg: 3.0 ± 0.047
IleSer: 3.706 ± 0.054
IleThr: 3.506 ± 0.058
IleVal: 3.724 ± 0.052
IleTrp: 0.693 ± 0.022
IleTyr: 1.484 ± 0.032
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.966 ± 0.069
LysCys: 0.22 ± 0.013
LysAsp: 1.794 ± 0.045
LysGlu: 2.122 ± 0.048
LysPhe: 1.073 ± 0.024
LysGly: 2.748 ± 0.051
LysHis: 0.838 ± 0.022
LysIle: 2.224 ± 0.041
LysLys: 2.088 ± 0.05
LysLeu: 4.078 ± 0.06
LysMet: 1.149 ± 0.033
LysAsn: 1.643 ± 0.039
LysPro: 2.036 ± 0.043
LysGln: 2.04 ± 0.042
LysArg: 2.403 ± 0.046
LysSer: 2.214 ± 0.048
LysThr: 2.405 ± 0.041
LysVal: 2.779 ± 0.046
LysTrp: 0.435 ± 0.019
LysTyr: 1.1 ± 0.036
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 12.095 ± 0.115
LeuCys: 1.18 ± 0.035
LeuAsp: 5.366 ± 0.057
LeuGlu: 5.32 ± 0.071
LeuPhe: 4.434 ± 0.071
LeuGly: 7.363 ± 0.083
LeuHis: 2.355 ± 0.041
LeuIle: 5.883 ± 0.073
LeuLys: 4.264 ± 0.069
LeuLeu: 12.381 ± 0.158
LeuMet: 2.928 ± 0.046
LeuAsn: 4.268 ± 0.066
LeuPro: 5.975 ± 0.073
LeuGln: 4.511 ± 0.062
LeuArg: 6.604 ± 0.079
LeuSer: 7.644 ± 0.079
LeuThr: 6.602 ± 0.086
LeuVal: 7.239 ± 0.078
LeuTrp: 1.46 ± 0.034
LeuTyr: 2.612 ± 0.043
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.892 ± 0.041
MetCys: 0.172 ± 0.011
MetAsp: 1.131 ± 0.03
MetGlu: 1.15 ± 0.03
MetPhe: 0.799 ± 0.023
MetGly: 1.735 ± 0.039
MetHis: 0.51 ± 0.02
MetIle: 1.386 ± 0.035
MetLys: 1.359 ± 0.035
MetLeu: 3.08 ± 0.058
MetMet: 0.869 ± 0.026
MetAsn: 1.045 ± 0.03
MetPro: 1.288 ± 0.031
MetGln: 1.176 ± 0.028
MetArg: 1.584 ± 0.033
MetSer: 1.868 ± 0.034
MetThr: 1.734 ± 0.031
MetVal: 1.883 ± 0.04
MetTrp: 0.228 ± 0.012
MetTyr: 0.451 ± 0.019
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.723 ± 0.054
AsnCys: 0.328 ± 0.017
AsnAsp: 1.982 ± 0.04
AsnGlu: 1.721 ± 0.039
AsnPhe: 1.277 ± 0.034
AsnGly: 3.146 ± 0.071
AsnHis: 0.905 ± 0.024
AsnIle: 2.2 ± 0.043
AsnLys: 1.456 ± 0.036
AsnLeu: 3.573 ± 0.059
AsnMet: 0.83 ± 0.028
AsnAsn: 1.58 ± 0.045
AsnPro: 2.111 ± 0.041
AsnGln: 1.805 ± 0.034
AsnArg: 2.087 ± 0.04
AsnSer: 2.063 ± 0.046
AsnThr: 1.923 ± 0.038
AsnVal: 2.33 ± 0.045
AsnTrp: 0.559 ± 0.022
AsnTyr: 1.129 ± 0.029
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.809 ± 0.072
ProCys: 0.34 ± 0.014
ProAsp: 2.771 ± 0.049
ProGlu: 3.215 ± 0.046
ProPhe: 1.774 ± 0.034
ProGly: 3.613 ± 0.049
ProHis: 1.165 ± 0.029
ProIle: 1.982 ± 0.037
ProLys: 1.421 ± 0.036
ProLeu: 5.332 ± 0.067
ProMet: 1.062 ± 0.03
ProAsn: 1.343 ± 0.035
ProPro: 1.92 ± 0.041
ProGln: 2.358 ± 0.044
ProArg: 2.114 ± 0.042
ProSer: 2.163 ± 0.042
ProThr: 2.2 ± 0.038
ProVal: 3.692 ± 0.052
ProTrp: 0.752 ± 0.022
ProTyr: 1.179 ± 0.03
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.128 ± 0.078
GlnCys: 0.37 ± 0.018
GlnAsp: 1.953 ± 0.045
GlnGlu: 2.182 ± 0.041
GlnPhe: 1.509 ± 0.032
GlnGly: 3.26 ± 0.054
GlnHis: 1.475 ± 0.035
GlnIle: 2.311 ± 0.047
GlnLys: 1.668 ± 0.041
GlnLeu: 5.317 ± 0.083
GlnMet: 1.226 ± 0.028
GlnAsn: 1.425 ± 0.035
GlnPro: 2.568 ± 0.044
GlnGln: 4.098 ± 0.082
GlnArg: 3.713 ± 0.062
GlnSer: 2.384 ± 0.039
GlnThr: 2.343 ± 0.045
GlnVal: 3.188 ± 0.057
GlnTrp: 0.723 ± 0.022
GlnTyr: 1.129 ± 0.028
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.199 ± 0.064
ArgCys: 0.571 ± 0.025
ArgAsp: 3.139 ± 0.057
ArgGlu: 3.818 ± 0.063
ArgPhe: 2.706 ± 0.042
ArgGly: 3.511 ± 0.045
ArgHis: 1.697 ± 0.04
ArgIle: 3.384 ± 0.052
ArgLys: 2.209 ± 0.048
ArgLeu: 6.911 ± 0.088
ArgMet: 1.557 ± 0.031
ArgAsn: 2.05 ± 0.042
ArgPro: 2.288 ± 0.044
ArgGln: 3.618 ± 0.059
ArgArg: 3.764 ± 0.064
ArgSer: 2.896 ± 0.046
ArgThr: 2.578 ± 0.041
ArgVal: 4.08 ± 0.061
ArgTrp: 1.03 ± 0.03
ArgTyr: 2.217 ± 0.042
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.122 ± 0.075
SerCys: 0.518 ± 0.019
SerAsp: 3.136 ± 0.05
SerGlu: 3.252 ± 0.057
SerPhe: 2.26 ± 0.044
SerGly: 5.64 ± 0.081
SerHis: 1.446 ± 0.037
SerIle: 2.927 ± 0.056
SerLys: 1.912 ± 0.041
SerLeu: 6.962 ± 0.068
SerMet: 1.461 ± 0.035
SerAsn: 1.84 ± 0.039
SerPro: 2.699 ± 0.047
SerGln: 2.629 ± 0.047
SerArg: 3.435 ± 0.052
SerSer: 3.624 ± 0.053
SerThr: 2.997 ± 0.051
SerVal: 4.247 ± 0.067
SerTrp: 1.013 ± 0.027
SerTyr: 1.56 ± 0.033
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.088 ± 0.058
ThrCys: 0.435 ± 0.017
ThrAsp: 2.615 ± 0.049
ThrGlu: 2.599 ± 0.044
ThrPhe: 1.983 ± 0.038
ThrGly: 4.33 ± 0.064
ThrHis: 1.33 ± 0.032
ThrIle: 2.769 ± 0.051
ThrLys: 1.464 ± 0.036
ThrLeu: 7.802 ± 0.098
ThrMet: 1.049 ± 0.028
ThrAsn: 1.541 ± 0.042
ThrPro: 3.237 ± 0.05
ThrGln: 2.336 ± 0.048
ThrArg: 3.215 ± 0.054
ThrSer: 2.908 ± 0.045
ThrThr: 2.765 ± 0.052
ThrVal: 3.902 ± 0.083
ThrTrp: 0.805 ± 0.024
ThrTyr: 1.109 ± 0.027
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.18 ± 0.074
ValCys: 0.688 ± 0.024
ValAsp: 3.64 ± 0.058
ValGlu: 3.834 ± 0.06
ValPhe: 2.45 ± 0.04
ValGly: 4.871 ± 0.073
ValHis: 1.18 ± 0.028
ValIle: 4.453 ± 0.069
ValLys: 2.897 ± 0.049
ValLeu: 7.198 ± 0.085
ValMet: 2.125 ± 0.04
ValAsn: 2.736 ± 0.047
ValPro: 2.964 ± 0.047
ValGln: 2.346 ± 0.039
ValArg: 3.664 ± 0.059
ValSer: 4.696 ± 0.058
ValThr: 4.305 ± 0.073
ValVal: 5.478 ± 0.08
ValTrp: 0.934 ± 0.028
ValTyr: 1.646 ± 0.035
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.048 ± 0.03
TrpCys: 0.19 ± 0.011
TrpAsp: 0.627 ± 0.018
TrpGlu: 0.577 ± 0.02
TrpPhe: 0.739 ± 0.025
TrpGly: 0.897 ± 0.032
TrpHis: 0.478 ± 0.017
TrpIle: 0.719 ± 0.023
TrpLys: 0.499 ± 0.017
TrpLeu: 2.466 ± 0.048
TrpMet: 0.4 ± 0.015
TrpAsn: 0.505 ± 0.019
TrpPro: 0.693 ± 0.022
TrpGln: 1.23 ± 0.034
TrpArg: 1.161 ± 0.032
TrpSer: 0.876 ± 0.029
TrpThr: 0.551 ± 0.02
TrpVal: 0.937 ± 0.027
TrpTrp: 0.245 ± 0.013
TrpTyr: 0.442 ± 0.021
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.463 ± 0.039
TyrCys: 0.37 ± 0.017
TyrAsp: 1.558 ± 0.041
TyrGlu: 1.106 ± 0.032
TyrPhe: 1.124 ± 0.032
TyrGly: 2.211 ± 0.038
TyrHis: 0.743 ± 0.025
TyrIle: 1.343 ± 0.031
TyrLys: 0.867 ± 0.024
TyrLeu: 3.062 ± 0.049
TyrMet: 0.538 ± 0.02
TyrAsn: 0.936 ± 0.026
TyrPro: 1.34 ± 0.031
TyrGln: 1.752 ± 0.039
TyrArg: 1.939 ± 0.043
TyrSer: 1.614 ± 0.035
TyrThr: 1.344 ± 0.035
TyrVal: 1.616 ± 0.038
TyrTrp: 0.445 ± 0.019
TyrTyr: 0.876 ± 0.029
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4353 proteins (1371518 amino acids)
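The proteome size allows a rough conversion from per-mille values back to occurrence counts. The sketch below assumes (this is not stated on the page) that dipeptides are counted with overlap inside each protein, so a protein of n residues contributes n - 1 pairs; the function names are hypothetical.

```python
def total_dipeptides(n_residues, n_proteins):
    """Overlapping pairs: each protein of length n contributes n - 1."""
    return n_residues - n_proteins


def approx_count(permille, n_pairs):
    """Approximate number of occurrences behind a per-mille frequency."""
    return round(permille / 1000 * n_pairs)


# Figures from the statistics line above.
n_pairs = total_dipeptides(1371518, 4353)
ala_ala_count = approx_count(11.167, n_pairs)  # AlaAla: 11.167 per mille
```

Under that assumption the proteome contains 1,367,165 dipeptide positions, and the AlaAla frequency corresponds to roughly 15,000 occurrences.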

Note: The error has been estimated by bootstrapping (x100) at the protein level.
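A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions rather than the database's own code: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the reported error is assumed to be the standard deviation across replicates (the page does not say which spread measure is used). The function name `bootstrap_error` and its parameters are hypothetical.

```python
import random
import statistics


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per-mille
    frequency over n_boot protein-level bootstrap replicates."""
    rng = random.Random(seed)
    # Per protein: (occurrences of `pair`, number of overlapping pairs).
    stats = []
    for seq in proteins:
        hits = sum(1 for a, b in zip(seq, seq[1:]) if a + b == pair)
        stats.append((hits, max(len(seq) - 1, 0)))
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(stats, k=len(stats))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.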

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski