Amino acid dipeptide frequency for Sphingomonadaceae bacterium PASS1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random in proteins, with all amino acids equally likely, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, overrepresented dipeptides are marked in red in the table and underrepresented ones in blue.
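The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used to build the database: the function names, the toy sequences, and the choice to count overlapping pairs within each protein are assumptions, and the over/under threshold is simply the uniform 0.25% expectation mentioned in the text.

```python
from collections import Counter

# The 20 standard amino acids (one-letter codes); anything else is treated as X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Uniform expectation: 1 of 400 dipeptides = 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} pooled over all sequences."""
    counts = Counter()
    for seq in sequences:
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping adjacent pairs inside one protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def representation(freq_per_mille):
    """Compare a frequency with the uniform expectation (assumed threshold)."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "as expected"


# Made-up sequences, for illustration only.
toy_proteome = ["MAAL", "AAK"]
freqs = dipeptide_per_mille(toy_proteome)
print(freqs["AA"], representation(freqs["AA"]))
```

A real proteome would be read from a FASTA file instead of the hard-coded list; the pooling and per mille scaling would stay the same.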

All values are given in per mille (‰); multiply by 10^-3 to obtain fractions (for example, 15.924‰ = 0.015924, or about 1.59%).
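As a small illustration of the unit conversion, the sketch below turns a per mille value into a fraction and into a percentage. The function names are hypothetical and not part of Proteome-pI.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (1% = 10 per mille)."""
    return value / 10.0


# AlaAla is listed at 15.924 per mille; 2.5 per mille is the uniform expectation.
print(per_mille_to_fraction(15.924))
print(per_mille_to_percent(2.5))
```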

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Each cell shows the frequency in ‰ ± the estimated error.

| 1st / 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 15.924 ± 0.186 | 1.056 ± 0.035 | 7.247 ± 0.089 | 7.077 ± 0.11 | 4.478 ± 0.071 | 9.897 ± 0.126 | 2.311 ± 0.05 | 6.995 ± 0.088 | 5.079 ± 0.081 | 12.115 ± 0.139 | 3.845 ± 0.08 | 3.764 ± 0.068 | 5.376 ± 0.084 | 4.411 ± 0.083 | 7.299 ± 0.11 | 6.534 ± 0.086 | 6.192 ± 0.096 | 8.095 ± 0.102 | 1.419 ± 0.042 | 2.67 ± 0.059 | 0.0 ± 0.0 |
| Cys | 1.017 ± 0.034 | 0.107 ± 0.01 | 0.625 ± 0.026 | 0.421 ± 0.022 | 0.343 ± 0.019 | 0.931 ± 0.031 | 0.236 ± 0.016 | 0.452 ± 0.024 | 0.263 ± 0.018 | 0.699 ± 0.03 | 0.159 ± 0.015 | 0.279 ± 0.019 | 0.458 ± 0.022 | 0.238 ± 0.015 | 0.515 ± 0.023 | 0.527 ± 0.025 | 0.409 ± 0.017 | 0.616 ± 0.026 | 0.122 ± 0.012 | 0.209 ± 0.015 | 0.0 ± 0.0 |
| Asp | 7.481 ± 0.1 | 0.504 ± 0.026 | 3.408 ± 0.077 | 3.229 ± 0.066 | 2.303 ± 0.053 | 5.581 ± 0.109 | 1.248 ± 0.035 | 3.665 ± 0.073 | 2.163 ± 0.048 | 5.368 ± 0.078 | 1.684 ± 0.043 | 1.742 ± 0.047 | 3.202 ± 0.06 | 1.897 ± 0.044 | 3.908 ± 0.066 | 2.7 ± 0.054 | 2.716 ± 0.07 | 4.413 ± 0.078 | 1.087 ± 0.035 | 1.668 ± 0.049 | 0.0 ± 0.0 |
| Glu | 6.797 ± 0.092 | 0.39 ± 0.022 | 2.645 ± 0.056 | 2.743 ± 0.064 | 1.818 ± 0.043 | 4.055 ± 0.071 | 1.146 ± 0.04 | 3.211 ± 0.065 | 2.493 ± 0.059 | 4.986 ± 0.076 | 1.644 ± 0.046 | 1.812 ± 0.041 | 2.235 ± 0.051 | 2.079 ± 0.062 | 4.026 ± 0.08 | 2.456 ± 0.057 | 3.128 ± 0.062 | 3.434 ± 0.06 | 0.766 ± 0.031 | 1.246 ± 0.033 | 0.0 ± 0.0 |
| Phe | 4.89 ± 0.076 | 0.394 ± 0.02 | 2.889 ± 0.056 | 2.248 ± 0.048 | 1.463 ± 0.044 | 3.996 ± 0.065 | 0.802 ± 0.03 | 1.824 ± 0.046 | 1.149 ± 0.035 | 3.213 ± 0.067 | 0.9 ± 0.033 | 1.393 ± 0.043 | 1.567 ± 0.045 | 1.067 ± 0.036 | 1.991 ± 0.051 | 2.315 ± 0.054 | 2.127 ± 0.05 | 2.704 ± 0.051 | 0.563 ± 0.025 | 0.944 ± 0.028 | 0.0 ± 0.0 |
| Gly | 9.109 ± 0.12 | 0.855 ± 0.032 | 4.705 ± 0.09 | 4.269 ± 0.071 | 3.749 ± 0.074 | 7.698 ± 0.161 | 1.876 ± 0.047 | 4.869 ± 0.077 | 4.036 ± 0.072 | 8.124 ± 0.099 | 2.405 ± 0.049 | 2.762 ± 0.084 | 3.402 ± 0.057 | 3.019 ± 0.056 | 5.224 ± 0.08 | 5.046 ± 0.095 | 4.757 ± 0.097 | 6.025 ± 0.093 | 1.509 ± 0.041 | 2.523 ± 0.059 | 0.0 ± 0.0 |
| His | 2.236 ± 0.054 | 0.245 ± 0.017 | 1.281 ± 0.037 | 0.998 ± 0.032 | 0.963 ± 0.035 | 1.881 ± 0.049 | 0.603 ± 0.029 | 1.229 ± 0.034 | 0.66 ± 0.028 | 1.804 ± 0.045 | 0.583 ± 0.028 | 0.652 ± 0.024 | 1.288 ± 0.045 | 0.597 ± 0.026 | 1.238 ± 0.04 | 1.096 ± 0.037 | 0.747 ± 0.029 | 1.508 ± 0.041 | 0.365 ± 0.023 | 0.7 ± 0.027 | 0.0 ± 0.0 |
| Ile | 7.742 ± 0.1 | 0.612 ± 0.027 | 4.062 ± 0.072 | 3.618 ± 0.068 | 2.083 ± 0.048 | 5.283 ± 0.08 | 0.964 ± 0.034 | 3.176 ± 0.064 | 1.845 ± 0.047 | 4.656 ± 0.078 | 1.274 ± 0.041 | 1.88 ± 0.044 | 2.51 ± 0.058 | 1.363 ± 0.038 | 3.09 ± 0.061 | 3.65 ± 0.066 | 2.956 ± 0.053 | 4.266 ± 0.075 | 0.801 ± 0.032 | 1.241 ± 0.038 | 0.0 ± 0.0 |
| Lys | 5.072 ± 0.091 | 0.231 ± 0.017 | 2.011 ± 0.049 | 1.612 ± 0.05 | 1.214 ± 0.037 | 3.114 ± 0.067 | 0.785 ± 0.031 | 2.26 ± 0.055 | 1.601 ± 0.049 | 3.893 ± 0.066 | 1.155 ± 0.04 | 1.161 ± 0.034 | 2.286 ± 0.056 | 1.173 ± 0.034 | 2.543 ± 0.052 | 2.353 ± 0.053 | 2.153 ± 0.05 | 2.628 ± 0.059 | 0.556 ± 0.028 | 0.869 ± 0.033 | 0.0 ± 0.0 |
| Leu | 12.021 ± 0.126 | 0.824 ± 0.032 | 5.527 ± 0.089 | 4.57 ± 0.08 | 3.59 ± 0.072 | 7.769 ± 0.096 | 1.803 ± 0.047 | 5.073 ± 0.074 | 3.699 ± 0.07 | 8.619 ± 0.13 | 2.308 ± 0.06 | 3.122 ± 0.065 | 5.052 ± 0.075 | 2.689 ± 0.061 | 5.616 ± 0.089 | 6.28 ± 0.09 | 5.372 ± 0.079 | 6.407 ± 0.1 | 1.191 ± 0.036 | 1.995 ± 0.047 | 0.0 ± 0.0 |
| Met | 3.481 ± 0.062 | 0.196 ± 0.013 | 1.336 ± 0.036 | 1.182 ± 0.039 | 0.89 ± 0.036 | 2.185 ± 0.053 | 0.547 ± 0.029 | 1.513 ± 0.037 | 1.151 ± 0.04 | 2.667 ± 0.057 | 0.771 ± 0.029 | 0.872 ± 0.03 | 1.562 ± 0.043 | 0.945 ± 0.03 | 1.845 ± 0.042 | 1.658 ± 0.039 | 1.979 ± 0.047 | 1.791 ± 0.046 | 0.297 ± 0.018 | 0.3 ± 0.018 | 0.0 ± 0.0 |
| Asn | 4.023 ± 0.075 | 0.305 ± 0.018 | 1.934 ± 0.076 | 1.38 ± 0.038 | 1.342 ± 0.04 | 3.027 ± 0.08 | 0.583 ± 0.024 | 2.091 ± 0.05 | 0.995 ± 0.037 | 2.887 ± 0.064 | 0.825 ± 0.029 | 1.082 ± 0.04 | 2.093 ± 0.051 | 0.942 ± 0.031 | 2.023 ± 0.049 | 1.669 ± 0.051 | 1.4 ± 0.044 | 2.349 ± 0.057 | 0.554 ± 0.025 | 0.912 ± 0.037 | 0.0 ± 0.0 |
| Pro | 5.755 ± 0.082 | 0.326 ± 0.019 | 3.457 ± 0.063 | 3.417 ± 0.067 | 1.94 ± 0.044 | 3.778 ± 0.067 | 1.115 ± 0.038 | 2.575 ± 0.061 | 1.935 ± 0.052 | 4.445 ± 0.065 | 1.306 ± 0.038 | 1.658 ± 0.039 | 2.141 ± 0.054 | 1.645 ± 0.042 | 2.357 ± 0.051 | 2.851 ± 0.063 | 2.534 ± 0.052 | 4.037 ± 0.071 | 0.638 ± 0.024 | 1.212 ± 0.04 | 0.0 ± 0.0 |
| Gln | 3.663 ± 0.067 | 0.262 ± 0.016 | 1.541 ± 0.039 | 1.448 ± 0.046 | 1.254 ± 0.034 | 2.483 ± 0.049 | 0.666 ± 0.026 | 2.097 ± 0.057 | 1.288 ± 0.038 | 3.245 ± 0.055 | 1.027 ± 0.037 | 1.095 ± 0.036 | 1.689 ± 0.045 | 1.228 ± 0.04 | 2.212 ± 0.056 | 2.109 ± 0.044 | 1.769 ± 0.041 | 2.17 ± 0.051 | 0.477 ± 0.024 | 0.808 ± 0.033 | 0.0 ± 0.0 |
| Arg | 6.802 ± 0.103 | 0.449 ± 0.023 | 3.651 ± 0.068 | 3.281 ± 0.066 | 2.747 ± 0.059 | 4.22 ± 0.067 | 1.394 ± 0.038 | 3.788 ± 0.072 | 2.418 ± 0.053 | 6.505 ± 0.115 | 1.781 ± 0.039 | 2.08 ± 0.045 | 2.88 ± 0.062 | 2.193 ± 0.05 | 3.826 ± 0.083 | 3.299 ± 0.061 | 3.057 ± 0.049 | 4.142 ± 0.076 | 0.989 ± 0.037 | 1.751 ± 0.046 | 0.0 ± 0.0 |
| Ser | 6.766 ± 0.093 | 0.45 ± 0.018 | 3.562 ± 0.073 | 2.923 ± 0.065 | 2.371 ± 0.052 | 5.931 ± 0.095 | 1.108 ± 0.034 | 3.333 ± 0.065 | 2.111 ± 0.05 | 5.159 ± 0.072 | 1.462 ± 0.04 | 1.865 ± 0.05 | 2.775 ± 0.06 | 1.731 ± 0.046 | 3.234 ± 0.062 | 3.147 ± 0.074 | 2.824 ± 0.064 | 4.111 ± 0.072 | 0.882 ± 0.027 | 1.601 ± 0.046 | 0.0 ± 0.0 |
| Thr | 6.308 ± 0.095 | 0.42 ± 0.021 | 3.134 ± 0.061 | 2.691 ± 0.05 | 1.959 ± 0.056 | 5.285 ± 0.085 | 1.097 ± 0.034 | 3.067 ± 0.064 | 1.834 ± 0.047 | 5.39 ± 0.087 | 1.291 ± 0.039 | 1.609 ± 0.046 | 3.149 ± 0.061 | 1.672 ± 0.041 | 2.903 ± 0.047 | 3.045 ± 0.058 | 2.833 ± 0.066 | 3.908 ± 0.072 | 0.574 ± 0.025 | 1.248 ± 0.041 | 0.0 ± 0.0 |
| Val | 8.632 ± 0.097 | 0.597 ± 0.025 | 4.516 ± 0.073 | 4.179 ± 0.069 | 2.359 ± 0.048 | 5.477 ± 0.084 | 1.39 ± 0.041 | 4.033 ± 0.068 | 2.487 ± 0.059 | 5.888 ± 0.084 | 1.811 ± 0.048 | 2.342 ± 0.057 | 3.63 ± 0.068 | 2.173 ± 0.044 | 4.537 ± 0.065 | 4.311 ± 0.066 | 4.53 ± 0.081 | 5.012 ± 0.096 | 0.855 ± 0.029 | 1.475 ± 0.038 | 0.0 ± 0.0 |
| Trp | 1.376 ± 0.039 | 0.124 ± 0.012 | 0.8 ± 0.034 | 0.641 ± 0.028 | 0.548 ± 0.027 | 0.99 ± 0.031 | 0.394 ± 0.019 | 0.705 ± 0.026 | 0.601 ± 0.026 | 1.649 ± 0.048 | 0.358 ± 0.022 | 0.497 ± 0.022 | 0.695 ± 0.024 | 0.665 ± 0.024 | 1.126 ± 0.034 | 0.909 ± 0.031 | 0.81 ± 0.035 | 0.895 ± 0.038 | 0.213 ± 0.014 | 0.312 ± 0.02 | 0.0 ± 0.0 |
| Tyr | 2.687 ± 0.056 | 0.283 ± 0.017 | 1.638 ± 0.048 | 1.181 ± 0.037 | 1.029 ± 0.035 | 2.378 ± 0.054 | 0.604 ± 0.024 | 1.117 ± 0.035 | 0.865 ± 0.033 | 2.216 ± 0.049 | 0.574 ± 0.025 | 0.748 ± 0.033 | 1.133 ± 0.033 | 0.804 ± 0.028 | 1.745 ± 0.045 | 1.391 ± 0.038 | 1.122 ± 0.039 | 1.724 ± 0.042 | 0.447 ± 0.02 | 0.683 ± 0.027 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 2834 proteins (930857 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
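Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values serves as the error. This is a minimal sketch under assumptions, not the database's actual code: the function names are hypothetical, and reporting the sample standard deviation of the replicates as the "±" value is an assumption, since the note does not say which spread measure was used.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping adjacent residue pairs within a single protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_dipeptide_error(sequences, pair, n_replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each replicate resamples whole proteins with replacement, so the
    resampling unit is the protein, not the individual dipeptide.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(seq) for seq in sequences]
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # a Counter returns 0 for absent pairs
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the standard deviation across replicates.
    return statistics.mean(estimates), statistics.stdev(estimates)


# Made-up sequences, for illustration only.
toy_proteome = ["MAAL", "AAK", "GGAV", "LLAA"]
mean_aa, error_aa = bootstrap_dipeptide_error(toy_proteome, "AA")
print(f"AA: {mean_aa:.3f} ± {error_aa:.3f} per mille")
```

Resampling proteins rather than individual dipeptides keeps each protein's pairs together in every replicate, which is one reasonable reading of "at the protein level".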

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski