Amino acid dipepetide frequency for Betaproteobacteria bacterium GR16-43

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.355AlaAla: 17.355 ± 0.192
1.219AlaCys: 1.219 ± 0.037
5.939AlaAsp: 5.939 ± 0.075
6.566AlaGlu: 6.566 ± 0.095
4.895AlaPhe: 4.895 ± 0.069
10.821AlaGly: 10.821 ± 0.155
2.303AlaHis: 2.303 ± 0.048
6.211AlaIle: 6.211 ± 0.064
5.346AlaLys: 5.346 ± 0.079
13.402AlaLeu: 13.402 ± 0.173
3.383AlaMet: 3.383 ± 0.049
3.405AlaAsn: 3.405 ± 0.068
6.142AlaPro: 6.142 ± 0.077
4.062AlaGln: 4.062 ± 0.063
8.866AlaArg: 8.866 ± 0.113
7.123AlaSer: 7.123 ± 0.099
6.937AlaThr: 6.937 ± 0.163
8.727AlaVal: 8.727 ± 0.085
1.877AlaTrp: 1.877 ± 0.041
2.809AlaTyr: 2.809 ± 0.058
0.0AlaXaa: 0.0 ± 0.0
Cys
1.092CysAla: 1.092 ± 0.05
0.107CysCys: 0.107 ± 0.009
0.479CysAsp: 0.479 ± 0.02
0.486CysGlu: 0.486 ± 0.018
0.276CysPhe: 0.276 ± 0.015
0.902CysGly: 0.902 ± 0.025
0.287CysHis: 0.287 ± 0.024
0.34CysIle: 0.34 ± 0.016
0.236CysLys: 0.236 ± 0.013
0.719CysLeu: 0.719 ± 0.033
0.155CysMet: 0.155 ± 0.009
0.217CysAsn: 0.217 ± 0.012
0.425CysPro: 0.425 ± 0.017
0.171CysGln: 0.171 ± 0.011
0.548CysArg: 0.548 ± 0.02
0.459CysSer: 0.459 ± 0.02
0.544CysThr: 0.544 ± 0.038
0.681CysVal: 0.681 ± 0.024
0.113CysTrp: 0.113 ± 0.019
0.201CysTyr: 0.201 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
6.869AspAla: 6.869 ± 0.096
0.411AspCys: 0.411 ± 0.019
2.522AspAsp: 2.522 ± 0.058
3.057AspGlu: 3.057 ± 0.057
2.323AspPhe: 2.323 ± 0.045
5.21AspGly: 5.21 ± 0.119
0.981AspHis: 0.981 ± 0.025
2.231AspIle: 2.231 ± 0.052
1.774AspLys: 1.774 ± 0.042
5.334AspLeu: 5.334 ± 0.087
1.036AspMet: 1.036 ± 0.025
1.218AspAsn: 1.218 ± 0.033
3.366AspPro: 3.366 ± 0.047
1.321AspGln: 1.321 ± 0.03
3.842AspArg: 3.842 ± 0.055
2.257AspSer: 2.257 ± 0.039
2.426AspThr: 2.426 ± 0.047
4.12AspVal: 4.12 ± 0.056
0.838AspTrp: 0.838 ± 0.025
1.352AspTyr: 1.352 ± 0.033
0.0AspXaa: 0.0 ± 0.0
Glu
7.487GluAla: 7.487 ± 0.112
0.389GluCys: 0.389 ± 0.017
2.367GluAsp: 2.367 ± 0.045
2.823GluGlu: 2.823 ± 0.054
1.962GluPhe: 1.962 ± 0.038
4.478GluGly: 4.478 ± 0.068
1.193GluHis: 1.193 ± 0.029
2.944GluIle: 2.944 ± 0.046
2.554GluLys: 2.554 ± 0.058
5.252GluLeu: 5.252 ± 0.076
1.387GluMet: 1.387 ± 0.032
1.439GluAsn: 1.439 ± 0.032
2.612GluPro: 2.612 ± 0.045
1.867GluGln: 1.867 ± 0.035
5.003GluArg: 5.003 ± 0.083
2.921GluSer: 2.921 ± 0.055
2.708GluThr: 2.708 ± 0.054
4.23GluVal: 4.23 ± 0.067
0.758GluTrp: 0.758 ± 0.021
1.219GluTyr: 1.219 ± 0.029
0.0GluXaa: 0.0 ± 0.0
Phe
4.894PheAla: 4.894 ± 0.069
0.334PheCys: 0.334 ± 0.016
2.525PheAsp: 2.525 ± 0.048
2.255PheGlu: 2.255 ± 0.04
1.513PhePhe: 1.513 ± 0.035
3.523PheGly: 3.523 ± 0.059
0.823PheHis: 0.823 ± 0.024
1.619PheIle: 1.619 ± 0.035
1.198PheLys: 1.198 ± 0.033
3.44PheLeu: 3.44 ± 0.055
0.738PheMet: 0.738 ± 0.024
1.22PheAsn: 1.22 ± 0.032
1.789PhePro: 1.789 ± 0.037
1.059PheGln: 1.059 ± 0.027
2.28PheArg: 2.28 ± 0.05
2.196PheSer: 2.196 ± 0.042
2.457PheThr: 2.457 ± 0.082
3.103PheVal: 3.103 ± 0.051
0.496PheTrp: 0.496 ± 0.023
0.917PheTyr: 0.917 ± 0.028
0.0PheXaa: 0.0 ± 0.0
Gly
9.455GlyAla: 9.455 ± 0.095
0.883GlyCys: 0.883 ± 0.033
4.434GlyAsp: 4.434 ± 0.074
4.721GlyGlu: 4.721 ± 0.072
3.56GlyPhe: 3.56 ± 0.054
7.282GlyGly: 7.282 ± 0.104
1.772GlyHis: 1.772 ± 0.042
4.168GlyIle: 4.168 ± 0.056
3.972GlyLys: 3.972 ± 0.066
8.096GlyLeu: 8.096 ± 0.089
2.086GlyMet: 2.086 ± 0.049
2.561GlyAsn: 2.561 ± 0.064
3.488GlyPro: 3.488 ± 0.057
2.566GlyGln: 2.566 ± 0.044
5.754GlyArg: 5.754 ± 0.08
5.095GlySer: 5.095 ± 0.08
5.752GlyThr: 5.752 ± 0.138
6.65GlyVal: 6.65 ± 0.077
1.499GlyTrp: 1.499 ± 0.038
2.4GlyTyr: 2.4 ± 0.04
0.0GlyXaa: 0.0 ± 0.0
His
2.57HisAla: 2.57 ± 0.046
0.234HisCys: 0.234 ± 0.013
1.077HisAsp: 1.077 ± 0.03
1.091HisGlu: 1.091 ± 0.024
0.928HisPhe: 0.928 ± 0.028
1.849HisGly: 1.849 ± 0.048
0.531HisHis: 0.531 ± 0.019
0.752HisIle: 0.752 ± 0.024
0.511HisLys: 0.511 ± 0.02
1.919HisLeu: 1.919 ± 0.04
0.386HisMet: 0.386 ± 0.017
0.421HisAsn: 0.421 ± 0.018
1.342HisPro: 1.342 ± 0.033
0.508HisGln: 0.508 ± 0.019
1.372HisArg: 1.372 ± 0.032
0.863HisSer: 0.863 ± 0.024
0.97HisThr: 0.97 ± 0.034
1.691HisVal: 1.691 ± 0.042
0.321HisTrp: 0.321 ± 0.015
0.558HisTyr: 0.558 ± 0.019
0.0HisXaa: 0.0 ± 0.0
Ile
6.779IleAla: 6.779 ± 0.07
0.381IleCys: 0.381 ± 0.017
2.986IleAsp: 2.986 ± 0.044
3.075IleGlu: 3.075 ± 0.055
1.44IlePhe: 1.44 ± 0.033
4.079IleGly: 4.079 ± 0.064
0.848IleHis: 0.848 ± 0.026
1.387IleIle: 1.387 ± 0.039
1.326IleLys: 1.326 ± 0.035
3.986IleLeu: 3.986 ± 0.056
0.623IleMet: 0.623 ± 0.024
1.181IleAsn: 1.181 ± 0.028
2.442IlePro: 2.442 ± 0.046
1.226IleGln: 1.226 ± 0.033
2.7IleArg: 2.7 ± 0.046
2.211IleSer: 2.211 ± 0.044
2.606IleThr: 2.606 ± 0.069
4.086IleVal: 4.086 ± 0.053
0.477IleTrp: 0.477 ± 0.021
1.03IleTyr: 1.03 ± 0.027
0.0IleXaa: 0.0 ± 0.0
Lys
4.813LysAla: 4.813 ± 0.082
0.226LysCys: 0.226 ± 0.015
2.121LysAsp: 2.121 ± 0.049
1.946LysGlu: 1.946 ± 0.049
1.149LysPhe: 1.149 ± 0.03
2.935LysGly: 2.935 ± 0.061
0.798LysHis: 0.798 ± 0.025
1.685LysIle: 1.685 ± 0.039
1.821LysLys: 1.821 ± 0.051
3.972LysLeu: 3.972 ± 0.065
1.001LysMet: 1.001 ± 0.029
1.047LysAsn: 1.047 ± 0.032
2.521LysPro: 2.521 ± 0.049
1.207LysGln: 1.207 ± 0.029
2.883LysArg: 2.883 ± 0.055
2.095LysSer: 2.095 ± 0.042
2.055LysThr: 2.055 ± 0.046
3.064LysVal: 3.064 ± 0.054
0.433LysTrp: 0.433 ± 0.017
0.826LysTyr: 0.826 ± 0.023
0.0LysXaa: 0.0 ± 0.0
Leu
14.396LeuAla: 14.396 ± 0.155
0.745LeuCys: 0.745 ± 0.027
5.558LeuAsp: 5.558 ± 0.07
5.872LeuGlu: 5.872 ± 0.087
3.556LeuPhe: 3.556 ± 0.06
8.209LeuGly: 8.209 ± 0.083
1.851LeuHis: 1.851 ± 0.04
3.454LeuIle: 3.454 ± 0.061
3.73LeuLys: 3.73 ± 0.07
9.221LeuLeu: 9.221 ± 0.136
2.146LeuMet: 2.146 ± 0.045
2.542LeuAsn: 2.542 ± 0.048
5.35LeuPro: 5.35 ± 0.063
2.915LeuGln: 2.915 ± 0.043
6.879LeuArg: 6.879 ± 0.087
5.32LeuSer: 5.32 ± 0.067
5.042LeuThr: 5.042 ± 0.073
7.999LeuVal: 7.999 ± 0.083
1.261LeuTrp: 1.261 ± 0.038
2.139LeuTyr: 2.139 ± 0.038
0.0LeuXaa: 0.0 ± 0.0
Met
2.7MetAla: 2.7 ± 0.046
0.144MetCys: 0.144 ± 0.009
1.141MetAsp: 1.141 ± 0.029
1.082MetGlu: 1.082 ± 0.029
0.677MetPhe: 0.677 ± 0.023
1.777MetGly: 1.777 ± 0.039
0.435MetHis: 0.435 ± 0.017
1.002MetIle: 1.002 ± 0.03
1.334MetLys: 1.334 ± 0.03
2.23MetLeu: 2.23 ± 0.043
0.542MetMet: 0.542 ± 0.021
1.028MetAsn: 1.028 ± 0.028
1.399MetPro: 1.399 ± 0.03
0.792MetGln: 0.792 ± 0.024
1.732MetArg: 1.732 ± 0.042
1.318MetSer: 1.318 ± 0.029
1.483MetThr: 1.483 ± 0.031
1.591MetVal: 1.591 ± 0.037
0.211MetTrp: 0.211 ± 0.012
0.372MetTyr: 0.372 ± 0.016
0.0MetXaa: 0.0 ± 0.0
Asn
3.725AsnAla: 3.725 ± 0.121
0.268AsnCys: 0.268 ± 0.014
1.379AsnAsp: 1.379 ± 0.033
1.323AsnGlu: 1.323 ± 0.032
1.066AsnPhe: 1.066 ± 0.029
2.774AsnGly: 2.774 ± 0.081
0.479AsnHis: 0.479 ± 0.018
1.252AsnIle: 1.252 ± 0.034
0.821AsnLys: 0.821 ± 0.024
2.762AsnLeu: 2.762 ± 0.044
0.504AsnMet: 0.504 ± 0.019
0.825AsnAsn: 0.825 ± 0.034
2.072AsnPro: 2.072 ± 0.065
0.76AsnGln: 0.76 ± 0.03
1.758AsnArg: 1.758 ± 0.032
1.287AsnSer: 1.287 ± 0.051
1.607AsnThr: 1.607 ± 0.06
2.358AsnVal: 2.358 ± 0.045
0.384AsnTrp: 0.384 ± 0.015
0.739AsnTyr: 0.739 ± 0.022
0.0AsnXaa: 0.0 ± 0.0
Pro
7.142ProAla: 7.142 ± 0.122
0.323ProCys: 0.323 ± 0.016
3.287ProAsp: 3.287 ± 0.048
3.57ProGlu: 3.57 ± 0.06
1.941ProPhe: 1.941 ± 0.039
4.96ProGly: 4.96 ± 0.067
0.961ProHis: 0.961 ± 0.027
2.184ProIle: 2.184 ± 0.039
2.128ProLys: 2.128 ± 0.049
4.528ProLeu: 4.528 ± 0.061
1.313ProMet: 1.313 ± 0.028
1.504ProAsn: 1.504 ± 0.036
2.943ProPro: 2.943 ± 0.061
1.603ProGln: 1.603 ± 0.037
3.111ProArg: 3.111 ± 0.059
2.803ProSer: 2.803 ± 0.046
2.755ProThr: 2.755 ± 0.052
4.411ProVal: 4.411 ± 0.058
0.768ProTrp: 0.768 ± 0.024
1.189ProTyr: 1.189 ± 0.029
0.0ProXaa: 0.0 ± 0.0
Gln
3.891GlnAla: 3.891 ± 0.06
0.263GlnCys: 0.263 ± 0.017
1.313GlnAsp: 1.313 ± 0.027
1.344GlnGlu: 1.344 ± 0.031
1.11GlnPhe: 1.11 ± 0.028
2.481GlnGly: 2.481 ± 0.044
0.616GlnHis: 0.616 ± 0.021
1.326GlnIle: 1.326 ± 0.035
1.082GlnLys: 1.082 ± 0.033
2.928GlnLeu: 2.928 ± 0.039
0.758GlnMet: 0.758 ± 0.025
0.75GlnAsn: 0.75 ± 0.023
1.703GlnPro: 1.703 ± 0.03
1.106GlnGln: 1.106 ± 0.034
2.402GlnArg: 2.402 ± 0.05
1.652GlnSer: 1.652 ± 0.043
1.287GlnThr: 1.287 ± 0.033
2.708GlnVal: 2.708 ± 0.042
0.49GlnTrp: 0.49 ± 0.02
0.741GlnTyr: 0.741 ± 0.021
0.0GlnXaa: 0.0 ± 0.0
Arg
8.017ArgAla: 8.017 ± 0.109
0.516ArgCys: 0.516 ± 0.021
3.919ArgAsp: 3.919 ± 0.055
4.881ArgGlu: 4.881 ± 0.076
3.044ArgPhe: 3.044 ± 0.048
4.937ArgGly: 4.937 ± 0.081
1.538ArgHis: 1.538 ± 0.035
3.794ArgIle: 3.794 ± 0.064
2.54ArgLys: 2.54 ± 0.055
7.154ArgLeu: 7.154 ± 0.088
1.907ArgMet: 1.907 ± 0.039
1.923ArgAsn: 1.923 ± 0.042
3.085ArgPro: 3.085 ± 0.048
1.964ArgGln: 1.964 ± 0.04
4.863ArgArg: 4.863 ± 0.081
3.442ArgSer: 3.442 ± 0.058
3.509ArgThr: 3.509 ± 0.056
5.578ArgVal: 5.578 ± 0.063
1.005ArgTrp: 1.005 ± 0.029
1.899ArgTyr: 1.899 ± 0.031
0.0ArgXaa: 0.0 ± 0.0
Ser
6.113SerAla: 6.113 ± 0.102
0.519SerCys: 0.519 ± 0.058
2.481SerAsp: 2.481 ± 0.042
2.589SerGlu: 2.589 ± 0.049
2.138SerPhe: 2.138 ± 0.04
5.615SerGly: 5.615 ± 0.114
1.066SerHis: 1.066 ± 0.028
2.629SerIle: 2.629 ± 0.047
1.807SerLys: 1.807 ± 0.039
5.37SerLeu: 5.37 ± 0.069
1.281SerMet: 1.281 ± 0.029
1.526SerAsn: 1.526 ± 0.038
3.11SerPro: 3.11 ± 0.048
1.611SerGln: 1.611 ± 0.03
3.656SerArg: 3.656 ± 0.058
2.93SerSer: 2.93 ± 0.058
2.985SerThr: 2.985 ± 0.052
4.178SerVal: 4.178 ± 0.061
0.698SerTrp: 0.698 ± 0.023
1.219SerTyr: 1.219 ± 0.032
0.0SerXaa: 0.0 ± 0.0
Thr
5.968ThrAla: 5.968 ± 0.117
0.486ThrCys: 0.486 ± 0.025
2.646ThrAsp: 2.646 ± 0.053
2.379ThrGlu: 2.379 ± 0.042
2.325ThrPhe: 2.325 ± 0.053
5.358ThrGly: 5.358 ± 0.126
1.154ThrHis: 1.154 ± 0.028
2.682ThrIle: 2.682 ± 0.062
1.747ThrLys: 1.747 ± 0.039
6.149ThrLeu: 6.149 ± 0.098
1.205ThrMet: 1.205 ± 0.033
1.671ThrAsn: 1.671 ± 0.155
3.608ThrPro: 3.608 ± 0.066
1.705ThrGln: 1.705 ± 0.038
3.542ThrArg: 3.542 ± 0.053
2.945ThrSer: 2.945 ± 0.065
3.185ThrThr: 3.185 ± 0.125
4.574ThrVal: 4.574 ± 0.099
0.774ThrTrp: 0.774 ± 0.028
1.389ThrTyr: 1.389 ± 0.036
0.0ThrXaa: 0.0 ± 0.0
Val
10.264ValAla: 10.264 ± 0.106
0.616ValCys: 0.616 ± 0.021
4.308ValAsp: 4.308 ± 0.062
4.684ValGlu: 4.684 ± 0.067
2.868ValPhe: 2.868 ± 0.05
5.892ValGly: 5.892 ± 0.066
1.519ValHis: 1.519 ± 0.034
3.641ValIle: 3.641 ± 0.056
3.199ValLys: 3.199 ± 0.057
7.624ValLeu: 7.624 ± 0.091
1.758ValMet: 1.758 ± 0.038
2.47ValAsn: 2.47 ± 0.064
4.245ValPro: 4.245 ± 0.051
2.23ValGln: 2.23 ± 0.041
5.319ValArg: 5.319 ± 0.064
4.36ValSer: 4.36 ± 0.056
4.974ValThr: 4.974 ± 0.115
6.728ValVal: 6.728 ± 0.08
0.944ValTrp: 0.944 ± 0.026
1.655ValTyr: 1.655 ± 0.034
0.0ValXaa: 0.0 ± 0.0
Trp
1.169TrpAla: 1.169 ± 0.03
0.125TrpCys: 0.125 ± 0.009
0.67TrpAsp: 0.67 ± 0.023
0.601TrpGlu: 0.601 ± 0.02
0.568TrpPhe: 0.568 ± 0.018
1.029TrpGly: 1.029 ± 0.032
0.313TrpHis: 0.313 ± 0.015
0.745TrpIle: 0.745 ± 0.022
0.665TrpLys: 0.665 ± 0.023
1.681TrpLeu: 1.681 ± 0.043
0.372TrpMet: 0.372 ± 0.017
0.479TrpAsn: 0.479 ± 0.017
0.578TrpPro: 0.578 ± 0.021
0.529TrpGln: 0.529 ± 0.023
1.14TrpArg: 1.14 ± 0.035
0.93TrpSer: 0.93 ± 0.03
0.842TrpThr: 0.842 ± 0.024
0.967TrpVal: 0.967 ± 0.026
0.268TrpTrp: 0.268 ± 0.016
0.323TrpTyr: 0.323 ± 0.016
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.69TyrAla: 2.69 ± 0.047
0.247TyrCys: 0.247 ± 0.015
1.338TyrAsp: 1.338 ± 0.035
1.249TyrGlu: 1.249 ± 0.029
1.039TyrPhe: 1.039 ± 0.027
2.083TyrGly: 2.083 ± 0.036
0.425TyrHis: 0.425 ± 0.018
0.821TyrIle: 0.821 ± 0.024
0.809TyrLys: 0.809 ± 0.025
2.416TyrLeu: 2.416 ± 0.047
0.409TyrMet: 0.409 ± 0.015
0.716TyrAsn: 0.716 ± 0.034
1.203TyrPro: 1.203 ± 0.03
0.748TyrGln: 0.748 ± 0.025
1.864TyrArg: 1.864 ± 0.039
1.317TyrSer: 1.317 ± 0.038
1.368TyrThr: 1.368 ± 0.036
1.858TyrVal: 1.858 ± 0.036
0.379TyrTrp: 0.379 ± 0.014
0.656TyrTyr: 0.656 ± 0.027
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4404 proteins (1490001 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski