Amino acid dipeptide frequency for Melittangium boletus DSM 14713

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
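
As a rough illustration of the calculation described above, the following Python sketch counts overlapping dipeptides in a set of protein sequences and expresses them in per mille. It is only a sketch under stated assumptions: the function names, the one-letter input alphabet, and the normalization by the total number of dipeptides are illustrative choices and not necessarily what Proteome-pI does. The uniform expectation of 0.25% corresponds to 2.5‰.

```python
from collections import Counter

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for one-letter protein sequences.

    Assumption: frequencies are normalized by the total number of overlapping
    dipeptides across all sequences (hypothetical choice for this sketch).
    """
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2: "AAL" yields "AA" and "AL".
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if freq_per_mille > UNIFORM_EXPECTATION:
        return "over"
    if freq_per_mille < UNIFORM_EXPECTATION:
        return "under"
    return "expected"


# Toy input (not real proteome data): dipeptides are AA, AL and LA.
toy_freqs = dipeptide_per_mille(["AAL", "LA"])
```

With the table values, `classify(12.929)` (AlaAla) gives "over" and `classify(0.111)` (CysCys) gives "under", which would match the red and blue marking if the page uses the uniform expectation as its threshold.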

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 12.929 ± 0.098
AlaCys: 1.271 ± 0.023
AlaAsp: 4.817 ± 0.043
AlaGlu: 7.399 ± 0.078
AlaPhe: 3.724 ± 0.038
AlaGly: 8.813 ± 0.071
AlaHis: 2.543 ± 0.031
AlaIle: 3.307 ± 0.038
AlaLys: 2.873 ± 0.036
AlaLeu: 14.609 ± 0.103
AlaMet: 2.267 ± 0.03
AlaAsn: 2.264 ± 0.038
AlaPro: 7.005 ± 0.081
AlaGln: 4.308 ± 0.041
AlaArg: 10.578 ± 0.082
AlaSer: 7.021 ± 0.057
AlaThr: 5.659 ± 0.056
AlaVal: 8.055 ± 0.066
AlaTrp: 1.779 ± 0.031
AlaTyr: 2.328 ± 0.028
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.152 ± 0.024
CysCys: 0.111 ± 0.008
CysAsp: 0.532 ± 0.017
CysGlu: 0.572 ± 0.015
CysPhe: 0.334 ± 0.012
CysGly: 1.031 ± 0.027
CysHis: 0.256 ± 0.011
CysIle: 0.283 ± 0.01
CysLys: 0.208 ± 0.009
CysLeu: 0.916 ± 0.019
CysMet: 0.177 ± 0.008
CysAsn: 0.243 ± 0.012
CysPro: 0.571 ± 0.018
CysGln: 0.305 ± 0.01
CysArg: 0.66 ± 0.02
CysSer: 0.571 ± 0.018
CysThr: 0.56 ± 0.017
CysVal: 0.731 ± 0.018
CysTrp: 0.126 ± 0.007
CysTyr: 0.2 ± 0.008
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.421 ± 0.057
AspCys: 0.499 ± 0.017
AspAsp: 2.298 ± 0.032
AspGlu: 3.441 ± 0.04
AspPhe: 2.131 ± 0.029
AspGly: 4.685 ± 0.065
AspHis: 0.835 ± 0.02
AspIle: 1.891 ± 0.028
AspLys: 1.517 ± 0.027
AspLeu: 4.917 ± 0.047
AspMet: 0.921 ± 0.019
AspAsn: 1.092 ± 0.02
AspPro: 3.006 ± 0.041
AspGln: 1.28 ± 0.021
AspArg: 2.927 ± 0.031
AspSer: 2.64 ± 0.036
AspThr: 2.746 ± 0.032
AspVal: 4.371 ± 0.038
AspTrp: 0.818 ± 0.016
AspTyr: 1.21 ± 0.023
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.304 ± 0.079
GluCys: 0.459 ± 0.012
GluAsp: 3.302 ± 0.033
GluGlu: 4.539 ± 0.05
GluPhe: 1.66 ± 0.027
GluGly: 5.684 ± 0.049
GluHis: 1.542 ± 0.024
GluIle: 1.748 ± 0.029
GluLys: 2.189 ± 0.032
GluLeu: 7.468 ± 0.068
GluMet: 1.213 ± 0.025
GluAsn: 1.249 ± 0.023
GluPro: 3.859 ± 0.047
GluGln: 2.744 ± 0.034
GluArg: 6.741 ± 0.069
GluSer: 3.324 ± 0.039
GluThr: 2.942 ± 0.032
GluVal: 5.247 ± 0.046
GluTrp: 0.868 ± 0.026
GluTyr: 1.142 ± 0.019
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.396 ± 0.034
PheCys: 0.344 ± 0.01
PheAsp: 2.085 ± 0.026
PheGlu: 2.196 ± 0.026
PhePhe: 1.403 ± 0.024
PheGly: 2.856 ± 0.034
PheHis: 0.847 ± 0.016
PheIle: 1.268 ± 0.021
PheLys: 0.962 ± 0.021
PheLeu: 3.57 ± 0.04
PheMet: 0.621 ± 0.015
PheAsn: 0.987 ± 0.02
PhePro: 1.607 ± 0.023
PheGln: 1.25 ± 0.017
PheArg: 2.251 ± 0.03
PheSer: 2.328 ± 0.028
PheThr: 2.139 ± 0.032
PheVal: 2.495 ± 0.028
PheTrp: 0.469 ± 0.014
PheTyr: 0.793 ± 0.017
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.054 ± 0.068
GlyCys: 1.033 ± 0.023
GlyAsp: 4.088 ± 0.042
GlyGlu: 5.471 ± 0.047
GlyPhe: 3.108 ± 0.038
GlyGly: 7.859 ± 0.088
GlyHis: 1.918 ± 0.028
GlyIle: 2.969 ± 0.033
GlyLys: 2.985 ± 0.042
GlyLeu: 9.221 ± 0.07
GlyMet: 2.001 ± 0.028
GlyAsn: 2.105 ± 0.036
GlyPro: 4.189 ± 0.043
GlyGln: 3.327 ± 0.039
GlyArg: 6.477 ± 0.055
GlySer: 5.057 ± 0.052
GlyThr: 5.437 ± 0.063
GlyVal: 6.84 ± 0.058
GlyTrp: 1.389 ± 0.025
GlyTyr: 2.083 ± 0.032
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.509 ± 0.033
HisCys: 0.216 ± 0.009
HisAsp: 0.98 ± 0.016
HisGlu: 1.38 ± 0.024
HisPhe: 0.887 ± 0.017
HisGly: 1.954 ± 0.027
HisHis: 0.658 ± 0.017
HisIle: 0.655 ± 0.014
HisLys: 0.471 ± 0.014
HisLeu: 2.514 ± 0.031
HisMet: 0.38 ± 0.012
HisAsn: 0.41 ± 0.012
HisPro: 1.651 ± 0.029
HisGln: 0.737 ± 0.017
HisArg: 1.678 ± 0.023
HisSer: 1.084 ± 0.022
HisThr: 1.058 ± 0.022
HisVal: 1.712 ± 0.028
HisTrp: 0.341 ± 0.009
HisTyr: 0.581 ± 0.013
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.525 ± 0.034
IleCys: 0.279 ± 0.009
IleAsp: 1.859 ± 0.029
IleGlu: 2.064 ± 0.027
IlePhe: 0.986 ± 0.02
IleGly: 2.412 ± 0.033
IleHis: 0.871 ± 0.017
IleIle: 1.194 ± 0.025
IleLys: 0.841 ± 0.019
IleLeu: 2.811 ± 0.035
IleMet: 0.369 ± 0.012
IleAsn: 0.942 ± 0.022
IlePro: 1.982 ± 0.026
IleGln: 1.227 ± 0.021
IleArg: 2.237 ± 0.031
IleSer: 1.883 ± 0.029
IleThr: 1.805 ± 0.029
IleVal: 2.237 ± 0.03
IleTrp: 0.322 ± 0.011
IleTyr: 0.704 ± 0.015
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.412 ± 0.043
LysCys: 0.217 ± 0.009
LysAsp: 1.741 ± 0.028
LysGlu: 1.777 ± 0.027
LysPhe: 0.663 ± 0.016
LysGly: 2.584 ± 0.035
LysHis: 0.555 ± 0.014
LysIle: 0.822 ± 0.019
LysLys: 1.379 ± 0.03
LysLeu: 3.069 ± 0.037
LysMet: 0.635 ± 0.015
LysAsn: 0.825 ± 0.016
LysPro: 1.965 ± 0.031
LysGln: 1.041 ± 0.02
LysArg: 2.164 ± 0.031
LysSer: 1.543 ± 0.026
LysThr: 1.638 ± 0.024
LysVal: 2.438 ± 0.038
LysTrp: 0.347 ± 0.011
LysTyr: 0.596 ± 0.015
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 13.545 ± 0.086
LeuCys: 1.028 ± 0.02
LeuAsp: 5.803 ± 0.047
LeuGlu: 7.591 ± 0.07
LeuPhe: 3.63 ± 0.038
LeuGly: 9.804 ± 0.064
LeuHis: 2.376 ± 0.03
LeuIle: 3.105 ± 0.037
LeuLys: 3.367 ± 0.041
LeuLeu: 12.349 ± 0.102
LeuMet: 2.054 ± 0.026
LeuAsn: 2.316 ± 0.035
LeuPro: 6.656 ± 0.059
LeuGln: 3.325 ± 0.037
LeuArg: 9.107 ± 0.072
LeuSer: 7.453 ± 0.061
LeuThr: 6.03 ± 0.047
LeuVal: 8.737 ± 0.06
LeuTrp: 1.405 ± 0.026
LeuTyr: 2.378 ± 0.032
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.291 ± 0.033
MetCys: 0.141 ± 0.006
MetAsp: 1.013 ± 0.019
MetGlu: 1.183 ± 0.021
MetPhe: 0.435 ± 0.012
MetGly: 1.766 ± 0.03
MetHis: 0.392 ± 0.013
MetIle: 0.499 ± 0.013
MetLys: 0.907 ± 0.019
MetLeu: 2.043 ± 0.028
MetMet: 0.464 ± 0.014
MetAsn: 0.625 ± 0.015
MetPro: 1.219 ± 0.021
MetGln: 0.566 ± 0.014
MetArg: 1.701 ± 0.026
MetSer: 1.486 ± 0.023
MetThr: 1.163 ± 0.021
MetVal: 1.324 ± 0.022
MetTrp: 0.202 ± 0.008
MetTyr: 0.284 ± 0.011
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.566 ± 0.034
AsnCys: 0.232 ± 0.01
AsnAsp: 1.072 ± 0.022
AsnGlu: 1.2 ± 0.023
AsnPhe: 0.733 ± 0.019
AsnGly: 2.133 ± 0.04
AsnHis: 0.511 ± 0.014
AsnIle: 0.865 ± 0.019
AsnLys: 0.642 ± 0.02
AsnLeu: 2.365 ± 0.029
AsnMet: 0.368 ± 0.011
AsnAsn: 0.714 ± 0.02
AsnPro: 1.747 ± 0.027
AsnGln: 0.792 ± 0.016
AsnArg: 1.455 ± 0.023
AsnSer: 1.11 ± 0.024
AsnThr: 1.338 ± 0.025
AsnVal: 1.755 ± 0.027
AsnTrp: 0.348 ± 0.01
AsnTyr: 0.591 ± 0.015
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.576 ± 0.071
ProCys: 0.392 ± 0.012
ProAsp: 3.283 ± 0.041
ProGlu: 5.261 ± 0.049
ProPhe: 1.975 ± 0.025
ProGly: 5.706 ± 0.051
ProHis: 1.228 ± 0.021
ProIle: 1.559 ± 0.026
ProLys: 1.478 ± 0.025
ProLeu: 6.064 ± 0.051
ProMet: 1.197 ± 0.019
ProAsn: 1.192 ± 0.023
ProPro: 4.818 ± 0.066
ProGln: 1.782 ± 0.031
ProArg: 4.425 ± 0.042
ProSer: 4.121 ± 0.044
ProThr: 3.176 ± 0.038
ProVal: 4.823 ± 0.053
ProTrp: 0.853 ± 0.018
ProTyr: 1.14 ± 0.023
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.256 ± 0.041
GlnCys: 0.297 ± 0.01
GlnAsp: 1.575 ± 0.024
GlnGlu: 2.217 ± 0.032
GlnPhe: 0.874 ± 0.02
GlnGly: 3.232 ± 0.034
GlnHis: 0.696 ± 0.014
GlnIle: 0.823 ± 0.016
GlnLys: 1.116 ± 0.02
GlnLeu: 3.705 ± 0.039
GlnMet: 0.683 ± 0.017
GlnAsn: 0.679 ± 0.017
GlnPro: 1.983 ± 0.035
GlnGln: 1.311 ± 0.026
GlnArg: 3.043 ± 0.041
GlnSer: 1.727 ± 0.025
GlnThr: 1.567 ± 0.023
GlnVal: 3.304 ± 0.04
GlnTrp: 0.516 ± 0.015
GlnTyr: 0.653 ± 0.017
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.135 ± 0.078
ArgCys: 0.738 ± 0.018
ArgAsp: 3.83 ± 0.039
ArgGlu: 6.123 ± 0.059
ArgPhe: 3.184 ± 0.038
ArgGly: 5.922 ± 0.047
ArgHis: 1.774 ± 0.027
ArgIle: 2.948 ± 0.034
ArgLys: 2.182 ± 0.028
ArgLeu: 9.504 ± 0.073
ArgMet: 2.077 ± 0.029
ArgAsn: 1.503 ± 0.025
ArgPro: 4.264 ± 0.042
ArgGln: 2.843 ± 0.033
ArgArg: 6.333 ± 0.056
ArgSer: 3.862 ± 0.042
ArgThr: 3.995 ± 0.038
ArgVal: 6.697 ± 0.058
ArgTrp: 1.316 ± 0.023
ArgTyr: 2.062 ± 0.03
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.785 ± 0.06
SerCys: 0.558 ± 0.017
SerAsp: 2.677 ± 0.034
SerGlu: 3.385 ± 0.04
SerPhe: 2.262 ± 0.032
SerGly: 5.806 ± 0.059
SerHis: 1.208 ± 0.019
SerIle: 1.888 ± 0.026
SerLys: 1.445 ± 0.023
SerLeu: 6.88 ± 0.063
SerMet: 1.171 ± 0.022
SerAsn: 1.376 ± 0.024
SerPro: 3.922 ± 0.041
SerGln: 1.871 ± 0.028
SerArg: 4.565 ± 0.045
SerSer: 3.872 ± 0.053
SerThr: 3.344 ± 0.042
SerVal: 4.464 ± 0.046
SerTrp: 0.902 ± 0.017
SerTyr: 1.244 ± 0.022
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.502 ± 0.054
ThrCys: 0.572 ± 0.022
ThrAsp: 2.419 ± 0.035
ThrGlu: 2.873 ± 0.03
ThrPhe: 1.993 ± 0.03
ThrGly: 5.073 ± 0.051
ThrHis: 1.253 ± 0.021
ThrIle: 1.362 ± 0.024
ThrLys: 1.198 ± 0.025
ThrLeu: 6.643 ± 0.052
ThrMet: 0.742 ± 0.016
ThrAsn: 1.18 ± 0.023
ThrPro: 4.25 ± 0.035
ThrGln: 1.966 ± 0.03
ThrArg: 4.174 ± 0.037
ThrSer: 3.295 ± 0.046
ThrThr: 2.896 ± 0.04
ThrVal: 4.408 ± 0.051
ThrTrp: 0.888 ± 0.018
ThrTyr: 1.321 ± 0.026
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.557 ± 0.067
ValCys: 0.776 ± 0.019
ValAsp: 4.366 ± 0.042
ValGlu: 5.419 ± 0.051
ValPhe: 2.527 ± 0.031
ValGly: 6.309 ± 0.05
ValHis: 1.624 ± 0.025
ValIle: 2.385 ± 0.028
ValLys: 2.563 ± 0.033
ValLeu: 9.327 ± 0.066
ValMet: 1.528 ± 0.026
ValAsn: 1.8 ± 0.028
ValPro: 4.522 ± 0.045
ValGln: 2.356 ± 0.031
ValArg: 6.592 ± 0.052
ValSer: 4.918 ± 0.05
ValThr: 4.367 ± 0.054
ValVal: 6.39 ± 0.057
ValTrp: 0.998 ± 0.023
ValTyr: 1.599 ± 0.027
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.318 ± 0.026
TrpCys: 0.162 ± 0.008
TrpAsp: 0.685 ± 0.02
TrpGlu: 0.79 ± 0.016
TrpPhe: 0.478 ± 0.014
TrpGly: 1.118 ± 0.025
TrpHis: 0.314 ± 0.01
TrpIle: 0.393 ± 0.011
TrpLys: 0.536 ± 0.015
TrpLeu: 1.704 ± 0.025
TrpMet: 0.426 ± 0.012
TrpAsn: 0.478 ± 0.014
TrpPro: 0.708 ± 0.015
TrpGln: 0.399 ± 0.012
TrpArg: 1.384 ± 0.024
TrpSer: 1.014 ± 0.023
TrpThr: 0.907 ± 0.019
TrpVal: 1.138 ± 0.023
TrpTrp: 0.245 ± 0.01
TrpTyr: 0.293 ± 0.012
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.317 ± 0.029
TyrCys: 0.215 ± 0.009
TyrAsp: 1.219 ± 0.025
TyrGlu: 1.341 ± 0.026
TyrPhe: 0.879 ± 0.016
TyrGly: 1.863 ± 0.032
TyrHis: 0.455 ± 0.012
TyrIle: 0.59 ± 0.015
TyrLys: 0.551 ± 0.014
TyrLeu: 2.381 ± 0.031
TyrMet: 0.41 ± 0.012
TyrAsn: 0.559 ± 0.016
TyrPro: 1.125 ± 0.022
TyrGln: 0.827 ± 0.019
TyrArg: 1.801 ± 0.025
TyrSer: 1.315 ± 0.026
TyrThr: 1.241 ± 0.025
TyrVal: 1.753 ± 0.027
TyrTrp: 0.358 ± 0.012
TyrTyr: 0.601 ± 0.019
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7973 proteins (2961835 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
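
To make the note concrete, here is a minimal Python sketch of protein-level bootstrapping: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is taken as the error. The function names, the fixed seed, and the use of the sample standard deviation as the reported "±" value are assumptions made for illustration; the page does not state exactly how its error is derived.

```python
import random
from statistics import stdev


def pair_stats(seq, pair):
    """Return (occurrences of `pair`, total overlapping dipeptides) for one protein."""
    total = max(len(seq) - 1, 0)
    hits = sum(1 for i in range(total) if seq[i:i + 2] == pair)
    return hits, total


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency (per mille) by resampling proteins.

    Assumption: the error is the sample standard deviation over the resamples.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_stats(s, pair) for s in sequences]
    estimates = []
    for _ in range(n_resamples):
        # Resample proteins (not residues) with replacement, keeping the set size.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy input (not real proteome data).
toy_error = bootstrap_error(["AALA", "LLAA", "ALAL"], "AA")
```

Resampling at the protein level, rather than shuffling individual residues, keeps each protein's internal composition intact, so the estimate reflects variation between proteins.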

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski