Amino acid dipepetide frequency for Deinococcus peraridilitoris (strain DSM 19664 / LMG 22246 / CIP 109416 / KR-200)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.757AlaAla: 12.757 ± 0.13
0.97AlaCys: 0.97 ± 0.031
4.977AlaAsp: 4.977 ± 0.067
6.298AlaGlu: 6.298 ± 0.083
4.034AlaPhe: 4.034 ± 0.063
9.487AlaGly: 9.487 ± 0.089
2.677AlaHis: 2.677 ± 0.049
3.807AlaIle: 3.807 ± 0.059
2.525AlaLys: 2.525 ± 0.061
15.574AlaLeu: 15.574 ± 0.139
2.324AlaMet: 2.324 ± 0.044
2.43AlaAsn: 2.43 ± 0.048
5.692AlaPro: 5.692 ± 0.107
5.493AlaGln: 5.493 ± 0.07
10.252AlaArg: 10.252 ± 0.094
6.502AlaSer: 6.502 ± 0.082
5.294AlaThr: 5.294 ± 0.064
8.338AlaVal: 8.338 ± 0.095
1.765AlaTrp: 1.765 ± 0.043
2.804AlaTyr: 2.804 ± 0.044
0.0AlaXaa: 0.0 ± 0.0
Cys
0.767CysAla: 0.767 ± 0.026
0.076CysCys: 0.076 ± 0.008
0.364CysAsp: 0.364 ± 0.018
0.343CysGlu: 0.343 ± 0.016
0.194CysPhe: 0.194 ± 0.014
0.744CysGly: 0.744 ± 0.027
0.15CysHis: 0.15 ± 0.012
0.22CysIle: 0.22 ± 0.015
0.12CysLys: 0.12 ± 0.009
0.673CysLeu: 0.673 ± 0.026
0.101CysMet: 0.101 ± 0.009
0.143CysAsn: 0.143 ± 0.012
0.435CysPro: 0.435 ± 0.021
0.213CysGln: 0.213 ± 0.012
0.45CysArg: 0.45 ± 0.019
0.393CysSer: 0.393 ± 0.018
0.406CysThr: 0.406 ± 0.019
0.485CysVal: 0.485 ± 0.018
0.101CysTrp: 0.101 ± 0.01
0.149CysTyr: 0.149 ± 0.01
0.0CysXaa: 0.0 ± 0.0
Asp
5.999AspAla: 5.999 ± 0.075
0.26AspCys: 0.26 ± 0.016
2.418AspAsp: 2.418 ± 0.05
3.267AspGlu: 3.267 ± 0.06
2.046AspPhe: 2.046 ± 0.045
4.121AspGly: 4.121 ± 0.065
1.122AspHis: 1.122 ± 0.031
1.928AspIle: 1.928 ± 0.041
1.187AspLys: 1.187 ± 0.039
6.306AspLeu: 6.306 ± 0.079
0.848AspMet: 0.848 ± 0.029
1.067AspAsn: 1.067 ± 0.033
3.056AspPro: 3.056 ± 0.048
1.599AspGln: 1.599 ± 0.033
3.123AspArg: 3.123 ± 0.053
2.124AspSer: 2.124 ± 0.046
2.632AspThr: 2.632 ± 0.048
4.337AspVal: 4.337 ± 0.06
0.767AspTrp: 0.767 ± 0.026
1.208AspTyr: 1.208 ± 0.034
0.0AspXaa: 0.0 ± 0.0
Glu
7.404GluAla: 7.404 ± 0.084
0.304GluCys: 0.304 ± 0.016
2.662GluAsp: 2.662 ± 0.049
3.498GluGlu: 3.498 ± 0.062
1.87GluPhe: 1.87 ± 0.04
4.732GluGly: 4.732 ± 0.062
1.668GluHis: 1.668 ± 0.036
2.328GluIle: 2.328 ± 0.039
1.622GluLys: 1.622 ± 0.04
7.214GluLeu: 7.214 ± 0.089
1.118GluMet: 1.118 ± 0.031
1.507GluAsn: 1.507 ± 0.034
2.246GluPro: 2.246 ± 0.048
2.818GluGln: 2.818 ± 0.054
6.29GluArg: 6.29 ± 0.091
2.625GluSer: 2.625 ± 0.043
2.827GluThr: 2.827 ± 0.051
5.294GluVal: 5.294 ± 0.061
0.836GluTrp: 0.836 ± 0.027
1.346GluTyr: 1.346 ± 0.039
0.0GluXaa: 0.0 ± 0.0
Phe
3.881PheAla: 3.881 ± 0.069
0.243PheCys: 0.243 ± 0.014
2.115PheAsp: 2.115 ± 0.04
2.221PheGlu: 2.221 ± 0.039
1.168PhePhe: 1.168 ± 0.029
3.339PheGly: 3.339 ± 0.054
0.673PheHis: 0.673 ± 0.024
1.219PheIle: 1.219 ± 0.036
0.882PheLys: 0.882 ± 0.029
3.359PheLeu: 3.359 ± 0.065
0.616PheMet: 0.616 ± 0.022
0.951PheAsn: 0.951 ± 0.027
1.701PhePro: 1.701 ± 0.035
1.156PheGln: 1.156 ± 0.034
2.057PheArg: 2.057 ± 0.041
2.108PheSer: 2.108 ± 0.045
2.284PheThr: 2.284 ± 0.046
2.912PheVal: 2.912 ± 0.055
0.559PheTrp: 0.559 ± 0.021
0.885PheTyr: 0.885 ± 0.029
0.0PheXaa: 0.0 ± 0.0
Gly
9.388GlyAla: 9.388 ± 0.097
0.596GlyCys: 0.596 ± 0.024
3.866GlyAsp: 3.866 ± 0.059
5.728GlyGlu: 5.728 ± 0.07
3.102GlyPhe: 3.102 ± 0.049
7.223GlyGly: 7.223 ± 0.104
1.952GlyHis: 1.952 ± 0.041
3.327GlyIle: 3.327 ± 0.056
2.835GlyLys: 2.835 ± 0.066
9.211GlyLeu: 9.211 ± 0.093
1.821GlyMet: 1.821 ± 0.043
2.376GlyAsn: 2.376 ± 0.057
3.276GlyPro: 3.276 ± 0.047
3.617GlyGln: 3.617 ± 0.066
6.021GlyArg: 6.021 ± 0.079
4.748GlySer: 4.748 ± 0.071
5.049GlyThr: 5.049 ± 0.105
7.576GlyVal: 7.576 ± 0.098
1.401GlyTrp: 1.401 ± 0.039
2.359GlyTyr: 2.359 ± 0.041
0.0GlyXaa: 0.0 ± 0.0
His
2.927HisAla: 2.927 ± 0.053
0.174HisCys: 0.174 ± 0.013
1.435HisAsp: 1.435 ± 0.039
1.411HisGlu: 1.411 ± 0.034
0.872HisPhe: 0.872 ± 0.027
2.137HisGly: 2.137 ± 0.043
0.734HisHis: 0.734 ± 0.028
0.772HisIle: 0.772 ± 0.029
0.504HisLys: 0.504 ± 0.019
2.781HisLeu: 2.781 ± 0.052
0.351HisMet: 0.351 ± 0.016
0.566HisAsn: 0.566 ± 0.023
1.535HisPro: 1.535 ± 0.041
0.767HisGln: 0.767 ± 0.028
1.448HisArg: 1.448 ± 0.035
1.04HisSer: 1.04 ± 0.026
1.243HisThr: 1.243 ± 0.029
1.737HisVal: 1.737 ± 0.038
0.335HisTrp: 0.335 ± 0.017
0.626HisTyr: 0.626 ± 0.021
0.0HisXaa: 0.0 ± 0.0
Ile
4.406IleAla: 4.406 ± 0.068
0.219IleCys: 0.219 ± 0.014
2.205IleAsp: 2.205 ± 0.041
2.63IleGlu: 2.63 ± 0.048
1.195IlePhe: 1.195 ± 0.031
3.472IleGly: 3.472 ± 0.061
0.802IleHis: 0.802 ± 0.023
1.453IleIle: 1.453 ± 0.037
1.07IleLys: 1.07 ± 0.029
3.644IleLeu: 3.644 ± 0.054
0.645IleMet: 0.645 ± 0.026
1.04IleAsn: 1.04 ± 0.03
1.992IlePro: 1.992 ± 0.039
1.194IleGln: 1.194 ± 0.037
2.461IleArg: 2.461 ± 0.046
2.182IleSer: 2.182 ± 0.04
2.414IleThr: 2.414 ± 0.048
3.161IleVal: 3.161 ± 0.058
0.363IleTrp: 0.363 ± 0.019
0.84IleTyr: 0.84 ± 0.023
0.0IleXaa: 0.0 ± 0.0
Lys
2.914LysAla: 2.914 ± 0.064
0.128LysCys: 0.128 ± 0.011
1.24LysAsp: 1.24 ± 0.037
1.265LysGlu: 1.265 ± 0.038
0.791LysPhe: 0.791 ± 0.027
1.987LysGly: 1.987 ± 0.05
0.611LysHis: 0.611 ± 0.022
1.116LysIle: 1.116 ± 0.036
0.976LysLys: 0.976 ± 0.035
2.912LysLeu: 2.912 ± 0.064
0.58LysMet: 0.58 ± 0.026
0.907LysAsn: 0.907 ± 0.033
1.546LysPro: 1.546 ± 0.042
0.924LysGln: 0.924 ± 0.03
2.135LysArg: 2.135 ± 0.046
1.452LysSer: 1.452 ± 0.039
1.717LysThr: 1.717 ± 0.044
2.192LysVal: 2.192 ± 0.046
0.303LysTrp: 0.303 ± 0.016
0.68LysTyr: 0.68 ± 0.023
0.0LysXaa: 0.0 ± 0.0
Leu
13.739LeuAla: 13.739 ± 0.124
0.771LeuCys: 0.771 ± 0.027
5.854LeuAsp: 5.854 ± 0.067
6.918LeuGlu: 6.918 ± 0.102
3.182LeuPhe: 3.182 ± 0.06
10.124LeuGly: 10.124 ± 0.109
2.701LeuHis: 2.701 ± 0.058
4.625LeuIle: 4.625 ± 0.063
3.084LeuLys: 3.084 ± 0.06
13.233LeuLeu: 13.233 ± 0.152
2.059LeuMet: 2.059 ± 0.045
3.161LeuAsn: 3.161 ± 0.056
6.74LeuPro: 6.74 ± 0.079
4.578LeuGln: 4.578 ± 0.072
9.151LeuArg: 9.151 ± 0.108
7.683LeuSer: 7.683 ± 0.089
6.969LeuThr: 6.969 ± 0.089
7.759LeuVal: 7.759 ± 0.098
1.377LeuTrp: 1.377 ± 0.037
2.518LeuTyr: 2.518 ± 0.047
0.0LeuXaa: 0.0 ± 0.0
Met
1.775MetAla: 1.775 ± 0.041
0.089MetCys: 0.089 ± 0.009
0.795MetAsp: 0.795 ± 0.027
0.791MetGlu: 0.791 ± 0.022
0.538MetPhe: 0.538 ± 0.02
1.392MetGly: 1.392 ± 0.034
0.446MetHis: 0.446 ± 0.02
0.818MetIle: 0.818 ± 0.028
0.742MetLys: 0.742 ± 0.026
2.158MetLeu: 2.158 ± 0.041
0.385MetMet: 0.385 ± 0.019
0.814MetAsn: 0.814 ± 0.026
1.124MetPro: 1.124 ± 0.027
0.776MetGln: 0.776 ± 0.027
1.453MetArg: 1.453 ± 0.036
1.254MetSer: 1.254 ± 0.035
1.714MetThr: 1.714 ± 0.036
1.246MetVal: 1.246 ± 0.035
0.178MetTrp: 0.178 ± 0.012
0.43MetTyr: 0.43 ± 0.019
0.0MetXaa: 0.0 ± 0.0
Asn
3.306AsnAla: 3.306 ± 0.054
0.173AsnCys: 0.173 ± 0.013
1.327AsnAsp: 1.327 ± 0.035
1.352AsnGlu: 1.352 ± 0.029
1.037AsnPhe: 1.037 ± 0.032
2.268AsnGly: 2.268 ± 0.061
0.503AsnHis: 0.503 ± 0.024
1.162AsnIle: 1.162 ± 0.032
0.642AsnLys: 0.642 ± 0.03
3.053AsnLeu: 3.053 ± 0.058
0.479AsnMet: 0.479 ± 0.02
0.803AsnAsn: 0.803 ± 0.038
1.833AsnPro: 1.833 ± 0.04
0.767AsnGln: 0.767 ± 0.024
1.591AsnArg: 1.591 ± 0.038
1.342AsnSer: 1.342 ± 0.033
1.622AsnThr: 1.622 ± 0.045
2.364AsnVal: 2.364 ± 0.049
0.385AsnTrp: 0.385 ± 0.018
0.68AsnTyr: 0.68 ± 0.023
0.0AsnXaa: 0.0 ± 0.0
Pro
5.744ProAla: 5.744 ± 0.101
0.301ProCys: 0.301 ± 0.016
3.412ProAsp: 3.412 ± 0.054
4.034ProGlu: 4.034 ± 0.072
1.852ProPhe: 1.852 ± 0.044
4.907ProGly: 4.907 ± 0.068
1.358ProHis: 1.358 ± 0.038
1.734ProIle: 1.734 ± 0.039
1.386ProLys: 1.386 ± 0.032
5.42ProLeu: 5.42 ± 0.07
0.968ProMet: 0.968 ± 0.03
1.445ProAsn: 1.445 ± 0.034
2.632ProPro: 2.632 ± 0.063
2.211ProGln: 2.211 ± 0.049
3.381ProArg: 3.381 ± 0.055
3.012ProSer: 3.012 ± 0.05
2.944ProThr: 2.944 ± 0.054
4.178ProVal: 4.178 ± 0.066
0.792ProTrp: 0.792 ± 0.027
1.312ProTyr: 1.312 ± 0.034
0.0ProXaa: 0.0 ± 0.0
Gln
5.08GlnAla: 5.08 ± 0.067
0.172GlnCys: 0.172 ± 0.014
2.079GlnAsp: 2.079 ± 0.045
2.506GlnGlu: 2.506 ± 0.052
1.193GlnPhe: 1.193 ± 0.033
3.625GlnGly: 3.625 ± 0.06
1.07GlnHis: 1.07 ± 0.034
1.354GlnIle: 1.354 ± 0.036
1.08GlnLys: 1.08 ± 0.04
4.675GlnLeu: 4.675 ± 0.074
0.673GlnMet: 0.673 ± 0.023
1.171GlnAsn: 1.171 ± 0.03
1.952GlnPro: 1.952 ± 0.045
2.001GlnGln: 2.001 ± 0.047
3.031GlnArg: 3.031 ± 0.055
1.87GlnSer: 1.87 ± 0.041
1.949GlnThr: 1.949 ± 0.037
3.221GlnVal: 3.221 ± 0.051
0.451GlnTrp: 0.451 ± 0.02
0.865GlnTyr: 0.865 ± 0.026
0.0GlnXaa: 0.0 ± 0.0
Arg
9.107ArgAla: 9.107 ± 0.102
0.458ArgCys: 0.458 ± 0.02
3.989ArgAsp: 3.989 ± 0.061
5.812ArgGlu: 5.812 ± 0.079
2.928ArgPhe: 2.928 ± 0.048
5.589ArgGly: 5.589 ± 0.092
1.862ArgHis: 1.862 ± 0.037
2.863ArgIle: 2.863 ± 0.053
1.858ArgLys: 1.858 ± 0.04
8.556ArgLeu: 8.556 ± 0.101
1.541ArgMet: 1.541 ± 0.037
1.759ArgAsn: 1.759 ± 0.038
3.709ArgPro: 3.709 ± 0.056
3.055ArgGln: 3.055 ± 0.054
5.753ArgArg: 5.753 ± 0.081
4.091ArgSer: 4.091 ± 0.059
4.165ArgThr: 4.165 ± 0.058
6.198ArgVal: 6.198 ± 0.075
1.157ArgTrp: 1.157 ± 0.034
1.903ArgTyr: 1.903 ± 0.044
0.0ArgXaa: 0.0 ± 0.0
Ser
6.643SerAla: 6.643 ± 0.08
0.423SerCys: 0.423 ± 0.018
2.663SerAsp: 2.663 ± 0.051
3.184SerGlu: 3.184 ± 0.049
2.038SerPhe: 2.038 ± 0.04
6.064SerGly: 6.064 ± 0.089
1.098SerHis: 1.098 ± 0.031
1.996SerIle: 1.996 ± 0.039
1.412SerLys: 1.412 ± 0.037
6.099SerLeu: 6.099 ± 0.071
1.173SerMet: 1.173 ± 0.035
1.384SerAsn: 1.384 ± 0.036
3.084SerPro: 3.084 ± 0.049
1.767SerGln: 1.767 ± 0.042
3.885SerArg: 3.885 ± 0.062
3.442SerSer: 3.442 ± 0.078
3.094SerThr: 3.094 ± 0.052
4.667SerVal: 4.667 ± 0.06
0.886SerTrp: 0.886 ± 0.027
1.231SerTyr: 1.231 ± 0.031
0.0SerXaa: 0.0 ± 0.0
Thr
5.861ThrAla: 5.861 ± 0.073
0.346ThrCys: 0.346 ± 0.018
2.56ThrAsp: 2.56 ± 0.048
2.686ThrGlu: 2.686 ± 0.047
2.25ThrPhe: 2.25 ± 0.045
4.973ThrGly: 4.973 ± 0.09
1.289ThrHis: 1.289 ± 0.032
2.023ThrIle: 2.023 ± 0.044
1.222ThrLys: 1.222 ± 0.033
7.596ThrLeu: 7.596 ± 0.095
0.887ThrMet: 0.887 ± 0.027
1.415ThrAsn: 1.415 ± 0.039
4.214ThrPro: 4.214 ± 0.067
1.952ThrGln: 1.952 ± 0.045
4.068ThrArg: 4.068 ± 0.063
3.411ThrSer: 3.411 ± 0.07
3.387ThrThr: 3.387 ± 0.092
4.908ThrVal: 4.908 ± 0.078
0.826ThrTrp: 0.826 ± 0.029
1.518ThrTyr: 1.518 ± 0.036
0.0ThrXaa: 0.0 ± 0.0
Val
8.114ValAla: 8.114 ± 0.083
0.514ValCys: 0.514 ± 0.024
3.456ValAsp: 3.456 ± 0.052
4.337ValGlu: 4.337 ± 0.063
2.709ValPhe: 2.709 ± 0.053
5.886ValGly: 5.886 ± 0.083
1.718ValHis: 1.718 ± 0.038
3.447ValIle: 3.447 ± 0.05
2.231ValLys: 2.231 ± 0.045
9.228ValLeu: 9.228 ± 0.103
1.744ValMet: 1.744 ± 0.033
2.517ValAsn: 2.517 ± 0.051
4.232ValPro: 4.232 ± 0.07
3.313ValGln: 3.313 ± 0.049
6.493ValArg: 6.493 ± 0.079
4.769ValSer: 4.769 ± 0.061
5.358ValThr: 5.358 ± 0.069
6.389ValVal: 6.389 ± 0.089
1.128ValTrp: 1.128 ± 0.035
1.905ValTyr: 1.905 ± 0.038
0.0ValXaa: 0.0 ± 0.0
Trp
1.364TrpAla: 1.364 ± 0.033
0.118TrpCys: 0.118 ± 0.01
0.611TrpAsp: 0.611 ± 0.024
0.69TrpGlu: 0.69 ± 0.023
0.417TrpPhe: 0.417 ± 0.021
1.079TrpGly: 1.079 ± 0.029
0.352TrpHis: 0.352 ± 0.016
0.525TrpIle: 0.525 ± 0.022
0.408TrpLys: 0.408 ± 0.02
1.678TrpLeu: 1.678 ± 0.041
0.3TrpMet: 0.3 ± 0.016
0.546TrpAsn: 0.546 ± 0.021
0.722TrpPro: 0.722 ± 0.026
0.79TrpGln: 0.79 ± 0.027
1.278TrpArg: 1.278 ± 0.033
0.936TrpSer: 0.936 ± 0.028
0.927TrpThr: 0.927 ± 0.028
0.905TrpVal: 0.905 ± 0.028
0.297TrpTrp: 0.297 ± 0.016
0.334TrpTyr: 0.334 ± 0.016
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.823TyrAla: 2.823 ± 0.049
0.19TyrCys: 0.19 ± 0.013
1.385TyrAsp: 1.385 ± 0.035
1.235TyrGlu: 1.235 ± 0.031
0.913TyrPhe: 0.913 ± 0.027
2.323TyrGly: 2.323 ± 0.044
0.61TyrHis: 0.61 ± 0.024
0.671TyrIle: 0.671 ± 0.026
0.59TyrLys: 0.59 ± 0.022
2.855TyrLeu: 2.855 ± 0.049
0.303TyrMet: 0.303 ± 0.016
0.688TyrAsn: 0.688 ± 0.025
1.303TyrPro: 1.303 ± 0.033
1.031TyrGln: 1.031 ± 0.027
2.034TyrArg: 2.034 ± 0.039
1.25TyrSer: 1.25 ± 0.033
1.399TyrThr: 1.399 ± 0.036
1.618TyrVal: 1.618 ± 0.038
0.37TyrTrp: 0.37 ± 0.017
0.699TyrTyr: 0.699 ± 0.026
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4164 proteins (1261125 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski