Amino acid dipeptide frequency for Rummeliibacillus sp. TYF005

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 amino acid dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
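The counting behind such a table can be sketched as follows. This is a minimal illustration, not the database's actual pipeline: it assumes dipeptides are counted as overlapping adjacent pairs within each protein (never across protein boundaries), and the function name `dipeptide_permille` and the toy sequences are hypothetical. Non-standard residues (Xaa) are not handled here.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs; a pair never spans two proteins.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation: 1 of 400 possible pairs = 0.25% = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / (len(AMINO_ACIDS) ** 2)

# Hypothetical toy sequences, not taken from the TYF005 proteome.
toy_proteome = ["MKLLA", "MKK"]
freqs = dipeptide_permille(toy_proteome)
```

On a real proteome the input would be the full list of protein sequences, and the resulting values could be compared against `UNIFORM_PERMILLE` to see which pairs are over- or underrepresented.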

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
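As a small worked example of the unit conversion, the sketch below turns two values from the table into fractions and compares them with the uniform baseline of 2.5‰. The helper names are hypothetical, and the comparison assumes that "expected" means the uniform 1/400 baseline described above.

```python
UNIFORM_PERMILLE = 2.5  # 1/400 expressed in per mille


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def representation(value_permille, expected=UNIFORM_PERMILLE):
    """Label a value relative to the assumed uniform baseline."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


# Two values copied from the table: LeuLeu and TrpCys.
table_values = {"LeuLeu": 9.649, "TrpCys": 0.087}
labels = {pair: representation(v) for pair, v in table_values.items()}
```

Here LeuLeu (9.649‰, a fraction of about 0.0096) falls above the baseline and TrpCys (0.087‰) falls below it.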

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.141 ± 0.097
AlaCys: 0.544 ± 0.026
AlaAsp: 3.223 ± 0.064
AlaGlu: 4.46 ± 0.082
AlaPhe: 3.212 ± 0.062
AlaGly: 4.693 ± 0.076
AlaHis: 1.21 ± 0.037
AlaIle: 6.462 ± 0.109
AlaLys: 5.383 ± 0.085
AlaLeu: 6.761 ± 0.089
AlaMet: 1.94 ± 0.045
AlaAsn: 3.1 ± 0.054
AlaPro: 1.901 ± 0.056
AlaGln: 2.243 ± 0.045
AlaArg: 2.377 ± 0.054
AlaSer: 3.935 ± 0.07
AlaThr: 4.007 ± 0.08
AlaVal: 4.796 ± 0.081
AlaTrp: 0.535 ± 0.024
AlaTyr: 2.379 ± 0.051
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.458 ± 0.026
CysCys: 0.088 ± 0.01
CysAsp: 0.358 ± 0.019
CysGlu: 0.465 ± 0.024
CysPhe: 0.338 ± 0.02
CysGly: 0.693 ± 0.031
CysHis: 0.194 ± 0.014
CysIle: 0.614 ± 0.025
CysLys: 0.427 ± 0.02
CysLeu: 0.622 ± 0.029
CysMet: 0.161 ± 0.012
CysAsn: 0.312 ± 0.016
CysPro: 0.349 ± 0.017
CysGln: 0.25 ± 0.016
CysArg: 0.266 ± 0.016
CysSer: 0.512 ± 0.026
CysThr: 0.401 ± 0.021
CysVal: 0.407 ± 0.022
CysTrp: 0.075 ± 0.009
CysTyr: 0.289 ± 0.016
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.567 ± 0.066
AspCys: 0.389 ± 0.022
AspAsp: 2.749 ± 0.059
AspGlu: 4.651 ± 0.093
AspPhe: 2.596 ± 0.05
AspGly: 3.298 ± 0.061
AspHis: 1.1 ± 0.029
AspIle: 4.424 ± 0.075
AspLys: 3.195 ± 0.064
AspLeu: 4.947 ± 0.073
AspMet: 1.353 ± 0.033
AspAsn: 1.899 ± 0.047
AspPro: 1.673 ± 0.048
AspGln: 1.989 ± 0.042
AspArg: 1.884 ± 0.047
AspSer: 2.709 ± 0.053
AspThr: 2.585 ± 0.057
AspVal: 3.605 ± 0.064
AspTrp: 0.572 ± 0.024
AspTyr: 2.307 ± 0.051
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.833 ± 0.081
GluCys: 0.414 ± 0.023
GluAsp: 3.82 ± 0.074
GluGlu: 6.504 ± 0.109
GluPhe: 2.607 ± 0.054
GluGly: 3.914 ± 0.064
GluHis: 1.453 ± 0.041
GluIle: 5.889 ± 0.09
GluLys: 6.739 ± 0.104
GluLeu: 6.769 ± 0.097
GluMet: 2.047 ± 0.037
GluAsn: 3.83 ± 0.065
GluPro: 1.794 ± 0.039
GluGln: 3.525 ± 0.065
GluArg: 3.062 ± 0.066
GluSer: 3.386 ± 0.06
GluThr: 3.64 ± 0.068
GluVal: 4.925 ± 0.089
GluTrp: 0.798 ± 0.031
GluTyr: 2.267 ± 0.05
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.032 ± 0.061
PheCys: 0.356 ± 0.019
PheAsp: 2.444 ± 0.046
PheGlu: 3.042 ± 0.063
PhePhe: 2.271 ± 0.062
PheGly: 3.132 ± 0.059
PheHis: 0.923 ± 0.035
PheIle: 3.865 ± 0.075
PheLys: 2.88 ± 0.051
PheLeu: 4.253 ± 0.086
PheMet: 1.137 ± 0.034
PheAsn: 2.073 ± 0.044
PhePro: 1.495 ± 0.038
PheGln: 1.515 ± 0.032
PheArg: 1.459 ± 0.039
PheSer: 3.087 ± 0.051
PheThr: 2.638 ± 0.053
PheVal: 3.175 ± 0.065
PheTrp: 0.473 ± 0.022
PheTyr: 1.767 ± 0.044
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.44 ± 0.085
GlyCys: 0.581 ± 0.025
GlyAsp: 3.045 ± 0.067
GlyGlu: 4.024 ± 0.067
GlyPhe: 3.13 ± 0.057
GlyGly: 4.234 ± 0.087
GlyHis: 1.339 ± 0.042
GlyIle: 5.786 ± 0.1
GlyLys: 5.299 ± 0.076
GlyLeu: 6.068 ± 0.094
GlyMet: 1.827 ± 0.041
GlyAsn: 2.68 ± 0.059
GlyPro: 1.496 ± 0.041
GlyGln: 2.125 ± 0.055
GlyArg: 2.365 ± 0.055
GlySer: 3.696 ± 0.062
GlyThr: 3.935 ± 0.071
GlyVal: 4.646 ± 0.072
GlyTrp: 0.702 ± 0.028
GlyTyr: 2.753 ± 0.057
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.262 ± 0.036
HisCys: 0.19 ± 0.015
HisAsp: 0.982 ± 0.03
HisGlu: 1.402 ± 0.04
HisPhe: 1.023 ± 0.028
HisGly: 1.277 ± 0.033
HisHis: 0.626 ± 0.026
HisIle: 1.706 ± 0.04
HisLys: 1.167 ± 0.034
HisLeu: 2.08 ± 0.049
HisMet: 0.522 ± 0.023
HisAsn: 0.861 ± 0.028
HisPro: 1.042 ± 0.034
HisGln: 0.907 ± 0.032
HisArg: 0.813 ± 0.025
HisSer: 1.155 ± 0.036
HisThr: 1.067 ± 0.035
HisVal: 1.345 ± 0.036
HisTrp: 0.19 ± 0.013
HisTyr: 0.856 ± 0.028
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.563 ± 0.08
IleCys: 0.687 ± 0.029
IleAsp: 4.611 ± 0.074
IleGlu: 6.331 ± 0.094
IlePhe: 3.591 ± 0.078
IleGly: 5.845 ± 0.108
IleHis: 1.753 ± 0.042
IleIle: 6.865 ± 0.109
IleLys: 4.904 ± 0.086
IleLeu: 7.706 ± 0.103
IleMet: 1.879 ± 0.046
IleAsn: 3.502 ± 0.066
IlePro: 3.447 ± 0.059
IleGln: 3.373 ± 0.067
IleArg: 3.178 ± 0.06
IleSer: 5.466 ± 0.077
IleThr: 4.595 ± 0.069
IleVal: 5.885 ± 0.091
IleTrp: 0.661 ± 0.029
IleTyr: 2.721 ± 0.05
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.989 ± 0.076
LysCys: 0.394 ± 0.023
LysAsp: 4.238 ± 0.066
LysGlu: 7.058 ± 0.095
LysPhe: 2.381 ± 0.044
LysGly: 4.582 ± 0.076
LysHis: 1.334 ± 0.032
LysIle: 5.564 ± 0.079
LysLys: 7.087 ± 0.108
LysLeu: 6.429 ± 0.098
LysMet: 2.422 ± 0.046
LysAsn: 3.971 ± 0.072
LysPro: 2.366 ± 0.053
LysGln: 3.239 ± 0.062
LysArg: 3.215 ± 0.062
LysSer: 4.15 ± 0.075
LysThr: 4.238 ± 0.08
LysVal: 5.218 ± 0.082
LysTrp: 0.895 ± 0.033
LysTyr: 2.655 ± 0.055
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.938 ± 0.099
LeuCys: 0.708 ± 0.03
LeuAsp: 4.772 ± 0.073
LeuGlu: 6.298 ± 0.092
LeuPhe: 4.413 ± 0.084
LeuGly: 5.726 ± 0.087
LeuHis: 1.963 ± 0.049
LeuIle: 7.202 ± 0.107
LeuLys: 7.364 ± 0.101
LeuLeu: 9.649 ± 0.133
LeuMet: 2.384 ± 0.05
LeuAsn: 4.526 ± 0.074
LeuPro: 3.677 ± 0.065
LeuGln: 3.882 ± 0.07
LeuArg: 3.334 ± 0.059
LeuSer: 6.627 ± 0.083
LeuThr: 5.617 ± 0.075
LeuVal: 6.024 ± 0.094
LeuTrp: 0.82 ± 0.032
LeuTyr: 3.131 ± 0.062
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.896 ± 0.045
MetCys: 0.153 ± 0.013
MetAsp: 1.493 ± 0.039
MetGlu: 1.721 ± 0.049
MetPhe: 0.977 ± 0.034
MetGly: 1.648 ± 0.043
MetHis: 0.503 ± 0.022
MetIle: 2.205 ± 0.047
MetLys: 2.479 ± 0.051
MetLeu: 2.436 ± 0.049
MetMet: 0.851 ± 0.031
MetAsn: 1.504 ± 0.041
MetPro: 1.009 ± 0.036
MetGln: 1.013 ± 0.032
MetArg: 1.048 ± 0.037
MetSer: 1.56 ± 0.038
MetThr: 1.722 ± 0.04
MetVal: 1.593 ± 0.044
MetTrp: 0.166 ± 0.012
MetTyr: 0.724 ± 0.026
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.138 ± 0.061
AsnCys: 0.369 ± 0.021
AsnAsp: 2.458 ± 0.051
AsnGlu: 3.092 ± 0.066
AsnPhe: 1.802 ± 0.037
AsnGly: 3.615 ± 0.069
AsnHis: 1.114 ± 0.031
AsnIle: 3.96 ± 0.067
AsnLys: 3.293 ± 0.064
AsnLeu: 4.211 ± 0.061
AsnMet: 1.196 ± 0.034
AsnAsn: 2.583 ± 0.073
AsnPro: 2.17 ± 0.046
AsnGln: 1.726 ± 0.045
AsnArg: 2.022 ± 0.042
AsnSer: 2.802 ± 0.058
AsnThr: 2.525 ± 0.047
AsnVal: 3.249 ± 0.058
AsnTrp: 0.555 ± 0.024
AsnTyr: 1.821 ± 0.051
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.958 ± 0.05
ProCys: 0.202 ± 0.015
ProAsp: 1.833 ± 0.048
ProGlu: 2.569 ± 0.055
ProPhe: 1.918 ± 0.039
ProGly: 1.978 ± 0.056
ProHis: 0.737 ± 0.026
ProIle: 3.001 ± 0.052
ProLys: 2.47 ± 0.051
ProLeu: 3.223 ± 0.055
ProMet: 0.809 ± 0.033
ProAsn: 1.76 ± 0.04
ProPro: 0.848 ± 0.032
ProGln: 1.177 ± 0.033
ProArg: 1.012 ± 0.036
ProSer: 2.253 ± 0.052
ProThr: 2.12 ± 0.047
ProVal: 2.515 ± 0.056
ProTrp: 0.314 ± 0.02
ProTyr: 1.388 ± 0.041
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.595 ± 0.056
GlnCys: 0.201 ± 0.015
GlnAsp: 1.744 ± 0.042
GlnGlu: 2.764 ± 0.056
GlnPhe: 1.803 ± 0.046
GlnGly: 1.91 ± 0.042
GlnHis: 0.848 ± 0.028
GlnIle: 2.948 ± 0.057
GlnLys: 3.104 ± 0.064
GlnLeu: 4.269 ± 0.079
GlnMet: 1.149 ± 0.035
GlnAsn: 1.903 ± 0.047
GlnPro: 1.244 ± 0.04
GlnGln: 2.195 ± 0.057
GlnArg: 1.391 ± 0.037
GlnSer: 2.247 ± 0.053
GlnThr: 2.06 ± 0.055
GlnVal: 2.308 ± 0.049
GlnTrp: 0.398 ± 0.02
GlnTyr: 1.554 ± 0.041
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.231 ± 0.052
ArgCys: 0.243 ± 0.019
ArgAsp: 1.916 ± 0.05
ArgGlu: 2.768 ± 0.054
ArgPhe: 1.809 ± 0.048
ArgGly: 2.123 ± 0.052
ArgHis: 0.794 ± 0.026
ArgIle: 3.08 ± 0.056
ArgLys: 3.153 ± 0.066
ArgLeu: 3.516 ± 0.06
ArgMet: 1.134 ± 0.037
ArgAsn: 1.865 ± 0.038
ArgPro: 1.209 ± 0.034
ArgGln: 1.473 ± 0.041
ArgArg: 1.565 ± 0.043
ArgSer: 1.945 ± 0.043
ArgThr: 1.966 ± 0.044
ArgVal: 2.358 ± 0.053
ArgTrp: 0.349 ± 0.019
ArgTyr: 1.528 ± 0.04
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.623 ± 0.071
SerCys: 0.41 ± 0.021
SerAsp: 2.818 ± 0.056
SerGlu: 3.732 ± 0.06
SerPhe: 3.116 ± 0.054
SerGly: 4.108 ± 0.069
SerHis: 1.141 ± 0.036
SerIle: 5.58 ± 0.079
SerLys: 4.738 ± 0.079
SerLeu: 5.766 ± 0.077
SerMet: 1.633 ± 0.039
SerAsn: 3.047 ± 0.057
SerPro: 2.048 ± 0.048
SerGln: 2.002 ± 0.048
SerArg: 1.985 ± 0.046
SerSer: 4.166 ± 0.084
SerThr: 3.501 ± 0.061
SerVal: 4.078 ± 0.065
SerTrp: 0.554 ± 0.023
SerTyr: 2.419 ± 0.053
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.932 ± 0.067
ThrCys: 0.369 ± 0.021
ThrAsp: 2.853 ± 0.068
ThrGlu: 3.494 ± 0.07
ThrPhe: 2.8 ± 0.061
ThrGly: 4.026 ± 0.068
ThrHis: 1.08 ± 0.037
ThrIle: 5.066 ± 0.075
ThrLys: 4.162 ± 0.077
ThrLeu: 5.339 ± 0.08
ThrMet: 1.28 ± 0.034
ThrAsn: 2.871 ± 0.054
ThrPro: 2.238 ± 0.044
ThrGln: 1.583 ± 0.043
ThrArg: 1.717 ± 0.044
ThrSer: 3.61 ± 0.067
ThrThr: 3.54 ± 0.077
ThrVal: 4.17 ± 0.068
ThrTrp: 0.536 ± 0.024
ThrTyr: 2.061 ± 0.062
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.89 ± 0.085
ValCys: 0.565 ± 0.024
ValAsp: 3.506 ± 0.06
ValGlu: 4.791 ± 0.068
ValPhe: 2.909 ± 0.06
ValGly: 4.394 ± 0.082
ValHis: 1.283 ± 0.038
ValIle: 5.614 ± 0.092
ValLys: 5.269 ± 0.078
ValLeu: 6.436 ± 0.089
ValMet: 1.794 ± 0.045
ValAsn: 3.121 ± 0.062
ValPro: 2.442 ± 0.054
ValGln: 2.488 ± 0.056
ValArg: 2.385 ± 0.052
ValSer: 4.299 ± 0.081
ValThr: 4.055 ± 0.066
ValVal: 4.976 ± 0.082
ValTrp: 0.575 ± 0.023
ValTyr: 2.293 ± 0.055
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.527 ± 0.026
TrpCys: 0.087 ± 0.009
TrpAsp: 0.469 ± 0.024
TrpGlu: 0.497 ± 0.023
TrpPhe: 0.482 ± 0.024
TrpGly: 0.592 ± 0.027
TrpHis: 0.247 ± 0.016
TrpIle: 0.871 ± 0.031
TrpLys: 0.77 ± 0.033
TrpLeu: 1.095 ± 0.038
TrpMet: 0.311 ± 0.018
TrpAsn: 0.536 ± 0.028
TrpPro: 0.247 ± 0.016
TrpGln: 0.424 ± 0.021
TrpArg: 0.402 ± 0.021
TrpSer: 0.59 ± 0.025
TrpThr: 0.496 ± 0.023
TrpVal: 0.579 ± 0.029
TrpTrp: 0.11 ± 0.009
TrpTyr: 0.356 ± 0.019
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.287 ± 0.046
TyrCys: 0.326 ± 0.018
TyrAsp: 2.157 ± 0.051
TyrGlu: 2.552 ± 0.048
TyrPhe: 1.835 ± 0.048
TyrGly: 2.383 ± 0.05
TyrHis: 0.828 ± 0.028
TyrIle: 2.861 ± 0.06
TyrLys: 2.546 ± 0.055
TyrLeu: 3.48 ± 0.061
TyrMet: 0.871 ± 0.029
TyrAsn: 1.759 ± 0.043
TyrPro: 1.391 ± 0.036
TyrGln: 1.507 ± 0.04
TyrArg: 1.548 ± 0.039
TyrSer: 2.271 ± 0.048
TyrThr: 2.018 ± 0.05
TyrVal: 2.234 ± 0.049
TyrTrp: 0.413 ± 0.019
TyrTyr: 1.627 ± 0.046
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3544 proteins (1023313 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
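A protein-level bootstrap of this kind can be sketched as below. This is only an illustration under stated assumptions: whole proteins are resampled with replacement 100 times, the frequency is recomputed on each resample, and the spread is taken as the sample standard deviation. The page does not say which spread measure the database reports, and the function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over the given sequences."""
    hits = 0
    total = 0
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            total += 1
            if first + second == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the TYF005 proteome.
toy_proteome = ["MKLLA", "MKK", "LLLK", "AAKL"]
ll_error = bootstrap_error(toy_proteome, "LL")
ww_error = bootstrap_error(toy_proteome, "WW")  # pair that never occurs
```

A dipeptide that never occurs yields the same estimate in every resample, so its error is zero, which matches the 0.0 ± 0.0 entries in the Xaa rows and columns.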

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski