Amino acid dipeptide frequency for Kocuria rhizophila (strain ATCC 9341 / DSM 348 / NBRC 103217 / DC2201)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
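The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the method used to build the table: it assumes that dipeptides are counted as overlapping residue pairs within each protein, that one-letter codes are used as input, and that every non-standard letter is mapped to X (Xaa). The function name `dipeptide_per_mille` and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); anything else becomes "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Assumption: overlapping pairs, never spanning two proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }

# Uniform baseline from the text: 1/400 of all dipeptides, i.e. 2.5 per mille.
UNIFORM_EXPECTED = 1000.0 / 400

toy = ["MAAAG", "MAXG"]  # hypothetical toy sequences, not taken from the proteome
freqs = dipeptide_per_mille(toy)
over = sorted(dp for dp, f in freqs.items() if f > UNIFORM_EXPECTED)
print(over)
```

Dipeptides in `over` would be the ones marked red under the uniform 2.5‰ baseline; everything below that value would be marked blue.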

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
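As a small worked example of the unit conversion, the snippet below takes the AlaAla value from the table and expresses it as a fraction, as a percentage, and relative to the uniform 2.5‰ baseline. The helper name `per_mille_to_fraction` is only illustrative.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

ala_ala = 19.205                 # AlaAla value from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)
percent = fraction * 100         # the same value as a percentage
uniform = 1000 / 400             # uniform baseline: 2.5 per mille
fold = ala_ala / uniform         # how many times above the uniform baseline

print(f"{fraction:.6f} {percent:.4f}% {fold:.3f}x")
# prints: 0.019205 1.9205% 7.682x
```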

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
19.205AlaAla: 19.205 ± 0.229
0.884AlaCys: 0.884 ± 0.034
6.849AlaAsp: 6.849 ± 0.108
7.943AlaGlu: 7.943 ± 0.108
3.268AlaPhe: 3.268 ± 0.069
11.878AlaGly: 11.878 ± 0.15
2.929AlaHis: 2.929 ± 0.058
3.496AlaIle: 3.496 ± 0.077
2.583AlaLys: 2.583 ± 0.071
13.098AlaLeu: 13.098 ± 0.176
2.761AlaMet: 2.761 ± 0.057
1.966AlaAsn: 1.966 ± 0.056
7.264AlaPro: 7.264 ± 0.131
4.781AlaGln: 4.781 ± 0.088
9.669AlaArg: 9.669 ± 0.144
6.666AlaSer: 6.666 ± 0.089
6.815AlaThr: 6.815 ± 0.096
12.146AlaVal: 12.146 ± 0.16
1.739AlaTrp: 1.739 ± 0.055
2.04AlaTyr: 2.04 ± 0.056
0.0AlaXaa: 0.0 ± 0.0
Cys
0.944CysAla: 0.944 ± 0.035
0.079CysCys: 0.079 ± 0.01
0.358CysAsp: 0.358 ± 0.02
0.373CysGlu: 0.373 ± 0.022
0.229CysPhe: 0.229 ± 0.015
0.786CysGly: 0.786 ± 0.036
0.193CysHis: 0.193 ± 0.017
0.211CysIle: 0.211 ± 0.016
0.086CysLys: 0.086 ± 0.011
0.641CysLeu: 0.641 ± 0.032
0.125CysMet: 0.125 ± 0.013
0.129CysAsn: 0.129 ± 0.013
0.409CysPro: 0.409 ± 0.023
0.161CysGln: 0.161 ± 0.015
0.501CysArg: 0.501 ± 0.024
0.44CysSer: 0.44 ± 0.026
0.446CysThr: 0.446 ± 0.027
0.609CysVal: 0.609 ± 0.027
0.093CysTrp: 0.093 ± 0.011
0.16CysTyr: 0.16 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
8.262AspAla: 8.262 ± 0.129
0.3AspCys: 0.3 ± 0.017
3.176AspAsp: 3.176 ± 0.068
4.002AspGlu: 4.002 ± 0.079
1.538AspPhe: 1.538 ± 0.04
5.205AspGly: 5.205 ± 0.083
1.417AspHis: 1.417 ± 0.043
1.821AspIle: 1.821 ± 0.053
1.072AspLys: 1.072 ± 0.043
5.636AspLeu: 5.636 ± 0.09
1.091AspMet: 1.091 ± 0.043
0.878AspAsn: 0.878 ± 0.039
4.649AspPro: 4.649 ± 0.092
1.566AspGln: 1.566 ± 0.048
4.307AspArg: 4.307 ± 0.09
2.703AspSer: 2.703 ± 0.054
3.435AspThr: 3.435 ± 0.063
5.528AspVal: 5.528 ± 0.086
0.807AspTrp: 0.807 ± 0.032
1.241AspTyr: 1.241 ± 0.045
0.0AspXaa: 0.0 ± 0.0
Glu
6.756GluAla: 6.756 ± 0.108
0.363GluCys: 0.363 ± 0.02
4.127GluAsp: 4.127 ± 0.075
3.346GluGlu: 3.346 ± 0.083
1.691GluPhe: 1.691 ± 0.047
4.258GluGly: 4.258 ± 0.078
2.242GluHis: 2.242 ± 0.062
2.619GluIle: 2.619 ± 0.06
1.626GluLys: 1.626 ± 0.056
6.693GluLeu: 6.693 ± 0.096
1.009GluMet: 1.009 ± 0.036
1.661GluAsn: 1.661 ± 0.05
3.201GluPro: 3.201 ± 0.072
2.913GluGln: 2.913 ± 0.066
5.335GluArg: 5.335 ± 0.11
2.911GluSer: 2.911 ± 0.067
3.268GluThr: 3.268 ± 0.066
4.393GluVal: 4.393 ± 0.085
0.755GluTrp: 0.755 ± 0.027
1.162GluTyr: 1.162 ± 0.038
0.0GluXaa: 0.0 ± 0.0
Phe
3.413PheAla: 3.413 ± 0.076
0.237PheCys: 0.237 ± 0.017
1.905PheAsp: 1.905 ± 0.05
1.729PheGlu: 1.729 ± 0.049
1.036PhePhe: 1.036 ± 0.04
2.885PheGly: 2.885 ± 0.076
0.639PheHis: 0.639 ± 0.03
0.942PheIle: 0.942 ± 0.046
0.556PheLys: 0.556 ± 0.03
2.653PheLeu: 2.653 ± 0.07
0.617PheMet: 0.617 ± 0.029
0.661PheAsn: 0.661 ± 0.037
1.367PhePro: 1.367 ± 0.041
0.75PheGln: 0.75 ± 0.029
1.655PheArg: 1.655 ± 0.048
1.746PheSer: 1.746 ± 0.051
2.132PheThr: 2.132 ± 0.05
2.682PheVal: 2.682 ± 0.073
0.422PheTrp: 0.422 ± 0.027
0.634PheTyr: 0.634 ± 0.029
0.0PheXaa: 0.0 ± 0.0
Gly
10.909GlyAla: 10.909 ± 0.136
0.707GlyCys: 0.707 ± 0.033
4.515GlyAsp: 4.515 ± 0.079
5.11GlyGlu: 5.11 ± 0.078
2.977GlyPhe: 2.977 ± 0.064
7.825GlyGly: 7.825 ± 0.152
2.218GlyHis: 2.218 ± 0.058
3.957GlyIle: 3.957 ± 0.07
2.246GlyLys: 2.246 ± 0.052
8.551GlyLeu: 8.551 ± 0.128
2.244GlyMet: 2.244 ± 0.055
1.729GlyAsn: 1.729 ± 0.052
4.601GlyPro: 4.601 ± 0.084
2.89GlyGln: 2.89 ± 0.062
6.572GlyArg: 6.572 ± 0.102
5.892GlySer: 5.892 ± 0.108
6.121GlyThr: 6.121 ± 0.098
7.788GlyVal: 7.788 ± 0.109
1.557GlyTrp: 1.557 ± 0.046
2.19GlyTyr: 2.19 ± 0.052
0.0GlyXaa: 0.0 ± 0.0
His
2.745HisAla: 2.745 ± 0.067
0.17HisCys: 0.17 ± 0.015
1.447HisAsp: 1.447 ± 0.038
1.604HisGlu: 1.604 ± 0.043
0.598HisPhe: 0.598 ± 0.028
2.568HisGly: 2.568 ± 0.062
0.798HisHis: 0.798 ± 0.03
0.619HisIle: 0.619 ± 0.027
0.368HisLys: 0.368 ± 0.023
2.224HisLeu: 2.224 ± 0.053
0.478HisMet: 0.478 ± 0.023
0.396HisAsn: 0.396 ± 0.024
1.933HisPro: 1.933 ± 0.048
0.661HisGln: 0.661 ± 0.03
2.249HisArg: 2.249 ± 0.062
1.123HisSer: 1.123 ± 0.036
1.411HisThr: 1.411 ± 0.039
2.15HisVal: 2.15 ± 0.059
0.349HisTrp: 0.349 ± 0.021
0.484HisTyr: 0.484 ± 0.027
0.0HisXaa: 0.0 ± 0.0
Ile
4.56IleAla: 4.56 ± 0.099
0.242IleCys: 0.242 ± 0.017
2.061IleAsp: 2.061 ± 0.053
2.051IleGlu: 2.051 ± 0.056
1.04IlePhe: 1.04 ± 0.036
3.264IleGly: 3.264 ± 0.065
0.758IleHis: 0.758 ± 0.032
1.184IleIle: 1.184 ± 0.046
0.801IleLys: 0.801 ± 0.039
3.059IleLeu: 3.059 ± 0.078
0.755IleMet: 0.755 ± 0.035
0.852IleAsn: 0.852 ± 0.031
2.041IlePro: 2.041 ± 0.057
0.982IleGln: 0.982 ± 0.034
2.259IleArg: 2.259 ± 0.05
2.002IleSer: 2.002 ± 0.052
2.483IleThr: 2.483 ± 0.062
3.213IleVal: 3.213 ± 0.066
0.303IleTrp: 0.303 ± 0.018
0.606IleTyr: 0.606 ± 0.028
0.0IleXaa: 0.0 ± 0.0
Lys
2.531LysAla: 2.531 ± 0.078
0.103LysCys: 0.103 ± 0.012
1.455LysAsp: 1.455 ± 0.047
1.11LysGlu: 1.11 ± 0.043
0.518LysPhe: 0.518 ± 0.027
1.633LysGly: 1.633 ± 0.049
0.484LysHis: 0.484 ± 0.024
0.987LysIle: 0.987 ± 0.044
0.913LysLys: 0.913 ± 0.041
1.98LysLeu: 1.98 ± 0.049
0.454LysMet: 0.454 ± 0.027
0.654LysAsn: 0.654 ± 0.035
1.194LysPro: 1.194 ± 0.053
0.719LysGln: 0.719 ± 0.032
1.473LysArg: 1.473 ± 0.047
1.127LysSer: 1.127 ± 0.041
1.284LysThr: 1.284 ± 0.046
1.768LysVal: 1.768 ± 0.055
0.232LysTrp: 0.232 ± 0.015
0.494LysTyr: 0.494 ± 0.024
0.0LysXaa: 0.0 ± 0.0
Leu
12.849LeuAla: 12.849 ± 0.166
0.749LeuCys: 0.749 ± 0.031
6.186LeuAsp: 6.186 ± 0.102
5.272LeuGlu: 5.272 ± 0.088
2.6LeuPhe: 2.6 ± 0.078
9.098LeuGly: 9.098 ± 0.122
2.152LeuHis: 2.152 ± 0.057
3.283LeuIle: 3.283 ± 0.077
1.884LeuLys: 1.884 ± 0.049
9.71LeuLeu: 9.71 ± 0.154
1.847LeuMet: 1.847 ± 0.054
2.049LeuAsn: 2.049 ± 0.052
5.516LeuPro: 5.516 ± 0.096
2.615LeuGln: 2.615 ± 0.055
7.46LeuArg: 7.46 ± 0.137
5.897LeuSer: 5.897 ± 0.097
6.279LeuThr: 6.279 ± 0.089
9.137LeuVal: 9.137 ± 0.127
1.296LeuTrp: 1.296 ± 0.044
1.667LeuTyr: 1.667 ± 0.045
0.0LeuXaa: 0.0 ± 0.0
Met
2.615MetAla: 2.615 ± 0.057
0.166MetCys: 0.166 ± 0.013
1.142MetAsp: 1.142 ± 0.042
0.896MetGlu: 0.896 ± 0.038
0.571MetPhe: 0.571 ± 0.032
1.808MetGly: 1.808 ± 0.049
0.425MetHis: 0.425 ± 0.023
0.822MetIle: 0.822 ± 0.032
0.478MetLys: 0.478 ± 0.027
1.991MetLeu: 1.991 ± 0.054
0.401MetMet: 0.401 ± 0.025
0.541MetAsn: 0.541 ± 0.026
1.14MetPro: 1.14 ± 0.04
0.574MetGln: 0.574 ± 0.025
1.401MetArg: 1.401 ± 0.045
1.8MetSer: 1.8 ± 0.051
1.691MetThr: 1.691 ± 0.042
1.677MetVal: 1.677 ± 0.043
0.242MetTrp: 0.242 ± 0.017
0.344MetTyr: 0.344 ± 0.024
0.0MetXaa: 0.0 ± 0.0
Asn
2.362AsnAla: 2.362 ± 0.056
0.121AsnCys: 0.121 ± 0.012
1.034AsnAsp: 1.034 ± 0.042
1.025AsnGlu: 1.025 ± 0.035
0.633AsnPhe: 0.633 ± 0.033
1.935AsnGly: 1.935 ± 0.051
0.469AsnHis: 0.469 ± 0.027
0.812AsnIle: 0.812 ± 0.034
0.483AsnLys: 0.483 ± 0.027
1.958AsnLeu: 1.958 ± 0.051
0.431AsnMet: 0.431 ± 0.024
0.487AsnAsn: 0.487 ± 0.029
1.638AsnPro: 1.638 ± 0.05
0.617AsnGln: 0.617 ± 0.03
1.411AsnArg: 1.411 ± 0.048
1.049AsnSer: 1.049 ± 0.034
1.198AsnThr: 1.198 ± 0.042
1.61AsnVal: 1.61 ± 0.05
0.318AsnTrp: 0.318 ± 0.021
0.46AsnTyr: 0.46 ± 0.024
0.0AsnXaa: 0.0 ± 0.0
Pro
7.655ProAla: 7.655 ± 0.129
0.309ProCys: 0.309 ± 0.024
3.767ProAsp: 3.767 ± 0.07
4.953ProGlu: 4.953 ± 0.092
1.557ProPhe: 1.557 ± 0.049
6.302ProGly: 6.302 ± 0.11
1.471ProHis: 1.471 ± 0.042
1.239ProIle: 1.239 ± 0.049
0.956ProLys: 0.956 ± 0.039
4.711ProLeu: 4.711 ± 0.081
1.007ProMet: 1.007 ± 0.036
0.943ProAsn: 0.943 ± 0.035
2.37ProPro: 2.37 ± 0.066
2.089ProGln: 2.089 ± 0.051
4.267ProArg: 4.267 ± 0.085
3.322ProSer: 3.322 ± 0.071
3.191ProThr: 3.191 ± 0.067
5.589ProVal: 5.589 ± 0.095
0.918ProTrp: 0.918 ± 0.036
1.059ProTyr: 1.059 ± 0.039
0.0ProXaa: 0.0 ± 0.0
Gln
3.576GlnAla: 3.576 ± 0.064
0.185GlnCys: 0.185 ± 0.014
2.267GlnAsp: 2.267 ± 0.059
1.869GlnGlu: 1.869 ± 0.054
0.893GlnPhe: 0.893 ± 0.033
2.562GlnGly: 2.562 ± 0.063
0.889GlnHis: 0.889 ± 0.033
1.253GlnIle: 1.253 ± 0.04
0.835GlnLys: 0.835 ± 0.033
3.561GlnLeu: 3.561 ± 0.073
0.617GlnMet: 0.617 ± 0.027
0.724GlnAsn: 0.724 ± 0.032
1.711GlnPro: 1.711 ± 0.048
1.532GlnGln: 1.532 ± 0.048
2.988GlnArg: 2.988 ± 0.071
1.509GlnSer: 1.509 ± 0.051
1.434GlnThr: 1.434 ± 0.045
2.333GlnVal: 2.333 ± 0.059
0.649GlnTrp: 0.649 ± 0.03
0.755GlnTyr: 0.755 ± 0.031
0.0GlnXaa: 0.0 ± 0.0
Arg
9.39ArgAla: 9.39 ± 0.124
0.531ArgCys: 0.531 ± 0.028
4.31ArgAsp: 4.31 ± 0.083
5.39ArgGlu: 5.39 ± 0.094
2.193ArgPhe: 2.193 ± 0.056
6.138ArgGly: 6.138 ± 0.085
1.855ArgHis: 1.855 ± 0.048
3.054ArgIle: 3.054 ± 0.061
1.541ArgLys: 1.541 ± 0.053
6.66ArgLeu: 6.66 ± 0.111
1.857ArgMet: 1.857 ± 0.047
1.487ArgAsn: 1.487 ± 0.05
4.064ArgPro: 4.064 ± 0.083
2.271ArgGln: 2.271 ± 0.055
6.747ArgArg: 6.747 ± 0.123
4.369ArgSer: 4.369 ± 0.084
4.91ArgThr: 4.91 ± 0.071
6.153ArgVal: 6.153 ± 0.098
1.256ArgTrp: 1.256 ± 0.039
1.485ArgTyr: 1.485 ± 0.039
0.0ArgXaa: 0.0 ± 0.0
Ser
7.058SerAla: 7.058 ± 0.098
0.366SerCys: 0.366 ± 0.023
2.754SerAsp: 2.754 ± 0.055
3.227SerGlu: 3.227 ± 0.075
1.719SerPhe: 1.719 ± 0.046
6.024SerGly: 6.024 ± 0.106
1.364SerHis: 1.364 ± 0.043
1.881SerIle: 1.881 ± 0.048
1.141SerLys: 1.141 ± 0.045
5.098SerLeu: 5.098 ± 0.075
1.389SerMet: 1.389 ± 0.046
1.073SerAsn: 1.073 ± 0.035
3.457SerPro: 3.457 ± 0.071
1.713SerGln: 1.713 ± 0.045
4.124SerArg: 4.124 ± 0.078
3.733SerSer: 3.733 ± 0.083
3.66SerThr: 3.66 ± 0.064
5.011SerVal: 5.011 ± 0.094
0.918SerTrp: 0.918 ± 0.035
1.262SerTyr: 1.262 ± 0.041
0.0SerXaa: 0.0 ± 0.0
Thr
8.091ThrAla: 8.091 ± 0.107
0.382ThrCys: 0.382 ± 0.023
3.548ThrAsp: 3.548 ± 0.074
3.443ThrGlu: 3.443 ± 0.066
1.717ThrPhe: 1.717 ± 0.047
6.458ThrGly: 6.458 ± 0.094
1.272ThrHis: 1.272 ± 0.042
1.779ThrIle: 1.779 ± 0.045
1.07ThrLys: 1.07 ± 0.038
5.325ThrLeu: 5.325 ± 0.087
1.183ThrMet: 1.183 ± 0.034
1.086ThrAsn: 1.086 ± 0.039
4.196ThrPro: 4.196 ± 0.07
1.671ThrGln: 1.671 ± 0.054
4.07ThrArg: 4.07 ± 0.068
3.447ThrSer: 3.447 ± 0.08
3.711ThrThr: 3.711 ± 0.074
6.714ThrVal: 6.714 ± 0.112
0.908ThrTrp: 0.908 ± 0.039
1.115ThrTyr: 1.115 ± 0.037
0.0ThrXaa: 0.0 ± 0.0
Val
11.205ValAla: 11.205 ± 0.123
0.779ValCys: 0.779 ± 0.029
5.587ValAsp: 5.587 ± 0.093
5.075ValGlu: 5.075 ± 0.082
2.655ValPhe: 2.655 ± 0.063
7.02ValGly: 7.02 ± 0.114
2.097ValHis: 2.097 ± 0.042
3.436ValIle: 3.436 ± 0.069
1.785ValLys: 1.785 ± 0.048
10.223ValLeu: 10.223 ± 0.15
1.818ValMet: 1.818 ± 0.052
1.895ValAsn: 1.895 ± 0.055
5.198ValPro: 5.198 ± 0.08
2.48ValGln: 2.48 ± 0.055
6.334ValArg: 6.334 ± 0.085
5.283ValSer: 5.283 ± 0.087
5.754ValThr: 5.754 ± 0.097
9.572ValVal: 9.572 ± 0.143
1.13ValTrp: 1.13 ± 0.04
1.513ValTyr: 1.513 ± 0.046
0.0ValXaa: 0.0 ± 0.0
Trp
1.62TrpAla: 1.62 ± 0.047
0.145TrpCys: 0.145 ± 0.015
0.875TrpAsp: 0.875 ± 0.038
0.719TrpGlu: 0.719 ± 0.026
0.572TrpPhe: 0.572 ± 0.029
1.072TrpGly: 1.072 ± 0.036
0.339TrpHis: 0.339 ± 0.021
0.586TrpIle: 0.586 ± 0.025
0.286TrpLys: 0.286 ± 0.022
1.858TrpLeu: 1.858 ± 0.056
0.272TrpMet: 0.272 ± 0.019
0.414TrpAsn: 0.414 ± 0.021
0.673TrpPro: 0.673 ± 0.032
0.513TrpGln: 0.513 ± 0.028
1.218TrpArg: 1.218 ± 0.039
0.849TrpSer: 0.849 ± 0.032
0.764TrpThr: 0.764 ± 0.036
1.132TrpVal: 1.132 ± 0.045
0.386TrpTrp: 0.386 ± 0.025
0.29TrpTyr: 0.29 ± 0.02
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.235TyrAla: 2.235 ± 0.049
0.153TyrCys: 0.153 ± 0.014
1.268TyrAsp: 1.268 ± 0.042
1.191TyrGlu: 1.191 ± 0.043
0.656TyrPhe: 0.656 ± 0.034
1.879TyrGly: 1.879 ± 0.054
0.364TyrHis: 0.364 ± 0.021
0.537TyrIle: 0.537 ± 0.028
0.402TyrLys: 0.402 ± 0.025
1.92TyrLeu: 1.92 ± 0.05
0.368TyrMet: 0.368 ± 0.023
0.429TyrAsn: 0.429 ± 0.027
1.054TyrPro: 1.054 ± 0.037
0.643TyrGln: 0.643 ± 0.031
1.618TyrArg: 1.618 ± 0.051
1.102TyrSer: 1.102 ± 0.039
1.199TyrThr: 1.199 ± 0.04
1.638TyrVal: 1.638 ± 0.041
0.306TyrTrp: 0.306 ± 0.019
0.48TyrTyr: 0.48 ± 0.026
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2352 proteins (793120 amino acids)
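The table entries above follow a regular pattern (displayed value, dipeptide name, then value ± error), so they can be read back into a lookup structure. The following is a rough sketch that assumes exactly that line layout; the regular expression and the function name `parse_entries` are illustrative and not part of Proteome-pI.

```python
import re

# One cell per line: value, first residue, second residue, ": ", value, " ± ", error.
ENTRY = re.compile(r"^([\d.]+)([A-Z][a-z]{2})([A-Z][a-z]{2}): ([\d.]+) ± ([\d.]+)$")

def parse_entries(lines):
    """Map (first, second) residue names to (value, error), both in per mille."""
    table = {}
    for line in lines:
        m = ENTRY.match(line.strip())
        if m is None:
            continue  # row labels such as "Ala" and other non-entry lines
        shown, first, second, value, error = m.groups()
        if float(shown) != float(value):
            raise ValueError(f"inconsistent entry: {line!r}")
        table[(first, second)] = (float(value), float(error))
    return table

sample = [
    "Ala",
    "19.205AlaAla: 19.205 ± 0.229",
    "0.884AlaCys: 0.884 ± 0.034",
    "0.0AlaXaa: 0.0 ± 0.0",
]
table = parse_entries(sample)
print(table[("Ala", "Ala")])
```

Keying the result by a `(first, second)` tuple keeps the row and column residues separate, which makes it straightforward to rebuild the 21 × 21 matrix later.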

Note: The error has been estimated by bootstrapping (×100) at the protein level.
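One plausible reading of protein-level bootstrapping is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is reported as the error. The exact procedure behind the table is not described here, so the use of the standard deviation, the overlapping-pair counting, and the names `pair_counts` and `bootstrap_error` are all assumptions.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille,
    estimated by resampling whole proteins with replacement n_boot times."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Hypothetical toy input, one-letter codes; not taken from the proteome.
mean, err = bootstrap_error(["MAAAG", "MAAG", "MGGA", "MAGA"], "AA")
print(f"{mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.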

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski