Amino acid dipepetide frequency for Nosocomiicoccus massiliensis

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.814AlaAla: 3.814 ± 0.119
0.425AlaCys: 0.425 ± 0.033
2.823AlaAsp: 2.823 ± 0.076
3.649AlaGlu: 3.649 ± 0.098
2.866AlaPhe: 2.866 ± 0.079
3.992AlaGly: 3.992 ± 0.097
1.249AlaHis: 1.249 ± 0.06
5.042AlaIle: 5.042 ± 0.11
3.779AlaLys: 3.779 ± 0.093
6.668AlaLeu: 6.668 ± 0.129
1.697AlaMet: 1.697 ± 0.054
2.458AlaAsn: 2.458 ± 0.067
1.662AlaPro: 1.662 ± 0.052
1.631AlaGln: 1.631 ± 0.061
2.34AlaArg: 2.34 ± 0.083
3.425AlaSer: 3.425 ± 0.094
3.251AlaThr: 3.251 ± 0.09
4.687AlaVal: 4.687 ± 0.125
0.339AlaTrp: 0.339 ± 0.03
2.135AlaTyr: 2.135 ± 0.065
0.0AlaXaa: 0.0 ± 0.0
Cys
0.3CysAla: 0.3 ± 0.03
0.045CysCys: 0.045 ± 0.011
0.292CysAsp: 0.292 ± 0.022
0.323CysGlu: 0.323 ± 0.031
0.185CysPhe: 0.185 ± 0.019
0.483CysGly: 0.483 ± 0.037
0.154CysHis: 0.154 ± 0.018
0.395CysIle: 0.395 ± 0.031
0.286CysLys: 0.286 ± 0.024
0.374CysLeu: 0.374 ± 0.029
0.095CysMet: 0.095 ± 0.014
0.21CysAsn: 0.21 ± 0.021
0.242CysPro: 0.242 ± 0.024
0.148CysGln: 0.148 ± 0.016
0.166CysArg: 0.166 ± 0.021
0.366CysSer: 0.366 ± 0.028
0.29CysThr: 0.29 ± 0.022
0.298CysVal: 0.298 ± 0.026
0.021CysTrp: 0.021 ± 0.007
0.205CysTyr: 0.205 ± 0.019
0.0CysXaa: 0.0 ± 0.0
Asp
3.744AspAla: 3.744 ± 0.099
0.255AspCys: 0.255 ± 0.027
4.299AspAsp: 4.299 ± 0.112
6.123AspGlu: 6.123 ± 0.118
3.029AspPhe: 3.029 ± 0.078
3.89AspGly: 3.89 ± 0.114
1.083AspHis: 1.083 ± 0.047
5.603AspIle: 5.603 ± 0.134
4.389AspLys: 4.389 ± 0.096
5.416AspLeu: 5.416 ± 0.112
1.543AspMet: 1.543 ± 0.057
3.166AspAsn: 3.166 ± 0.081
1.65AspPro: 1.65 ± 0.071
1.383AspGln: 1.383 ± 0.067
2.328AspArg: 2.328 ± 0.079
3.119AspSer: 3.119 ± 0.089
3.499AspThr: 3.499 ± 0.083
5.123AspVal: 5.123 ± 0.111
0.44AspTrp: 0.44 ± 0.029
3.0AspTyr: 3.0 ± 0.089
0.0AspXaa: 0.0 ± 0.0
Glu
5.414GluAla: 5.414 ± 0.121
0.286GluCys: 0.286 ± 0.026
5.225GluAsp: 5.225 ± 0.109
7.305GluGlu: 7.305 ± 0.165
3.023GluPhe: 3.023 ± 0.084
3.824GluGly: 3.824 ± 0.092
1.664GluHis: 1.664 ± 0.054
6.181GluIle: 6.181 ± 0.122
6.214GluLys: 6.214 ± 0.13
7.557GluLeu: 7.557 ± 0.154
2.453GluMet: 2.453 ± 0.081
4.479GluAsn: 4.479 ± 0.089
1.761GluPro: 1.761 ± 0.07
2.509GluGln: 2.509 ± 0.072
3.746GluArg: 3.746 ± 0.101
4.539GluSer: 4.539 ± 0.111
4.541GluThr: 4.541 ± 0.112
5.836GluVal: 5.836 ± 0.137
0.577GluTrp: 0.577 ± 0.04
3.062GluTyr: 3.062 ± 0.081
0.0GluXaa: 0.0 ± 0.0
Phe
2.236PheAla: 2.236 ± 0.07
0.236PheCys: 0.236 ± 0.023
2.903PheAsp: 2.903 ± 0.074
3.088PheGlu: 3.088 ± 0.087
2.073PhePhe: 2.073 ± 0.083
3.117PheGly: 3.117 ± 0.089
0.756PheHis: 0.756 ± 0.04
4.325PheIle: 4.325 ± 0.119
3.688PheLys: 3.688 ± 0.104
4.221PheLeu: 4.221 ± 0.126
1.2PheMet: 1.2 ± 0.051
2.953PheAsn: 2.953 ± 0.093
1.114PhePro: 1.114 ± 0.046
1.149PheGln: 1.149 ± 0.051
1.428PheArg: 1.428 ± 0.068
3.055PheSer: 3.055 ± 0.102
2.947PheThr: 2.947 ± 0.081
3.38PheVal: 3.38 ± 0.097
0.288PheTrp: 0.288 ± 0.028
1.788PheTyr: 1.788 ± 0.058
0.0PheXaa: 0.0 ± 0.0
Gly
4.128GlyAla: 4.128 ± 0.115
0.362GlyCys: 0.362 ± 0.026
3.59GlyAsp: 3.59 ± 0.086
4.43GlyGlu: 4.43 ± 0.108
3.049GlyPhe: 3.049 ± 0.076
4.292GlyGly: 4.292 ± 0.131
1.457GlyHis: 1.457 ± 0.057
5.33GlyIle: 5.33 ± 0.119
4.233GlyLys: 4.233 ± 0.099
5.829GlyLeu: 5.829 ± 0.139
1.742GlyMet: 1.742 ± 0.064
2.612GlyAsn: 2.612 ± 0.088
1.492GlyPro: 1.492 ± 0.061
1.876GlyGln: 1.876 ± 0.078
2.39GlyArg: 2.39 ± 0.075
3.329GlySer: 3.329 ± 0.085
3.805GlyThr: 3.805 ± 0.092
4.993GlyVal: 4.993 ± 0.114
0.46GlyTrp: 0.46 ± 0.032
2.55GlyTyr: 2.55 ± 0.076
0.0GlyXaa: 0.0 ± 0.0
His
1.319HisAla: 1.319 ± 0.048
0.127HisCys: 0.127 ± 0.016
1.395HisAsp: 1.395 ± 0.054
1.455HisGlu: 1.455 ± 0.055
1.021HisPhe: 1.021 ± 0.046
1.397HisGly: 1.397 ± 0.06
0.592HisHis: 0.592 ± 0.034
1.712HisIle: 1.712 ± 0.058
1.163HisLys: 1.163 ± 0.047
2.092HisLeu: 2.092 ± 0.071
0.545HisMet: 0.545 ± 0.032
1.073HisAsn: 1.073 ± 0.053
1.007HisPro: 1.007 ± 0.047
0.616HisGln: 0.616 ± 0.039
0.884HisArg: 0.884 ± 0.047
1.071HisSer: 1.071 ± 0.051
1.151HisThr: 1.151 ± 0.046
1.512HisVal: 1.512 ± 0.06
0.138HisTrp: 0.138 ± 0.017
0.943HisTyr: 0.943 ± 0.047
0.0HisXaa: 0.0 ± 0.0
Ile
5.246IleAla: 5.246 ± 0.12
0.499IleCys: 0.499 ± 0.033
5.957IleAsp: 5.957 ± 0.13
7.027IleGlu: 7.027 ± 0.13
3.573IlePhe: 3.573 ± 0.107
5.757IleGly: 5.757 ± 0.132
1.798IleHis: 1.798 ± 0.065
7.397IleIle: 7.397 ± 0.191
5.449IleLys: 5.449 ± 0.124
7.666IleLeu: 7.666 ± 0.173
1.921IleMet: 1.921 ± 0.058
3.937IleAsn: 3.937 ± 0.108
3.008IlePro: 3.008 ± 0.089
2.626IleGln: 2.626 ± 0.073
2.932IleArg: 2.932 ± 0.085
5.042IleSer: 5.042 ± 0.104
4.899IleThr: 4.899 ± 0.128
6.454IleVal: 6.454 ± 0.133
0.454IleTrp: 0.454 ± 0.031
2.91IleTyr: 2.91 ± 0.091
0.0IleXaa: 0.0 ± 0.0
Lys
3.744LysAla: 3.744 ± 0.09
0.277LysCys: 0.277 ± 0.026
5.406LysAsp: 5.406 ± 0.121
7.845LysGlu: 7.845 ± 0.128
2.531LysPhe: 2.531 ± 0.071
3.692LysGly: 3.692 ± 0.092
1.662LysHis: 1.662 ± 0.06
5.086LysIle: 5.086 ± 0.113
5.566LysLys: 5.566 ± 0.144
5.794LysLeu: 5.794 ± 0.116
2.102LysMet: 2.102 ± 0.056
4.2LysAsn: 4.2 ± 0.105
1.843LysPro: 1.843 ± 0.06
2.443LysGln: 2.443 ± 0.088
3.616LysArg: 3.616 ± 0.093
3.943LysSer: 3.943 ± 0.093
4.009LysThr: 4.009 ± 0.092
4.818LysVal: 4.818 ± 0.103
0.432LysTrp: 0.432 ± 0.03
2.934LysTyr: 2.934 ± 0.077
0.0LysXaa: 0.0 ± 0.0
Leu
5.188LeuAla: 5.188 ± 0.107
0.405LeuCys: 0.405 ± 0.029
5.647LeuAsp: 5.647 ± 0.1
7.241LeuGlu: 7.241 ± 0.142
4.461LeuPhe: 4.461 ± 0.127
5.716LeuGly: 5.716 ± 0.127
1.642LeuHis: 1.642 ± 0.064
7.841LeuIle: 7.841 ± 0.17
7.502LeuLys: 7.502 ± 0.147
8.947LeuLeu: 8.947 ± 0.192
2.712LeuMet: 2.712 ± 0.08
5.739LeuAsn: 5.739 ± 0.133
3.164LeuPro: 3.164 ± 0.085
2.614LeuGln: 2.614 ± 0.078
3.302LeuArg: 3.302 ± 0.087
6.645LeuSer: 6.645 ± 0.125
5.593LeuThr: 5.593 ± 0.117
5.994LeuVal: 5.994 ± 0.117
0.547LeuTrp: 0.547 ± 0.035
3.421LeuTyr: 3.421 ± 0.091
0.0LeuXaa: 0.0 ± 0.0
Met
1.681MetAla: 1.681 ± 0.057
0.095MetCys: 0.095 ± 0.015
1.518MetAsp: 1.518 ± 0.057
1.545MetGlu: 1.545 ± 0.059
1.245MetPhe: 1.245 ± 0.056
1.377MetGly: 1.377 ± 0.058
0.508MetHis: 0.508 ± 0.034
2.433MetIle: 2.433 ± 0.079
2.225MetLys: 2.225 ± 0.074
2.603MetLeu: 2.603 ± 0.084
0.892MetMet: 0.892 ± 0.049
1.677MetAsn: 1.677 ± 0.051
0.861MetPro: 0.861 ± 0.041
0.727MetGln: 0.727 ± 0.032
1.177MetArg: 1.177 ± 0.053
1.946MetSer: 1.946 ± 0.064
1.823MetThr: 1.823 ± 0.066
1.572MetVal: 1.572 ± 0.058
0.156MetTrp: 0.156 ± 0.019
0.945MetTyr: 0.945 ± 0.048
0.0MetXaa: 0.0 ± 0.0
Asn
2.834AsnAla: 2.834 ± 0.078
0.23AsnCys: 0.23 ± 0.022
3.692AsnAsp: 3.692 ± 0.09
4.494AsnGlu: 4.494 ± 0.092
2.236AsnPhe: 2.236 ± 0.068
3.101AsnGly: 3.101 ± 0.089
1.21AsnHis: 1.21 ± 0.051
4.835AsnIle: 4.835 ± 0.113
3.9AsnLys: 3.9 ± 0.098
4.062AsnLeu: 4.062 ± 0.098
1.348AsnMet: 1.348 ± 0.056
2.914AsnAsn: 2.914 ± 0.093
1.781AsnPro: 1.781 ± 0.056
1.574AsnGln: 1.574 ± 0.068
2.262AsnArg: 2.262 ± 0.077
2.482AsnSer: 2.482 ± 0.072
2.844AsnThr: 2.844 ± 0.07
4.071AsnVal: 4.071 ± 0.089
0.329AsnTrp: 0.329 ± 0.026
2.301AsnTyr: 2.301 ± 0.072
0.0AsnXaa: 0.0 ± 0.0
Pro
1.49ProAla: 1.49 ± 0.055
0.15ProCys: 0.15 ± 0.02
1.592ProAsp: 1.592 ± 0.053
2.671ProGlu: 2.671 ± 0.071
1.701ProPhe: 1.701 ± 0.065
1.973ProGly: 1.973 ± 0.073
0.717ProHis: 0.717 ± 0.036
2.503ProIle: 2.503 ± 0.08
2.084ProLys: 2.084 ± 0.065
2.877ProLeu: 2.877 ± 0.076
0.865ProMet: 0.865 ± 0.044
1.494ProAsn: 1.494 ± 0.06
0.703ProPro: 0.703 ± 0.042
0.69ProGln: 0.69 ± 0.04
0.941ProArg: 0.941 ± 0.046
1.845ProSer: 1.845 ± 0.061
1.802ProThr: 1.802 ± 0.064
2.538ProVal: 2.538 ± 0.068
0.222ProTrp: 0.222 ± 0.02
1.311ProTyr: 1.311 ± 0.056
0.0ProXaa: 0.0 ± 0.0
Gln
1.675GlnAla: 1.675 ± 0.061
0.119GlnCys: 0.119 ± 0.015
1.547GlnAsp: 1.547 ± 0.056
1.864GlnGlu: 1.864 ± 0.066
1.759GlnPhe: 1.759 ± 0.057
1.475GlnGly: 1.475 ± 0.052
0.736GlnHis: 0.736 ± 0.047
2.252GlnIle: 2.252 ± 0.068
2.106GlnLys: 2.106 ± 0.069
3.232GlnLeu: 3.232 ± 0.093
0.863GlnMet: 0.863 ± 0.042
1.586GlnAsn: 1.586 ± 0.07
0.832GlnPro: 0.832 ± 0.046
0.974GlnGln: 0.974 ± 0.048
1.13GlnArg: 1.13 ± 0.049
2.065GlnSer: 2.065 ± 0.065
1.541GlnThr: 1.541 ± 0.057
1.837GlnVal: 1.837 ± 0.056
0.185GlnTrp: 0.185 ± 0.021
1.253GlnTyr: 1.253 ± 0.053
0.0GlnXaa: 0.0 ± 0.0
Arg
2.347ArgAla: 2.347 ± 0.066
0.154ArgCys: 0.154 ± 0.018
2.509ArgAsp: 2.509 ± 0.078
3.407ArgGlu: 3.407 ± 0.105
1.81ArgPhe: 1.81 ± 0.051
2.377ArgGly: 2.377 ± 0.076
0.886ArgHis: 0.886 ± 0.05
3.037ArgIle: 3.037 ± 0.085
2.877ArgLys: 2.877 ± 0.081
3.676ArgLeu: 3.676 ± 0.106
1.184ArgMet: 1.184 ± 0.054
1.911ArgAsn: 1.911 ± 0.064
1.243ArgPro: 1.243 ± 0.062
1.508ArgGln: 1.508 ± 0.059
1.993ArgArg: 1.993 ± 0.07
2.018ArgSer: 2.018 ± 0.076
2.038ArgThr: 2.038 ± 0.065
2.733ArgVal: 2.733 ± 0.083
0.191ArgTrp: 0.191 ± 0.022
1.621ArgTyr: 1.621 ± 0.06
0.0ArgXaa: 0.0 ± 0.0
Ser
3.094SerAla: 3.094 ± 0.077
0.259SerCys: 0.259 ± 0.022
3.409SerAsp: 3.409 ± 0.086
4.428SerGlu: 4.428 ± 0.104
2.959SerPhe: 2.959 ± 0.083
4.052SerGly: 4.052 ± 0.099
1.344SerHis: 1.344 ± 0.048
5.357SerIle: 5.357 ± 0.114
4.348SerLys: 4.348 ± 0.097
5.731SerLeu: 5.731 ± 0.138
1.504SerMet: 1.504 ± 0.057
3.078SerAsn: 3.078 ± 0.089
1.726SerPro: 1.726 ± 0.052
1.724SerGln: 1.724 ± 0.06
2.361SerArg: 2.361 ± 0.07
3.436SerSer: 3.436 ± 0.092
3.189SerThr: 3.189 ± 0.079
4.239SerVal: 4.239 ± 0.098
0.331SerTrp: 0.331 ± 0.029
2.464SerTyr: 2.464 ± 0.073
0.0SerXaa: 0.0 ± 0.0
Thr
3.173ThrAla: 3.173 ± 0.093
0.298ThrCys: 0.298 ± 0.03
3.475ThrAsp: 3.475 ± 0.079
4.044ThrGlu: 4.044 ± 0.099
2.838ThrPhe: 2.838 ± 0.071
3.826ThrGly: 3.826 ± 0.1
1.315ThrHis: 1.315 ± 0.051
5.116ThrIle: 5.116 ± 0.11
3.629ThrLys: 3.629 ± 0.094
5.866ThrLeu: 5.866 ± 0.115
1.352ThrMet: 1.352 ± 0.049
2.727ThrAsn: 2.727 ± 0.077
2.371ThrPro: 2.371 ± 0.08
1.453ThrGln: 1.453 ± 0.07
2.155ThrArg: 2.155 ± 0.077
3.316ThrSer: 3.316 ± 0.086
3.103ThrThr: 3.103 ± 0.085
4.646ThrVal: 4.646 ± 0.127
0.325ThrTrp: 0.325 ± 0.026
2.412ThrTyr: 2.412 ± 0.073
0.0ThrXaa: 0.0 ± 0.0
Val
4.087ValAla: 4.087 ± 0.11
0.366ValCys: 0.366 ± 0.024
4.584ValAsp: 4.584 ± 0.096
5.71ValGlu: 5.71 ± 0.123
3.29ValPhe: 3.29 ± 0.101
4.747ValGly: 4.747 ± 0.125
1.368ValHis: 1.368 ± 0.06
6.413ValIle: 6.413 ± 0.118
5.349ValLys: 5.349 ± 0.116
7.251ValLeu: 7.251 ± 0.143
1.827ValMet: 1.827 ± 0.068
3.68ValAsn: 3.68 ± 0.078
2.392ValPro: 2.392 ± 0.072
1.946ValGln: 1.946 ± 0.07
2.492ValArg: 2.492 ± 0.085
4.753ValSer: 4.753 ± 0.097
4.601ValThr: 4.601 ± 0.134
5.501ValVal: 5.501 ± 0.123
0.436ValTrp: 0.436 ± 0.034
2.507ValTyr: 2.507 ± 0.071
0.0ValXaa: 0.0 ± 0.0
Trp
0.345TrpAla: 0.345 ± 0.028
0.049TrpCys: 0.049 ± 0.01
0.308TrpAsp: 0.308 ± 0.03
0.343TrpGlu: 0.343 ± 0.029
0.36TrpPhe: 0.36 ± 0.027
0.37TrpGly: 0.37 ± 0.029
0.142TrpHis: 0.142 ± 0.016
0.575TrpIle: 0.575 ± 0.033
0.331TrpLys: 0.331 ± 0.026
0.738TrpLeu: 0.738 ± 0.041
0.214TrpMet: 0.214 ± 0.023
0.368TrpAsn: 0.368 ± 0.031
0.187TrpPro: 0.187 ± 0.024
0.212TrpGln: 0.212 ± 0.022
0.205TrpArg: 0.205 ± 0.021
0.366TrpSer: 0.366 ± 0.028
0.351TrpThr: 0.351 ± 0.027
0.384TrpVal: 0.384 ± 0.029
0.066TrpTrp: 0.066 ± 0.011
0.298TrpTyr: 0.298 ± 0.027
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.077TyrAla: 2.077 ± 0.073
0.238TyrCys: 0.238 ± 0.02
2.91TyrAsp: 2.91 ± 0.083
3.203TyrGlu: 3.203 ± 0.089
1.936TyrPhe: 1.936 ± 0.081
2.49TyrGly: 2.49 ± 0.074
0.929TyrHis: 0.929 ± 0.044
3.123TyrIle: 3.123 ± 0.093
2.829TyrLys: 2.829 ± 0.082
3.719TyrLeu: 3.719 ± 0.093
0.947TyrMet: 0.947 ± 0.048
2.24TyrAsn: 2.24 ± 0.063
1.128TyrPro: 1.128 ± 0.049
1.229TyrGln: 1.229 ± 0.049
1.668TyrArg: 1.668 ± 0.06
2.271TyrSer: 2.271 ± 0.073
2.162TyrThr: 2.162 ± 0.073
2.684TyrVal: 2.684 ± 0.074
0.277TyrTrp: 0.277 ± 0.026
1.718TyrTyr: 1.718 ± 0.072
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1652 proteins (486671 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski