Amino acid dipeptide frequency for Candidatus Cryosericum terrychapinii

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
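
A minimal sketch of how such frequencies could be computed is given here. It assumes that dipeptides are counted as overlapping pairs within each protein (never across protein boundaries) and that any non-standard letter is mapped to X (Xaa); the database's actual pipeline is not described on this page, and the function names and toy sequences are hypothetical.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids (one-letter codes)

def dipeptide_counts(sequence):
    """Count overlapping dipeptides in one protein; non-standard letters become X (Xaa)."""
    cleaned = "".join(aa if aa in STANDARD else "X" for aa in sequence.upper())
    return Counter(cleaned[i:i + 2] for i in range(len(cleaned) - 1))

def dipeptide_per_mille(proteins):
    """Pool counts over all proteins and return dipeptide frequencies in per mille."""
    total = Counter()
    for seq in proteins:
        total.update(dipeptide_counts(seq))
    n = sum(total.values())
    if n == 0:
        return {}
    return {pair: 1000.0 * count / n for pair, count in total.items()}

# Hypothetical toy sequences, not taken from this proteome.
toy_proteome = ["MAAL", "AALU"]
freqs = dipeptide_per_mille(toy_proteome)
```

In the toy input, "U" is not one of the 20 standard letters, so the last dipeptide of the second sequence is counted as LX.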

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely, each would occur with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
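
The uniform expectation and the over/under marking can be sketched as follows. This assumes the marking is a plain comparison against the uniform expectation, which the page does not state explicitly; the function names are hypothetical, and the two observed values are taken from the table.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected frequency (in per mille) if every dipeptide were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the expectation (assumed rule: simple comparison)."""
    if observed_per_mille > expected_per_mille:
        return "over"    # would be shown in red
    if observed_per_mille < expected_per_mille:
        return "under"   # would be shown in blue
    return "as expected"

expected_20 = uniform_expectation_per_mille(20)  # 400 dipeptides
expected_21 = uniform_expectation_per_mille(21)  # 441 dipeptides, Xaa included

ala_ala = classify(10.171, expected_20)  # AlaAla value from the table
cys_cys = classify(0.173, expected_20)   # CysCys value from the table
```

With 20 amino acids the expectation is 2.5‰, so AlaAla (10.171‰) is classified as overrepresented and CysCys (0.173‰) as underrepresented.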

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
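
As a small worked conversion, the helpers below turn a per mille value into a fraction or a percentage. The function names are hypothetical; the input is the AlaAla value from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3

def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille / 10.0

ala_ala_fraction = per_mille_to_fraction(10.171)  # about 0.010171
ala_ala_percent = per_mille_to_percent(10.171)    # about 1.0171 %
```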

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.171 ± 0.175
AlaCys: 1.164 ± 0.057
AlaAsp: 5.072 ± 0.112
AlaGlu: 5.295 ± 0.127
AlaPhe: 3.61 ± 0.101
AlaGly: 8.529 ± 0.145
AlaHis: 1.962 ± 0.067
AlaIle: 5.435 ± 0.123
AlaLys: 3.66 ± 0.102
AlaLeu: 10.261 ± 0.135
AlaMet: 2.743 ± 0.078
AlaAsn: 2.507 ± 0.098
AlaPro: 3.586 ± 0.107
AlaGln: 3.153 ± 0.082
AlaArg: 6.171 ± 0.136
AlaSer: 6.034 ± 0.127
AlaThr: 5.822 ± 0.148
AlaVal: 8.333 ± 0.159
AlaTrp: 1.011 ± 0.054
AlaTyr: 2.516 ± 0.07
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.887 ± 0.052
CysCys: 0.173 ± 0.02
CysAsp: 0.644 ± 0.039
CysGlu: 0.504 ± 0.032
CysPhe: 0.351 ± 0.025
CysGly: 1.092 ± 0.065
CysHis: 0.248 ± 0.023
CysIle: 0.556 ± 0.041
CysLys: 0.318 ± 0.029
CysLeu: 0.917 ± 0.049
CysMet: 0.266 ± 0.025
CysAsn: 0.286 ± 0.026
CysPro: 0.687 ± 0.049
CysGln: 0.284 ± 0.028
CysArg: 0.667 ± 0.04
CysSer: 0.75 ± 0.041
CysThr: 0.705 ± 0.046
CysVal: 0.775 ± 0.043
CysTrp: 0.126 ± 0.018
CysTyr: 0.268 ± 0.028
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.293 ± 0.111
AspCys: 0.509 ± 0.037
AspAsp: 2.613 ± 0.085
AspGlu: 3.448 ± 0.101
AspPhe: 2.056 ± 0.073
AspGly: 4.423 ± 0.102
AspHis: 1.097 ± 0.052
AspIle: 3.356 ± 0.088
AspLys: 2.252 ± 0.09
AspLeu: 5.585 ± 0.102
AspMet: 1.421 ± 0.061
AspAsn: 1.484 ± 0.061
AspPro: 2.813 ± 0.075
AspGln: 1.754 ± 0.06
AspArg: 3.32 ± 0.097
AspSer: 2.761 ± 0.074
AspThr: 3.092 ± 0.096
AspVal: 4.948 ± 0.101
AspTrp: 0.664 ± 0.034
AspTyr: 1.64 ± 0.062
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.626 ± 0.14
GluCys: 0.504 ± 0.041
GluAsp: 2.811 ± 0.098
GluGlu: 4.124 ± 0.133
GluPhe: 1.919 ± 0.069
GluGly: 4.137 ± 0.093
GluHis: 1.446 ± 0.067
GluIle: 3.198 ± 0.095
GluLys: 2.745 ± 0.093
GluLeu: 6.034 ± 0.129
GluMet: 1.491 ± 0.054
GluAsn: 1.804 ± 0.074
GluPro: 2.227 ± 0.082
GluGln: 2.466 ± 0.082
GluArg: 4.164 ± 0.115
GluSer: 3.151 ± 0.084
GluThr: 3.286 ± 0.071
GluVal: 4.52 ± 0.115
GluTrp: 0.523 ± 0.037
GluTyr: 1.498 ± 0.069
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.568 ± 0.089
PheCys: 0.446 ± 0.031
PheAsp: 2.286 ± 0.072
PheGlu: 2.068 ± 0.066
PhePhe: 1.556 ± 0.063
PheGly: 3.39 ± 0.087
PheHis: 0.739 ± 0.039
PheIle: 2.038 ± 0.072
PheLys: 1.259 ± 0.061
PheLeu: 3.75 ± 0.098
PheMet: 0.923 ± 0.051
PheAsn: 1.072 ± 0.049
PhePro: 1.617 ± 0.062
PheGln: 1.043 ± 0.046
PheArg: 1.95 ± 0.067
PheSer: 2.54 ± 0.082
PheThr: 2.227 ± 0.075
PheVal: 3.529 ± 0.111
PheTrp: 0.459 ± 0.037
PheTyr: 0.984 ± 0.05
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.14 ± 0.14
GlyCys: 0.991 ± 0.062
GlyAsp: 4.158 ± 0.092
GlyGlu: 4.067 ± 0.11
GlyPhe: 3.281 ± 0.077
GlyGly: 6.313 ± 0.167
GlyHis: 1.73 ± 0.061
GlyIle: 5.552 ± 0.131
GlyLys: 4.182 ± 0.118
GlyLeu: 8.212 ± 0.134
GlyMet: 2.349 ± 0.081
GlyAsn: 2.304 ± 0.085
GlyPro: 2.561 ± 0.08
GlyGln: 2.435 ± 0.081
GlyArg: 4.57 ± 0.101
GlySer: 5.441 ± 0.137
GlyThr: 5.786 ± 0.152
GlyVal: 6.678 ± 0.129
GlyTrp: 1.061 ± 0.054
GlyTyr: 2.599 ± 0.087
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.887 ± 0.066
HisCys: 0.268 ± 0.029
HisAsp: 1.176 ± 0.059
HisGlu: 1.232 ± 0.059
HisPhe: 0.797 ± 0.043
HisGly: 1.763 ± 0.067
HisHis: 0.536 ± 0.04
HisIle: 1.162 ± 0.049
HisLys: 0.667 ± 0.037
HisLeu: 1.964 ± 0.073
HisMet: 0.579 ± 0.039
HisAsn: 0.559 ± 0.038
HisPro: 1.133 ± 0.05
HisGln: 0.592 ± 0.033
HisArg: 1.313 ± 0.055
HisSer: 1.068 ± 0.053
HisThr: 1.099 ± 0.055
HisVal: 1.772 ± 0.066
HisTrp: 0.286 ± 0.029
HisTyr: 0.518 ± 0.037
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.923 ± 0.115
IleCys: 0.554 ± 0.039
IleAsp: 3.68 ± 0.106
IleGlu: 3.432 ± 0.085
IlePhe: 1.66 ± 0.069
IleGly: 4.7 ± 0.117
IleHis: 0.901 ± 0.045
IleIle: 3.158 ± 0.112
IleLys: 1.962 ± 0.072
IleLeu: 5.466 ± 0.141
IleMet: 1.39 ± 0.061
IleAsn: 1.597 ± 0.064
IlePro: 2.831 ± 0.094
IleGln: 1.592 ± 0.072
IleArg: 3.376 ± 0.09
IleSer: 3.453 ± 0.105
IleThr: 3.266 ± 0.106
IleVal: 5.583 ± 0.123
IleTrp: 0.529 ± 0.039
IleTyr: 1.311 ± 0.054
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.962 ± 0.114
LysCys: 0.304 ± 0.029
LysAsp: 2.27 ± 0.081
LysGlu: 2.626 ± 0.109
LysPhe: 1.079 ± 0.054
LysGly: 3.308 ± 0.088
LysHis: 0.7 ± 0.042
LysIle: 1.917 ± 0.082
LysLys: 2.088 ± 0.083
LysLeu: 3.649 ± 0.106
LysMet: 1.117 ± 0.055
LysAsn: 1.272 ± 0.062
LysPro: 1.896 ± 0.063
LysGln: 1.475 ± 0.062
LysArg: 2.358 ± 0.07
LysSer: 2.387 ± 0.072
LysThr: 2.685 ± 0.086
LysVal: 3.281 ± 0.084
LysTrp: 0.383 ± 0.032
LysTyr: 1.218 ± 0.057
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.651 ± 0.175
LeuCys: 1.009 ± 0.057
LeuAsp: 5.752 ± 0.126
LeuGlu: 5.594 ± 0.146
LeuPhe: 3.775 ± 0.106
LeuGly: 8.054 ± 0.143
LeuHis: 2.047 ± 0.067
LeuIle: 4.822 ± 0.115
LeuLys: 4.034 ± 0.108
LeuLeu: 10.284 ± 0.193
LeuMet: 2.462 ± 0.083
LeuAsn: 2.809 ± 0.081
LeuPro: 4.887 ± 0.106
LeuGln: 3.401 ± 0.096
LeuArg: 6.27 ± 0.142
LeuSer: 6.869 ± 0.135
LeuThr: 6.198 ± 0.138
LeuVal: 8.709 ± 0.177
LeuTrp: 1.061 ± 0.054
LeuTyr: 2.55 ± 0.086
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.595 ± 0.086
MetCys: 0.198 ± 0.022
MetAsp: 1.504 ± 0.058
MetGlu: 1.5 ± 0.063
MetPhe: 0.923 ± 0.055
MetGly: 2.0 ± 0.077
MetHis: 0.511 ± 0.034
MetIle: 1.387 ± 0.057
MetLys: 1.318 ± 0.058
MetLeu: 2.563 ± 0.08
MetMet: 0.633 ± 0.045
MetAsn: 1.072 ± 0.05
MetPro: 1.187 ± 0.051
MetGln: 0.894 ± 0.044
MetArg: 1.613 ± 0.067
MetSer: 1.797 ± 0.064
MetThr: 1.917 ± 0.068
MetVal: 2.095 ± 0.064
MetTrp: 0.182 ± 0.02
MetTyr: 0.577 ± 0.036
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.669 ± 0.084
AsnCys: 0.322 ± 0.027
AsnAsp: 1.405 ± 0.061
AsnGlu: 1.459 ± 0.065
AsnPhe: 0.887 ± 0.041
AsnGly: 2.477 ± 0.087
AsnHis: 0.5 ± 0.034
AsnIle: 1.664 ± 0.063
AsnLys: 1.065 ± 0.056
AsnLeu: 2.973 ± 0.073
AsnMet: 0.878 ± 0.048
AsnAsn: 0.759 ± 0.047
AsnPro: 1.874 ± 0.078
AsnGln: 0.8 ± 0.04
AsnArg: 1.547 ± 0.061
AsnSer: 1.489 ± 0.066
AsnThr: 1.554 ± 0.071
AsnVal: 2.538 ± 0.082
AsnTrp: 0.426 ± 0.035
AsnTyr: 0.831 ± 0.042
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.455 ± 0.125
ProCys: 0.417 ± 0.029
ProAsp: 2.689 ± 0.079
ProGlu: 3.234 ± 0.091
ProPhe: 1.851 ± 0.071
ProGly: 3.797 ± 0.09
ProHis: 0.917 ± 0.048
ProIle: 2.34 ± 0.077
ProLys: 1.306 ± 0.054
ProLeu: 3.998 ± 0.102
ProMet: 1.187 ± 0.057
ProAsn: 1.146 ± 0.049
ProPro: 1.579 ± 0.065
ProGln: 1.408 ± 0.065
ProArg: 2.363 ± 0.073
ProSer: 2.984 ± 0.099
ProThr: 2.61 ± 0.076
ProVal: 4.324 ± 0.113
ProTrp: 0.563 ± 0.037
ProTyr: 1.367 ± 0.062
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.263 ± 0.082
GlnCys: 0.288 ± 0.026
GlnAsp: 1.579 ± 0.06
GlnGlu: 2.092 ± 0.074
GlnPhe: 1.227 ± 0.054
GlnGly: 2.635 ± 0.082
GlnHis: 0.712 ± 0.039
GlnIle: 1.615 ± 0.066
GlnLys: 1.522 ± 0.058
GlnLeu: 3.435 ± 0.111
GlnMet: 0.86 ± 0.045
GlnAsn: 0.921 ± 0.05
GlnPro: 1.489 ± 0.064
GlnGln: 1.374 ± 0.067
GlnArg: 2.013 ± 0.07
GlnSer: 1.881 ± 0.064
GlnThr: 1.995 ± 0.073
GlnVal: 2.617 ± 0.081
GlnTrp: 0.356 ± 0.033
GlnTyr: 1.027 ± 0.052
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.022 ± 0.121
ArgCys: 0.637 ± 0.036
ArgAsp: 3.358 ± 0.1
ArgGlu: 4.018 ± 0.097
ArgPhe: 2.457 ± 0.075
ArgGly: 4.065 ± 0.096
ArgHis: 1.329 ± 0.053
ArgIle: 3.827 ± 0.089
ArgLys: 2.763 ± 0.08
ArgLeu: 6.331 ± 0.132
ArgMet: 1.667 ± 0.059
ArgAsn: 1.804 ± 0.064
ArgPro: 2.302 ± 0.087
ArgGln: 2.185 ± 0.081
ArgArg: 4.198 ± 0.122
ArgSer: 3.763 ± 0.1
ArgThr: 3.84 ± 0.097
ArgVal: 4.741 ± 0.104
ArgTrp: 0.642 ± 0.04
ArgTyr: 1.754 ± 0.06
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.522 ± 0.127
SerCys: 0.68 ± 0.043
SerAsp: 3.234 ± 0.086
SerGlu: 3.04 ± 0.098
SerPhe: 2.691 ± 0.085
SerGly: 5.784 ± 0.151
SerHis: 1.167 ± 0.05
SerIle: 3.856 ± 0.11
SerLys: 2.239 ± 0.083
SerLeu: 6.444 ± 0.146
SerMet: 1.829 ± 0.07
SerAsn: 1.579 ± 0.076
SerPro: 2.779 ± 0.087
SerGln: 1.962 ± 0.068
SerArg: 3.736 ± 0.099
SerSer: 4.52 ± 0.135
SerThr: 3.777 ± 0.127
SerVal: 5.489 ± 0.118
SerTrp: 0.77 ± 0.051
SerTyr: 1.766 ± 0.067
SerXaa: 0.002 ± 0.002
Thr
ThrAla: 6.054 ± 0.155
ThrCys: 0.581 ± 0.039
ThrAsp: 3.198 ± 0.098
ThrGlu: 3.02 ± 0.082
ThrPhe: 2.304 ± 0.091
ThrGly: 5.462 ± 0.129
ThrHis: 1.14 ± 0.056
ThrIle: 3.817 ± 0.121
ThrLys: 2.135 ± 0.073
ThrLeu: 6.016 ± 0.122
ThrMet: 1.727 ± 0.069
ThrAsn: 1.538 ± 0.067
ThrPro: 3.311 ± 0.095
ThrGln: 1.806 ± 0.064
ThrArg: 3.396 ± 0.085
ThrSer: 3.955 ± 0.124
ThrThr: 4.056 ± 0.131
ThrVal: 5.912 ± 0.147
ThrTrp: 0.804 ± 0.061
ThrTyr: 1.754 ± 0.082
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.754 ± 0.146
ValCys: 1.018 ± 0.046
ValAsp: 4.935 ± 0.108
ValGlu: 4.894 ± 0.12
ValPhe: 3.466 ± 0.098
ValGly: 6.344 ± 0.123
ValHis: 1.73 ± 0.066
ValIle: 4.554 ± 0.11
ValLys: 3.07 ± 0.088
ValLeu: 9.254 ± 0.157
ValMet: 2.088 ± 0.077
ValAsn: 2.365 ± 0.079
ValPro: 4.054 ± 0.098
ValGln: 2.896 ± 0.097
ValArg: 5.191 ± 0.109
ValSer: 5.594 ± 0.114
ValThr: 5.779 ± 0.153
ValVal: 8.036 ± 0.163
ValTrp: 0.885 ± 0.042
ValTyr: 2.106 ± 0.063
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.876 ± 0.043
TrpCys: 0.119 ± 0.017
TrpAsp: 0.536 ± 0.035
TrpGlu: 0.559 ± 0.04
TrpPhe: 0.52 ± 0.038
TrpGly: 0.759 ± 0.048
TrpHis: 0.248 ± 0.024
TrpIle: 0.624 ± 0.037
TrpLys: 0.509 ± 0.033
TrpLeu: 1.212 ± 0.071
TrpMet: 0.259 ± 0.023
TrpAsn: 0.541 ± 0.042
TrpPro: 0.421 ± 0.036
TrpGln: 0.502 ± 0.037
TrpArg: 0.671 ± 0.038
TrpSer: 0.838 ± 0.049
TrpThr: 0.721 ± 0.06
TrpVal: 0.741 ± 0.045
TrpTrp: 0.203 ± 0.022
TrpTyr: 0.412 ± 0.031
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.716 ± 0.082
TyrCys: 0.32 ± 0.027
TyrAsp: 1.631 ± 0.066
TyrGlu: 1.466 ± 0.065
TyrPhe: 1.032 ± 0.048
TyrGly: 2.372 ± 0.074
TyrHis: 0.714 ± 0.044
TyrIle: 1.486 ± 0.066
TyrLys: 0.885 ± 0.046
TyrLeu: 2.892 ± 0.09
TyrMet: 0.595 ± 0.034
TyrAsn: 0.7 ± 0.041
TyrPro: 1.302 ± 0.052
TyrGln: 0.881 ± 0.042
TyrArg: 1.815 ± 0.062
TyrSer: 1.59 ± 0.07
TyrThr: 1.572 ± 0.074
TyrVal: 2.392 ± 0.084
TyrTrp: 0.336 ± 0.031
TyrTyr: 0.89 ± 0.052
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.002 ± 0.002
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.002 ± 0.002
Statistics based on 1392 proteins (444009 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
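
A protein-level bootstrap of this kind can be sketched as follows. The sketch assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each replicate, and that the reported error is the standard deviation across replicates; the page does not spell out these details, and the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides in a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide's per mille frequency
    over bootstrap replicates drawn at the protein level."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Hypothetical toy sequences, not taken from this proteome.
toy = ["MAAL", "AALK", "GGGG", "AAAA"]
mean_aa, err_aa = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual dipeptides keeps the dipeptides of one protein together, so the error reflects variation between proteins.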

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski