Amino acid dipepetide frequency for Reinekea thalattae

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.663AlaAla: 8.663 ± 0.141
0.91AlaCys: 0.91 ± 0.034
5.351AlaAsp: 5.351 ± 0.085
6.905AlaGlu: 6.905 ± 0.094
3.364AlaPhe: 3.364 ± 0.064
6.15AlaGly: 6.15 ± 0.095
1.706AlaHis: 1.706 ± 0.046
6.343AlaIle: 6.343 ± 0.093
4.793AlaLys: 4.793 ± 0.089
10.481AlaLeu: 10.481 ± 0.138
2.624AlaMet: 2.624 ± 0.065
3.592AlaAsn: 3.592 ± 0.062
2.787AlaPro: 2.787 ± 0.064
4.513AlaGln: 4.513 ± 0.089
3.761AlaArg: 3.761 ± 0.071
5.989AlaSer: 5.989 ± 0.093
4.544AlaThr: 4.544 ± 0.073
6.272AlaVal: 6.272 ± 0.09
1.059AlaTrp: 1.059 ± 0.038
2.545AlaTyr: 2.545 ± 0.061
0.0AlaXaa: 0.0 ± 0.0
Cys
0.816CysAla: 0.816 ± 0.03
0.151CysCys: 0.151 ± 0.014
0.537CysAsp: 0.537 ± 0.027
0.52CysGlu: 0.52 ± 0.027
0.448CysPhe: 0.448 ± 0.026
0.811CysGly: 0.811 ± 0.034
0.254CysHis: 0.254 ± 0.019
0.642CysIle: 0.642 ± 0.026
0.384CysLys: 0.384 ± 0.026
0.94CysLeu: 0.94 ± 0.037
0.189CysMet: 0.189 ± 0.015
0.318CysAsn: 0.318 ± 0.021
0.42CysPro: 0.42 ± 0.027
0.541CysGln: 0.541 ± 0.025
0.437CysArg: 0.437 ± 0.025
0.69CysSer: 0.69 ± 0.033
0.435CysThr: 0.435 ± 0.022
0.576CysVal: 0.576 ± 0.026
0.123CysTrp: 0.123 ± 0.012
0.309CysTyr: 0.309 ± 0.018
0.0CysXaa: 0.0 ± 0.0
Asp
5.017AspAla: 5.017 ± 0.091
0.53AspCys: 0.53 ± 0.029
3.544AspAsp: 3.544 ± 0.086
4.042AspGlu: 4.042 ± 0.076
2.479AspPhe: 2.479 ± 0.056
3.593AspGly: 3.593 ± 0.08
1.14AspHis: 1.14 ± 0.042
4.267AspIle: 4.267 ± 0.077
2.54AspLys: 2.54 ± 0.062
5.459AspLeu: 5.459 ± 0.106
1.369AspMet: 1.369 ± 0.04
2.219AspAsn: 2.219 ± 0.058
1.973AspPro: 1.973 ± 0.053
2.661AspGln: 2.661 ± 0.054
2.51AspArg: 2.51 ± 0.057
3.72AspSer: 3.72 ± 0.068
2.525AspThr: 2.525 ± 0.061
3.893AspVal: 3.893 ± 0.069
0.949AspTrp: 0.949 ± 0.031
2.193AspTyr: 2.193 ± 0.05
0.0AspXaa: 0.0 ± 0.0
Glu
5.773GluAla: 5.773 ± 0.08
0.499GluCys: 0.499 ± 0.025
2.923GluAsp: 2.923 ± 0.058
3.357GluGlu: 3.357 ± 0.081
2.503GluPhe: 2.503 ± 0.065
3.465GluGly: 3.465 ± 0.068
1.647GluHis: 1.647 ± 0.047
3.661GluIle: 3.661 ± 0.066
3.429GluLys: 3.429 ± 0.072
7.415GluLeu: 7.415 ± 0.107
1.511GluMet: 1.511 ± 0.051
2.64GluAsn: 2.64 ± 0.062
2.407GluPro: 2.407 ± 0.078
5.083GluGln: 5.083 ± 0.091
3.48GluArg: 3.48 ± 0.074
3.911GluSer: 3.911 ± 0.078
3.223GluThr: 3.223 ± 0.069
4.315GluVal: 4.315 ± 0.081
0.593GluTrp: 0.593 ± 0.027
1.733GluTyr: 1.733 ± 0.052
0.0GluXaa: 0.0 ± 0.0
Phe
3.418PheAla: 3.418 ± 0.069
0.49PheCys: 0.49 ± 0.026
2.784PheAsp: 2.784 ± 0.06
2.59PheGlu: 2.59 ± 0.057
1.746PhePhe: 1.746 ± 0.059
3.055PheGly: 3.055 ± 0.069
0.764PheHis: 0.764 ± 0.029
2.897PheIle: 2.897 ± 0.069
1.88PheLys: 1.88 ± 0.046
3.633PheLeu: 3.633 ± 0.082
0.946PheMet: 0.946 ± 0.035
2.025PheAsn: 2.025 ± 0.057
1.242PhePro: 1.242 ± 0.038
1.437PheGln: 1.437 ± 0.041
1.47PheArg: 1.47 ± 0.041
3.39PheSer: 3.39 ± 0.066
2.062PheThr: 2.062 ± 0.047
2.712PheVal: 2.712 ± 0.057
0.543PheTrp: 0.543 ± 0.026
1.346PheTyr: 1.346 ± 0.04
0.0PheXaa: 0.0 ± 0.0
Gly
5.291GlyAla: 5.291 ± 0.101
0.791GlyCys: 0.791 ± 0.032
3.814GlyAsp: 3.814 ± 0.076
4.147GlyGlu: 4.147 ± 0.076
3.306GlyPhe: 3.306 ± 0.066
4.405GlyGly: 4.405 ± 0.096
1.471GlyHis: 1.471 ± 0.044
4.469GlyIle: 4.469 ± 0.082
3.396GlyLys: 3.396 ± 0.069
6.986GlyLeu: 6.986 ± 0.098
1.813GlyMet: 1.813 ± 0.048
2.301GlyAsn: 2.301 ± 0.059
1.7GlyPro: 1.7 ± 0.042
2.768GlyGln: 2.768 ± 0.054
3.122GlyArg: 3.122 ± 0.056
4.316GlySer: 4.316 ± 0.071
3.219GlyThr: 3.219 ± 0.075
5.004GlyVal: 5.004 ± 0.08
0.903GlyTrp: 0.903 ± 0.033
2.516GlyTyr: 2.516 ± 0.055
0.0GlyXaa: 0.0 ± 0.0
His
1.586HisAla: 1.586 ± 0.045
0.312HisCys: 0.312 ± 0.018
1.09HisAsp: 1.09 ± 0.044
1.173HisGlu: 1.173 ± 0.041
0.99HisPhe: 0.99 ± 0.038
1.402HisGly: 1.402 ± 0.045
0.635HisHis: 0.635 ± 0.04
1.314HisIle: 1.314 ± 0.041
0.916HisLys: 0.916 ± 0.036
2.216HisLeu: 2.216 ± 0.055
0.465HisMet: 0.465 ± 0.019
0.819HisAsn: 0.819 ± 0.033
1.056HisPro: 1.056 ± 0.036
1.216HisGln: 1.216 ± 0.038
1.087HisArg: 1.087 ± 0.039
1.52HisSer: 1.52 ± 0.046
0.969HisThr: 0.969 ± 0.034
1.071HisVal: 1.071 ± 0.036
0.436HisTrp: 0.436 ± 0.024
0.854HisTyr: 0.854 ± 0.035
0.0HisXaa: 0.0 ± 0.0
Ile
6.761IleAla: 6.761 ± 0.116
0.663IleCys: 0.663 ± 0.028
4.51IleAsp: 4.51 ± 0.086
5.169IleGlu: 5.169 ± 0.088
2.174IlePhe: 2.174 ± 0.051
4.66IleGly: 4.66 ± 0.085
1.253IleHis: 1.253 ± 0.04
3.744IleIle: 3.744 ± 0.083
3.132IleLys: 3.132 ± 0.067
5.404IleLeu: 5.404 ± 0.099
1.189IleMet: 1.189 ± 0.037
2.948IleAsn: 2.948 ± 0.067
2.486IlePro: 2.486 ± 0.06
2.407IleGln: 2.407 ± 0.058
2.888IleArg: 2.888 ± 0.05
4.677IleSer: 4.677 ± 0.084
3.496IleThr: 3.496 ± 0.071
4.161IleVal: 4.161 ± 0.071
0.649IleTrp: 0.649 ± 0.031
1.72IleTyr: 1.72 ± 0.046
0.0IleXaa: 0.0 ± 0.0
Lys
4.665LysAla: 4.665 ± 0.086
0.231LysCys: 0.231 ± 0.018
2.56LysAsp: 2.56 ± 0.062
3.001LysGlu: 3.001 ± 0.066
1.307LysPhe: 1.307 ± 0.037
2.87LysGly: 2.87 ± 0.068
1.064LysHis: 1.064 ± 0.036
2.895LysIle: 2.895 ± 0.063
2.871LysLys: 2.871 ± 0.07
4.724LysLeu: 4.724 ± 0.078
1.107LysMet: 1.107 ± 0.037
2.107LysAsn: 2.107 ± 0.049
2.416LysPro: 2.416 ± 0.057
2.807LysGln: 2.807 ± 0.065
2.539LysArg: 2.539 ± 0.065
2.924LysSer: 2.924 ± 0.062
2.981LysThr: 2.981 ± 0.059
3.225LysVal: 3.225 ± 0.065
0.421LysTrp: 0.421 ± 0.021
1.207LysTyr: 1.207 ± 0.048
0.0LysXaa: 0.0 ± 0.0
Leu
11.05LeuAla: 11.05 ± 0.132
1.007LeuCys: 1.007 ± 0.033
5.907LeuAsp: 5.907 ± 0.091
6.164LeuGlu: 6.164 ± 0.105
4.377LeuPhe: 4.377 ± 0.086
6.761LeuGly: 6.761 ± 0.091
1.885LeuHis: 1.885 ± 0.044
6.739LeuIle: 6.739 ± 0.101
5.342LeuLys: 5.342 ± 0.079
11.354LeuLeu: 11.354 ± 0.184
2.741LeuMet: 2.741 ± 0.059
4.735LeuAsn: 4.735 ± 0.078
4.713LeuPro: 4.713 ± 0.073
4.32LeuGln: 4.32 ± 0.073
4.284LeuArg: 4.284 ± 0.073
8.268LeuSer: 8.268 ± 0.09
6.149LeuThr: 6.149 ± 0.085
7.263LeuVal: 7.263 ± 0.118
1.138LeuTrp: 1.138 ± 0.045
2.854LeuTyr: 2.854 ± 0.056
0.0LeuXaa: 0.0 ± 0.0
Met
2.592MetAla: 2.592 ± 0.056
0.144MetCys: 0.144 ± 0.014
1.131MetAsp: 1.131 ± 0.037
1.157MetGlu: 1.157 ± 0.04
0.768MetPhe: 0.768 ± 0.031
1.488MetGly: 1.488 ± 0.044
0.482MetHis: 0.482 ± 0.022
1.495MetIle: 1.495 ± 0.046
1.268MetLys: 1.268 ± 0.035
2.55MetLeu: 2.55 ± 0.065
0.65MetMet: 0.65 ± 0.033
1.189MetAsn: 1.189 ± 0.037
1.115MetPro: 1.115 ± 0.034
1.203MetGln: 1.203 ± 0.039
1.101MetArg: 1.101 ± 0.038
1.879MetSer: 1.879 ± 0.046
1.643MetThr: 1.643 ± 0.043
1.708MetVal: 1.708 ± 0.047
0.193MetTrp: 0.193 ± 0.014
0.46MetTyr: 0.46 ± 0.02
0.0MetXaa: 0.0 ± 0.0
Asn
3.677AsnAla: 3.677 ± 0.07
0.39AsnCys: 0.39 ± 0.022
2.337AsnAsp: 2.337 ± 0.062
2.519AsnGlu: 2.519 ± 0.052
1.409AsnPhe: 1.409 ± 0.042
2.771AsnGly: 2.771 ± 0.062
0.84AsnHis: 0.84 ± 0.029
2.7AsnIle: 2.7 ± 0.055
2.017AsnLys: 2.017 ± 0.048
3.942AsnLeu: 3.942 ± 0.068
0.909AsnMet: 0.909 ± 0.031
1.835AsnAsn: 1.835 ± 0.052
1.969AsnPro: 1.969 ± 0.045
2.154AsnGln: 2.154 ± 0.05
1.987AsnArg: 1.987 ± 0.046
2.871AsnSer: 2.871 ± 0.071
2.077AsnThr: 2.077 ± 0.062
2.356AsnVal: 2.356 ± 0.056
0.58AsnTrp: 0.58 ± 0.024
1.419AsnTyr: 1.419 ± 0.037
0.0AsnXaa: 0.0 ± 0.0
Pro
3.376ProAla: 3.376 ± 0.073
0.262ProCys: 0.262 ± 0.017
2.333ProAsp: 2.333 ± 0.059
3.375ProGlu: 3.375 ± 0.08
1.702ProPhe: 1.702 ± 0.051
2.329ProGly: 2.329 ± 0.059
0.758ProHis: 0.758 ± 0.031
2.515ProIle: 2.515 ± 0.061
1.868ProLys: 1.868 ± 0.046
3.942ProLeu: 3.942 ± 0.068
1.004ProMet: 1.004 ± 0.034
1.741ProAsn: 1.741 ± 0.044
1.083ProPro: 1.083 ± 0.041
1.397ProGln: 1.397 ± 0.041
1.267ProArg: 1.267 ± 0.039
2.673ProSer: 2.673 ± 0.06
2.07ProThr: 2.07 ± 0.048
2.952ProVal: 2.952 ± 0.057
0.525ProTrp: 0.525 ± 0.027
1.149ProTyr: 1.149 ± 0.036
0.0ProXaa: 0.0 ± 0.0
Gln
5.207GlnAla: 5.207 ± 0.099
0.45GlnCys: 0.45 ± 0.024
1.991GlnAsp: 1.991 ± 0.045
2.375GlnGlu: 2.375 ± 0.05
1.798GlnPhe: 1.798 ± 0.044
3.226GlnGly: 3.226 ± 0.056
1.244GlnHis: 1.244 ± 0.039
2.587GlnIle: 2.587 ± 0.054
1.997GlnLys: 1.997 ± 0.05
6.323GlnLeu: 6.323 ± 0.101
1.148GlnMet: 1.148 ± 0.037
1.629GlnAsn: 1.629 ± 0.041
2.197GlnPro: 2.197 ± 0.059
4.291GlnGln: 4.291 ± 0.108
2.742GlnArg: 2.742 ± 0.058
3.24GlnSer: 3.24 ± 0.069
2.514GlnThr: 2.514 ± 0.059
3.172GlnVal: 3.172 ± 0.071
0.867GlnTrp: 0.867 ± 0.03
1.42GlnTyr: 1.42 ± 0.04
0.0GlnXaa: 0.0 ± 0.0
Arg
3.607ArgAla: 3.607 ± 0.063
0.491ArgCys: 0.491 ± 0.028
2.257ArgAsp: 2.257 ± 0.055
2.731ArgGlu: 2.731 ± 0.062
2.273ArgPhe: 2.273 ± 0.056
2.547ArgGly: 2.547 ± 0.059
1.063ArgHis: 1.063 ± 0.035
3.09ArgIle: 3.09 ± 0.055
2.245ArgLys: 2.245 ± 0.053
5.445ArgLeu: 5.445 ± 0.095
1.193ArgMet: 1.193 ± 0.042
1.743ArgAsn: 1.743 ± 0.044
1.646ArgPro: 1.646 ± 0.047
2.452ArgGln: 2.452 ± 0.05
2.299ArgArg: 2.299 ± 0.06
2.919ArgSer: 2.919 ± 0.062
2.038ArgThr: 2.038 ± 0.043
3.16ArgVal: 3.16 ± 0.067
0.709ArgTrp: 0.709 ± 0.028
1.815ArgTyr: 1.815 ± 0.048
0.0ArgXaa: 0.0 ± 0.0
Ser
6.37SerAla: 6.37 ± 0.09
0.614SerCys: 0.614 ± 0.03
4.084SerAsp: 4.084 ± 0.075
4.399SerGlu: 4.399 ± 0.074
3.004SerPhe: 3.004 ± 0.066
4.862SerGly: 4.862 ± 0.082
1.554SerHis: 1.554 ± 0.052
4.573SerIle: 4.573 ± 0.08
2.926SerLys: 2.926 ± 0.063
7.674SerLeu: 7.674 ± 0.091
1.727SerMet: 1.727 ± 0.041
2.725SerAsn: 2.725 ± 0.062
2.263SerPro: 2.263 ± 0.046
3.153SerGln: 3.153 ± 0.064
3.08SerArg: 3.08 ± 0.061
4.912SerSer: 4.912 ± 0.109
3.472SerThr: 3.472 ± 0.068
4.93SerVal: 4.93 ± 0.082
0.844SerTrp: 0.844 ± 0.033
2.151SerTyr: 2.151 ± 0.049
0.0SerXaa: 0.0 ± 0.0
Thr
4.723ThrAla: 4.723 ± 0.087
0.367ThrCys: 0.367 ± 0.017
3.11ThrAsp: 3.11 ± 0.06
3.579ThrGlu: 3.579 ± 0.069
1.859ThrPhe: 1.859 ± 0.045
4.005ThrGly: 4.005 ± 0.079
1.091ThrHis: 1.091 ± 0.034
3.232ThrIle: 3.232 ± 0.073
2.116ThrLys: 2.116 ± 0.053
6.236ThrLeu: 6.236 ± 0.083
1.018ThrMet: 1.018 ± 0.037
1.931ThrAsn: 1.931 ± 0.06
2.473ThrPro: 2.473 ± 0.057
2.362ThrGln: 2.362 ± 0.058
2.084ThrArg: 2.084 ± 0.057
3.298ThrSer: 3.298 ± 0.072
2.955ThrThr: 2.955 ± 0.078
3.65ThrVal: 3.65 ± 0.057
0.573ThrTrp: 0.573 ± 0.029
1.25ThrTyr: 1.25 ± 0.045
0.0ThrXaa: 0.0 ± 0.0
Val
6.477ValAla: 6.477 ± 0.107
0.745ValCys: 0.745 ± 0.033
4.214ValAsp: 4.214 ± 0.08
4.498ValGlu: 4.498 ± 0.081
2.84ValPhe: 2.84 ± 0.061
4.536ValGly: 4.536 ± 0.075
1.297ValHis: 1.297 ± 0.04
4.624ValIle: 4.624 ± 0.075
3.226ValLys: 3.226 ± 0.061
6.84ValLeu: 6.84 ± 0.099
1.758ValMet: 1.758 ± 0.047
2.707ValAsn: 2.707 ± 0.055
2.523ValPro: 2.523 ± 0.054
2.46ValGln: 2.46 ± 0.058
3.018ValArg: 3.018 ± 0.066
5.099ValSer: 5.099 ± 0.077
3.789ValThr: 3.789 ± 0.071
5.22ValVal: 5.22 ± 0.088
0.694ValTrp: 0.694 ± 0.026
1.753ValTyr: 1.753 ± 0.047
0.0ValXaa: 0.0 ± 0.0
Trp
0.909TrpAla: 0.909 ± 0.037
0.132TrpCys: 0.132 ± 0.012
0.521TrpAsp: 0.521 ± 0.029
0.369TrpGlu: 0.369 ± 0.022
0.61TrpPhe: 0.61 ± 0.025
0.654TrpGly: 0.654 ± 0.028
0.334TrpHis: 0.334 ± 0.02
0.623TrpIle: 0.623 ± 0.026
0.402TrpLys: 0.402 ± 0.023
2.06TrpLeu: 2.06 ± 0.064
0.281TrpMet: 0.281 ± 0.019
0.444TrpAsn: 0.444 ± 0.025
0.573TrpPro: 0.573 ± 0.027
1.253TrpGln: 1.253 ± 0.042
0.682TrpArg: 0.682 ± 0.026
0.751TrpSer: 0.751 ± 0.032
0.441TrpThr: 0.441 ± 0.022
0.855TrpVal: 0.855 ± 0.033
0.19TrpTrp: 0.19 ± 0.016
0.396TrpTyr: 0.396 ± 0.023
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.377TyrAla: 2.377 ± 0.053
0.362TyrCys: 0.362 ± 0.023
1.626TyrAsp: 1.626 ± 0.044
1.497TyrGlu: 1.497 ± 0.043
1.471TyrPhe: 1.471 ± 0.048
2.147TyrGly: 2.147 ± 0.049
0.648TyrHis: 0.648 ± 0.027
1.68TyrIle: 1.68 ± 0.054
1.177TyrLys: 1.177 ± 0.039
3.426TyrLeu: 3.426 ± 0.067
0.574TyrMet: 0.574 ± 0.027
1.112TyrAsn: 1.112 ± 0.034
1.279TyrPro: 1.279 ± 0.044
1.94TyrGln: 1.94 ± 0.061
1.881TyrArg: 1.881 ± 0.049
2.27TyrSer: 2.27 ± 0.061
1.307TyrThr: 1.307 ± 0.041
1.824TyrVal: 1.824 ± 0.046
0.492TyrTrp: 0.492 ± 0.024
1.003TyrTyr: 1.003 ± 0.043
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2651 proteins (861729 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski