Amino acid dipeptide frequency for Halovenus aranensis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
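The comparison against the uniform expectation can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names and the toy sequences are hypothetical, pairs are counted as overlapping neighbours within each protein, and any letter outside the 20 standard one-letter codes is mapped to X (Xaa). The database's exact counting rules are not described on this page.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation from the text: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_EXPECTED = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping neighbours: (0,1), (1,2), ... within one protein.
        for a, b in zip(seq, seq[1:]):
            # Assumption: non-standard or unknown residues become X (Xaa).
            a = a if a in STANDARD else "X"
            b = b if b in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > UNIFORM_EXPECTED:
        return "over"    # shown in red on the page
    if freq < UNIFORM_EXPECTED:
        return "under"   # shown in blue on the page
    return "expected"


# Hypothetical toy input, not real proteome data.
toy = ["AAAL", "ALV"]
freqs = dipeptide_per_mille(toy)
```

With real data the input would be the full list of protein sequences; the toy example merely shows the counting and the 2.5‰ threshold.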

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
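As a small worked example of the unit conversion, the snippet below turns two values from the table (AlaAla and CysCys) into plain fractions and into a ratio against the uniform 2.5‰ expectation. The helper names are hypothetical and chosen only for this illustration.

```python
PER_MILLE = 1e-3                    # 1 per mille = 10^-3
UNIFORM_PER_MILLE = 1000.0 / 400    # 2.5 per mille for 400 equally likely dipeptides


def per_mille_to_fraction(value):
    """Convert a per-mille value from the table to a plain fraction."""
    return value * PER_MILLE


def fold_over_uniform(value):
    """Ratio of a per-mille value to the uniform expectation."""
    return value / UNIFORM_PER_MILLE


ala_ala = 11.222   # AlaAla, taken from the table
cys_cys = 0.092    # CysCys, taken from the table

print(per_mille_to_fraction(ala_ala))   # about 0.011222
print(fold_over_uniform(ala_ala))       # about 4.49 times the uniform value
print(fold_over_uniform(cys_cys))       # about 0.037 times the uniform value
```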

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.222 ± 0.148
AlaCys: 0.752 ± 0.032
AlaAsp: 8.565 ± 0.111
AlaGlu: 8.223 ± 0.109
AlaPhe: 3.639 ± 0.065
AlaGly: 8.793 ± 0.106
AlaHis: 1.712 ± 0.048
AlaIle: 4.554 ± 0.069
AlaLys: 1.85 ± 0.041
AlaLeu: 9.762 ± 0.129
AlaMet: 2.009 ± 0.043
AlaAsn: 2.303 ± 0.049
AlaPro: 3.418 ± 0.062
AlaGln: 2.586 ± 0.052
AlaArg: 5.622 ± 0.092
AlaSer: 5.275 ± 0.091
AlaThr: 7.079 ± 0.1
AlaVal: 10.486 ± 0.137
AlaTrp: 1.024 ± 0.029
AlaTyr: 2.483 ± 0.054
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.601 ± 0.029
CysCys: 0.092 ± 0.01
CysAsp: 0.613 ± 0.026
CysGlu: 0.63 ± 0.026
CysPhe: 0.212 ± 0.017
CysGly: 0.854 ± 0.037
CysHis: 0.184 ± 0.012
CysIle: 0.319 ± 0.018
CysLys: 0.125 ± 0.013
CysLeu: 0.678 ± 0.027
CysMet: 0.109 ± 0.011
CysAsn: 0.203 ± 0.014
CysPro: 0.585 ± 0.027
CysGln: 0.225 ± 0.016
CysArg: 0.522 ± 0.026
CysSer: 0.47 ± 0.024
CysThr: 0.425 ± 0.023
CysVal: 0.501 ± 0.021
CysTrp: 0.113 ± 0.01
CysTyr: 0.218 ± 0.015
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.447 ± 0.112
AspCys: 0.72 ± 0.031
AspAsp: 7.603 ± 0.116
AspGlu: 8.157 ± 0.119
AspPhe: 2.018 ± 0.053
AspGly: 7.986 ± 0.129
AspHis: 1.838 ± 0.044
AspIle: 4.36 ± 0.072
AspLys: 1.223 ± 0.04
AspLeu: 6.524 ± 0.092
AspMet: 1.198 ± 0.039
AspAsn: 1.695 ± 0.052
AspPro: 4.034 ± 0.069
AspGln: 2.152 ± 0.051
AspArg: 5.831 ± 0.086
AspSer: 4.588 ± 0.078
AspThr: 4.799 ± 0.075
AspVal: 6.499 ± 0.089
AspTrp: 1.042 ± 0.035
AspTyr: 1.76 ± 0.044
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.392 ± 0.13
GluCys: 0.605 ± 0.026
GluAsp: 5.881 ± 0.099
GluGlu: 7.358 ± 0.131
GluPhe: 3.136 ± 0.062
GluGly: 5.233 ± 0.081
GluHis: 2.079 ± 0.05
GluIle: 4.084 ± 0.089
GluLys: 1.967 ± 0.053
GluLeu: 7.725 ± 0.111
GluMet: 2.165 ± 0.05
GluAsn: 2.667 ± 0.061
GluPro: 3.854 ± 0.062
GluGln: 3.749 ± 0.073
GluArg: 7.198 ± 0.103
GluSer: 5.481 ± 0.078
GluThr: 7.803 ± 0.106
GluVal: 5.77 ± 0.092
GluTrp: 1.137 ± 0.037
GluTyr: 2.899 ± 0.062
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.367 ± 0.074
PheCys: 0.288 ± 0.016
PheAsp: 3.174 ± 0.053
PheGlu: 3.629 ± 0.068
PhePhe: 1.004 ± 0.041
PheGly: 3.267 ± 0.07
PheHis: 0.577 ± 0.021
PheIle: 1.21 ± 0.042
PheLys: 0.513 ± 0.024
PheLeu: 2.947 ± 0.071
PheMet: 0.49 ± 0.023
PheAsn: 0.774 ± 0.026
PhePro: 1.246 ± 0.039
PheGln: 0.912 ± 0.029
PheArg: 1.751 ± 0.044
PheSer: 1.767 ± 0.045
PheThr: 1.914 ± 0.052
PheVal: 3.082 ± 0.058
PheTrp: 0.382 ± 0.022
PheTyr: 0.891 ± 0.03
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.183 ± 0.105
GlyCys: 0.765 ± 0.03
GlyAsp: 6.405 ± 0.099
GlyGlu: 7.068 ± 0.086
GlyPhe: 3.061 ± 0.061
GlyGly: 6.84 ± 0.114
GlyHis: 1.681 ± 0.046
GlyIle: 4.195 ± 0.067
GlyLys: 1.871 ± 0.05
GlyLeu: 7.255 ± 0.109
GlyMet: 1.614 ± 0.042
GlyAsn: 2.0 ± 0.047
GlyPro: 3.154 ± 0.067
GlyGln: 2.369 ± 0.054
GlyArg: 4.355 ± 0.066
GlySer: 4.676 ± 0.086
GlyThr: 5.929 ± 0.084
GlyVal: 7.193 ± 0.097
GlyTrp: 1.07 ± 0.037
GlyTyr: 2.593 ± 0.052
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.912 ± 0.043
HisCys: 0.211 ± 0.013
HisAsp: 1.861 ± 0.046
HisGlu: 1.769 ± 0.053
HisPhe: 0.521 ± 0.027
HisGly: 1.843 ± 0.048
HisHis: 0.53 ± 0.024
HisIle: 0.862 ± 0.032
HisLys: 0.363 ± 0.021
HisLeu: 1.752 ± 0.039
HisMet: 0.303 ± 0.017
HisAsn: 0.576 ± 0.027
HisPro: 1.198 ± 0.034
HisGln: 0.542 ± 0.023
HisArg: 1.274 ± 0.039
HisSer: 0.964 ± 0.031
HisThr: 1.152 ± 0.037
HisVal: 1.77 ± 0.043
HisTrp: 0.246 ± 0.016
HisTyr: 0.562 ± 0.022
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.706 ± 0.069
IleCys: 0.278 ± 0.016
IleAsp: 4.212 ± 0.077
IleGlu: 4.994 ± 0.08
IlePhe: 1.125 ± 0.037
IleGly: 3.519 ± 0.08
IleHis: 0.894 ± 0.033
IleIle: 1.552 ± 0.05
IleLys: 0.818 ± 0.035
IleLeu: 3.25 ± 0.072
IleMet: 0.513 ± 0.026
IleAsn: 1.073 ± 0.037
IlePro: 2.019 ± 0.045
IleGln: 1.372 ± 0.038
IleArg: 2.604 ± 0.048
IleSer: 2.384 ± 0.063
IleThr: 2.536 ± 0.051
IleVal: 3.52 ± 0.072
IleTrp: 0.342 ± 0.019
IleTyr: 0.962 ± 0.03
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.784 ± 0.05
LysCys: 0.136 ± 0.011
LysAsp: 1.059 ± 0.041
LysGlu: 1.506 ± 0.048
LysPhe: 0.529 ± 0.026
LysGly: 1.223 ± 0.045
LysHis: 0.542 ± 0.025
LysIle: 0.902 ± 0.033
LysLys: 0.582 ± 0.034
LysLeu: 1.84 ± 0.05
LysMet: 0.417 ± 0.02
LysAsn: 0.578 ± 0.03
LysPro: 0.974 ± 0.028
LysGln: 1.047 ± 0.034
LysArg: 1.698 ± 0.049
LysSer: 1.272 ± 0.038
LysThr: 1.516 ± 0.038
LysVal: 1.162 ± 0.033
LysTrp: 0.197 ± 0.014
LysTyr: 0.581 ± 0.025
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.436 ± 0.135
LeuCys: 0.703 ± 0.026
LeuAsp: 8.241 ± 0.103
LeuGlu: 6.85 ± 0.11
LeuPhe: 3.241 ± 0.07
LeuGly: 7.657 ± 0.11
LeuHis: 1.494 ± 0.041
LeuIle: 2.818 ± 0.071
LeuLys: 1.665 ± 0.041
LeuLeu: 8.462 ± 0.137
LeuMet: 1.296 ± 0.038
LeuAsn: 1.97 ± 0.05
LeuPro: 3.86 ± 0.07
LeuGln: 2.458 ± 0.054
LeuArg: 5.226 ± 0.082
LeuSer: 5.922 ± 0.096
LeuThr: 5.611 ± 0.085
LeuVal: 8.561 ± 0.12
LeuTrp: 0.903 ± 0.034
LeuTyr: 2.25 ± 0.051
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.756 ± 0.042
MetCys: 0.117 ± 0.01
MetAsp: 1.263 ± 0.037
MetGlu: 1.258 ± 0.038
MetPhe: 0.529 ± 0.023
MetGly: 1.393 ± 0.04
MetHis: 0.377 ± 0.022
MetIle: 0.584 ± 0.024
MetLys: 0.439 ± 0.023
MetLeu: 1.527 ± 0.041
MetMet: 0.315 ± 0.018
MetAsn: 0.602 ± 0.025
MetPro: 0.8 ± 0.028
MetGln: 0.557 ± 0.022
MetArg: 1.052 ± 0.032
MetSer: 1.537 ± 0.046
MetThr: 1.687 ± 0.042
MetVal: 1.433 ± 0.04
MetTrp: 0.155 ± 0.013
MetTyr: 0.406 ± 0.02
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.52 ± 0.057
AsnCys: 0.234 ± 0.017
AsnAsp: 1.76 ± 0.048
AsnGlu: 2.001 ± 0.056
AsnPhe: 0.688 ± 0.029
AsnGly: 2.001 ± 0.054
AsnHis: 0.581 ± 0.024
AsnIle: 1.246 ± 0.044
AsnLys: 0.56 ± 0.03
AsnLeu: 2.044 ± 0.053
AsnMet: 0.443 ± 0.021
AsnAsn: 0.62 ± 0.028
AsnPro: 1.624 ± 0.045
AsnGln: 0.746 ± 0.027
AsnArg: 1.709 ± 0.042
AsnSer: 1.154 ± 0.041
AsnThr: 1.476 ± 0.043
AsnVal: 2.256 ± 0.05
AsnTrp: 0.302 ± 0.019
AsnTyr: 0.684 ± 0.032
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.507 ± 0.078
ProCys: 0.264 ± 0.017
ProAsp: 4.508 ± 0.08
ProGlu: 4.814 ± 0.077
ProPhe: 1.525 ± 0.041
ProGly: 3.462 ± 0.057
ProHis: 0.843 ± 0.03
ProIle: 1.667 ± 0.041
ProLys: 0.843 ± 0.027
ProLeu: 3.576 ± 0.065
ProMet: 0.819 ± 0.029
ProAsn: 1.018 ± 0.029
ProPro: 2.047 ± 0.051
ProGln: 1.163 ± 0.037
ProArg: 2.097 ± 0.044
ProSer: 2.512 ± 0.046
ProThr: 3.276 ± 0.067
ProVal: 4.041 ± 0.068
ProTrp: 0.445 ± 0.022
ProTyr: 1.096 ± 0.038
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.013 ± 0.056
GlnCys: 0.2 ± 0.016
GlnAsp: 1.547 ± 0.045
GlnGlu: 2.279 ± 0.054
GlnPhe: 1.336 ± 0.036
GlnGly: 1.86 ± 0.044
GlnHis: 0.663 ± 0.027
GlnIle: 1.326 ± 0.04
GlnLys: 0.686 ± 0.029
GlnLeu: 2.983 ± 0.058
GlnMet: 0.668 ± 0.027
GlnAsn: 0.873 ± 0.03
GlnPro: 1.396 ± 0.034
GlnGln: 1.52 ± 0.051
GlnArg: 2.557 ± 0.057
GlnSer: 2.064 ± 0.047
GlnThr: 2.436 ± 0.057
GlnVal: 2.26 ± 0.052
GlnTrp: 0.35 ± 0.017
GlnTyr: 0.949 ± 0.035
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.816 ± 0.094
ArgCys: 0.475 ± 0.023
ArgAsp: 4.771 ± 0.075
ArgGlu: 6.537 ± 0.092
ArgPhe: 2.097 ± 0.046
ArgGly: 4.139 ± 0.063
ArgHis: 1.261 ± 0.037
ArgIle: 2.871 ± 0.051
ArgLys: 1.498 ± 0.041
ArgLeu: 5.999 ± 0.091
ArgMet: 1.214 ± 0.036
ArgAsn: 1.668 ± 0.042
ArgPro: 2.482 ± 0.052
ArgGln: 2.392 ± 0.054
ArgArg: 4.624 ± 0.085
ArgSer: 3.191 ± 0.057
ArgThr: 3.76 ± 0.061
ArgVal: 4.931 ± 0.076
ArgTrp: 0.756 ± 0.031
ArgTyr: 1.995 ± 0.047
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.332 ± 0.074
SerCys: 0.368 ± 0.019
SerAsp: 4.411 ± 0.089
SerGlu: 4.77 ± 0.076
SerPhe: 2.05 ± 0.048
SerGly: 4.982 ± 0.082
SerHis: 1.122 ± 0.033
SerIle: 2.419 ± 0.052
SerLys: 1.277 ± 0.039
SerLeu: 5.333 ± 0.082
SerMet: 1.176 ± 0.039
SerAsn: 1.458 ± 0.036
SerPro: 2.615 ± 0.059
SerGln: 1.786 ± 0.047
SerArg: 3.313 ± 0.056
SerSer: 3.029 ± 0.073
SerThr: 3.771 ± 0.067
SerVal: 5.22 ± 0.072
SerTrp: 0.69 ± 0.03
SerTyr: 1.592 ± 0.046
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.633 ± 0.098
ThrCys: 0.378 ± 0.02
ThrAsp: 5.905 ± 0.1
ThrGlu: 5.758 ± 0.083
ThrPhe: 2.173 ± 0.05
ThrGly: 5.843 ± 0.079
ThrHis: 1.322 ± 0.039
ThrIle: 2.962 ± 0.055
ThrLys: 1.119 ± 0.035
ThrLeu: 6.32 ± 0.092
ThrMet: 1.091 ± 0.036
ThrAsn: 1.547 ± 0.044
ThrPro: 3.549 ± 0.064
ThrGln: 1.924 ± 0.055
ThrArg: 3.408 ± 0.057
ThrSer: 3.06 ± 0.055
ThrThr: 4.73 ± 0.081
ThrVal: 7.86 ± 0.114
ThrTrp: 0.637 ± 0.024
ThrTyr: 1.782 ± 0.047
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.428 ± 0.123
ValCys: 0.721 ± 0.025
ValAsp: 7.362 ± 0.096
ValGlu: 8.036 ± 0.107
ValPhe: 3.007 ± 0.061
ValGly: 7.4 ± 0.109
ValHis: 1.614 ± 0.043
ValIle: 3.36 ± 0.061
ValLys: 1.375 ± 0.041
ValLeu: 7.983 ± 0.108
ValMet: 1.309 ± 0.036
ValAsn: 1.947 ± 0.05
ValPro: 3.957 ± 0.066
ValGln: 2.123 ± 0.049
ValArg: 4.828 ± 0.066
ValSer: 5.41 ± 0.078
ValThr: 6.778 ± 0.098
ValVal: 8.826 ± 0.124
ValTrp: 0.811 ± 0.029
ValTyr: 2.186 ± 0.041
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.887 ± 0.032
TrpCys: 0.12 ± 0.01
TrpAsp: 0.743 ± 0.03
TrpGlu: 0.972 ± 0.035
TrpPhe: 0.409 ± 0.022
TrpGly: 0.781 ± 0.029
TrpHis: 0.25 ± 0.016
TrpIle: 0.401 ± 0.022
TrpLys: 0.297 ± 0.019
TrpLeu: 1.222 ± 0.038
TrpMet: 0.239 ± 0.014
TrpAsn: 0.335 ± 0.017
TrpPro: 0.415 ± 0.022
TrpGln: 0.438 ± 0.024
TrpArg: 0.773 ± 0.029
TrpSer: 0.575 ± 0.026
TrpThr: 0.778 ± 0.029
TrpVal: 0.855 ± 0.03
TrpTrp: 0.171 ± 0.015
TrpTyr: 0.412 ± 0.021
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.416 ± 0.052
TyrCys: 0.252 ± 0.016
TyrAsp: 2.592 ± 0.055
TyrGlu: 2.577 ± 0.058
TyrPhe: 0.884 ± 0.034
TyrGly: 2.199 ± 0.051
TyrHis: 0.647 ± 0.028
TyrIle: 0.977 ± 0.035
TyrLys: 0.473 ± 0.025
TyrLeu: 2.444 ± 0.054
TyrMet: 0.399 ± 0.02
TyrAsn: 0.743 ± 0.033
TyrPro: 1.29 ± 0.04
TyrGln: 0.906 ± 0.033
TyrArg: 2.036 ± 0.045
TyrSer: 1.383 ± 0.041
TyrThr: 1.549 ± 0.039
TyrVal: 2.235 ± 0.043
TyrTrp: 0.299 ± 0.018
TyrTyr: 0.839 ± 0.036
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3354 proteins (979672 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
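A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's actual procedure: it assumes that whole proteins are resampled with replacement 100 times and that the reported error is the standard deviation of the resulting frequency estimates. The function names and toy sequences are hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over all adjacent pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy input, not real proteome data.
toy = ["AAAL", "ALV", "GAAV", "LLAA"]
point = pair_per_mille(toy, "AA")
err = bootstrap_error(toy, "AA")
print(f"AlaAla: {point:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual residues keeps the adjacency within each sequence intact, which is presumably why the estimate is done at the protein level.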

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski