Amino acid dipeptide frequency for methanogenic archaeon ISO4-H5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all over-represented dipeptides are marked in red and all under-represented ones in blue in the table.
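
The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's actual pipeline: the function names `dipeptide_permille` and `classify` are hypothetical, and it assumes that dipeptides are counted as overlapping pairs within each protein and never across protein boundaries.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids
EXPECTED_PERMILLE = 1000.0 / 400        # uniform expectation: 2.5 per mille (0.25%)


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any non-standard or unknown residue is mapped to 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs (0,1), (1,2), ...; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"   # shown in red in the table
    if freq_permille < EXPECTED_PERMILLE:
        return "under"  # shown in blue in the table
    return "expected"


freqs = dipeptide_permille(["AAAC", "AC"])  # toy input, not real proteome data
print(freqs)
print(classify(7.447), classify(0.6))  # AlaAla and AlaTrp values from the table
```

Under this uniform expectation, AlaAla (7.447‰) counts as over-represented and AlaTrp (0.6‰) as under-represented.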

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
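
As a quick worked example of the unit conversion, the sketch below turns a per mille value into a plain fraction and into a percentage. The helper names are hypothetical and exist only for illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_permille / 10.0


ala_ala = 7.447  # AlaAla value from the table, in per mille
print(permille_to_fraction(ala_ala))  # fraction of all dipeptides
print(permille_to_percent(ala_ala))   # the same value as a percentage
```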

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.447 ± 0.203
AlaCys: 1.324 ± 0.047
AlaAsp: 5.352 ± 0.101
AlaGlu: 5.869 ± 0.131
AlaPhe: 3.393 ± 0.11
AlaGly: 6.292 ± 0.128
AlaHis: 1.258 ± 0.052
AlaIle: 5.49 ± 0.123
AlaLys: 4.743 ± 0.128
AlaLeu: 7.029 ± 0.126
AlaMet: 2.64 ± 0.083
AlaAsn: 2.683 ± 0.074
AlaPro: 2.486 ± 0.073
AlaGln: 1.862 ± 0.07
AlaArg: 3.419 ± 0.109
AlaSer: 4.346 ± 0.108
AlaThr: 3.915 ± 0.124
AlaVal: 7.073 ± 0.135
AlaTrp: 0.6 ± 0.038
AlaTyr: 2.782 ± 0.072
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.214 ± 0.046
CysCys: 0.365 ± 0.028
CysAsp: 1.025 ± 0.043
CysGlu: 0.994 ± 0.04
CysPhe: 0.567 ± 0.034
CysGly: 1.834 ± 0.072
CysHis: 0.306 ± 0.025
CysIle: 1.093 ± 0.043
CysLys: 0.816 ± 0.039
CysLeu: 1.227 ± 0.052
CysMet: 0.466 ± 0.027
CysAsn: 0.555 ± 0.03
CysPro: 0.99 ± 0.059
CysGln: 0.386 ± 0.028
CysArg: 0.94 ± 0.045
CysSer: 1.096 ± 0.051
CysThr: 1.009 ± 0.05
CysVal: 1.044 ± 0.04
CysTrp: 0.181 ± 0.018
CysTyr: 0.56 ± 0.034
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.288 ± 0.095
AspCys: 0.98 ± 0.039
AspAsp: 3.57 ± 0.09
AspGlu: 4.425 ± 0.1
AspPhe: 2.514 ± 0.07
AspGly: 5.375 ± 0.146
AspHis: 1.035 ± 0.045
AspIle: 4.851 ± 0.097
AspLys: 3.323 ± 0.084
AspLeu: 5.9 ± 0.112
AspMet: 2.147 ± 0.064
AspAsn: 2.171 ± 0.067
AspPro: 2.979 ± 0.079
AspGln: 1.288 ± 0.05
AspArg: 3.638 ± 0.107
AspSer: 4.252 ± 0.103
AspThr: 3.365 ± 0.076
AspVal: 4.672 ± 0.094
AspTrp: 0.635 ± 0.04
AspTyr: 2.674 ± 0.082
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.357 ± 0.112
GluCys: 0.997 ± 0.042
GluAsp: 4.171 ± 0.084
GluGlu: 5.263 ± 0.158
GluPhe: 2.64 ± 0.063
GluGly: 4.597 ± 0.095
GluHis: 1.275 ± 0.049
GluIle: 4.856 ± 0.096
GluLys: 4.675 ± 0.102
GluLeu: 5.629 ± 0.118
GluMet: 2.304 ± 0.07
GluAsn: 2.906 ± 0.076
GluPro: 2.17 ± 0.062
GluGln: 1.736 ± 0.057
GluArg: 3.304 ± 0.091
GluSer: 3.878 ± 0.09
GluThr: 3.537 ± 0.086
GluVal: 4.326 ± 0.085
GluTrp: 0.698 ± 0.033
GluTyr: 2.655 ± 0.068
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.751 ± 0.085
PheCys: 0.719 ± 0.032
PheAsp: 2.958 ± 0.077
PheGlu: 2.478 ± 0.059
PhePhe: 1.467 ± 0.059
PheGly: 3.25 ± 0.081
PheHis: 0.607 ± 0.034
PheIle: 2.528 ± 0.078
PheLys: 1.902 ± 0.066
PheLeu: 3.074 ± 0.089
PheMet: 1.261 ± 0.056
PheAsn: 1.481 ± 0.055
PhePro: 1.462 ± 0.055
PheGln: 0.842 ± 0.035
PheArg: 1.985 ± 0.063
PheSer: 2.873 ± 0.079
PheThr: 2.526 ± 0.08
PheVal: 2.941 ± 0.076
PheTrp: 0.381 ± 0.029
PheTyr: 1.413 ± 0.062
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.549 ± 0.126
GlyCys: 1.535 ± 0.059
GlyAsp: 4.444 ± 0.107
GlyGlu: 4.32 ± 0.089
GlyPhe: 3.095 ± 0.08
GlyGly: 5.366 ± 0.152
GlyHis: 1.469 ± 0.054
GlyIle: 5.836 ± 0.11
GlyLys: 5.773 ± 0.11
GlyLeu: 6.154 ± 0.126
GlyMet: 2.653 ± 0.079
GlyAsn: 3.275 ± 0.094
GlyPro: 2.185 ± 0.066
GlyGln: 1.698 ± 0.063
GlyArg: 3.784 ± 0.1
GlySer: 5.239 ± 0.148
GlyThr: 5.443 ± 0.17
GlyVal: 5.31 ± 0.105
GlyTrp: 0.947 ± 0.058
GlyTyr: 3.158 ± 0.101
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.152 ± 0.047
HisCys: 0.343 ± 0.024
HisAsp: 1.03 ± 0.045
HisGlu: 0.901 ± 0.04
HisPhe: 0.712 ± 0.037
HisGly: 1.404 ± 0.051
HisHis: 0.358 ± 0.026
HisIle: 1.246 ± 0.053
HisLys: 0.776 ± 0.039
HisLeu: 1.514 ± 0.057
HisMet: 0.586 ± 0.032
HisAsn: 0.63 ± 0.033
HisPro: 0.983 ± 0.038
HisGln: 0.365 ± 0.027
HisArg: 0.926 ± 0.05
HisSer: 1.044 ± 0.045
HisThr: 0.919 ± 0.04
HisVal: 1.218 ± 0.046
HisTrp: 0.181 ± 0.017
HisTyr: 0.607 ± 0.031
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.782 ± 0.121
IleCys: 1.197 ± 0.046
IleAsp: 4.627 ± 0.091
IleGlu: 4.186 ± 0.096
IlePhe: 2.257 ± 0.072
IleGly: 5.7 ± 0.121
IleHis: 1.216 ± 0.049
IleIle: 4.67 ± 0.132
IleLys: 3.24 ± 0.082
IleLeu: 6.038 ± 0.142
IleMet: 1.879 ± 0.052
IleAsn: 2.439 ± 0.071
IlePro: 3.433 ± 0.092
IleGln: 1.524 ± 0.051
IleArg: 3.661 ± 0.092
IleSer: 4.894 ± 0.109
IleThr: 4.279 ± 0.114
IleVal: 5.298 ± 0.112
IleTrp: 0.56 ± 0.037
IleTyr: 2.077 ± 0.065
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.227 ± 0.118
LysCys: 0.884 ± 0.049
LysAsp: 4.219 ± 0.1
LysGlu: 4.606 ± 0.106
LysPhe: 1.783 ± 0.059
LysGly: 4.179 ± 0.087
LysHis: 0.933 ± 0.039
LysIle: 4.245 ± 0.096
LysLys: 4.454 ± 0.122
LysLeu: 4.576 ± 0.1
LysMet: 1.738 ± 0.062
LysAsn: 2.547 ± 0.068
LysPro: 1.879 ± 0.064
LysGln: 1.542 ± 0.054
LysArg: 2.669 ± 0.087
LysSer: 3.299 ± 0.077
LysThr: 3.555 ± 0.085
LysVal: 4.275 ± 0.099
LysTrp: 0.635 ± 0.038
LysTyr: 2.234 ± 0.064
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.641 ± 0.163
LeuCys: 1.411 ± 0.052
LeuAsp: 5.397 ± 0.112
LeuGlu: 5.321 ± 0.103
LeuPhe: 3.292 ± 0.105
LeuGly: 6.447 ± 0.104
LeuHis: 1.251 ± 0.046
LeuIle: 5.46 ± 0.122
LeuLys: 5.121 ± 0.104
LeuLeu: 6.6 ± 0.162
LeuMet: 2.827 ± 0.086
LeuAsn: 3.306 ± 0.079
LeuPro: 3.346 ± 0.075
LeuGln: 1.905 ± 0.057
LeuArg: 4.204 ± 0.102
LeuSer: 5.951 ± 0.116
LeuThr: 4.945 ± 0.097
LeuVal: 5.82 ± 0.128
LeuTrp: 0.663 ± 0.038
LeuTyr: 2.739 ± 0.079
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.867 ± 0.076
MetCys: 0.466 ± 0.03
MetAsp: 2.267 ± 0.077
MetGlu: 2.112 ± 0.062
MetPhe: 1.134 ± 0.045
MetGly: 2.37 ± 0.07
MetHis: 0.532 ± 0.031
MetIle: 2.001 ± 0.067
MetLys: 2.058 ± 0.069
MetLeu: 2.417 ± 0.079
MetMet: 1.164 ± 0.051
MetAsn: 1.279 ± 0.051
MetPro: 1.199 ± 0.047
MetGln: 0.786 ± 0.041
MetArg: 1.528 ± 0.057
MetSer: 2.297 ± 0.069
MetThr: 1.924 ± 0.057
MetVal: 2.271 ± 0.074
MetTrp: 0.24 ± 0.02
MetTyr: 1.091 ± 0.049
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.048 ± 0.095
AsnCys: 0.668 ± 0.037
AsnAsp: 2.257 ± 0.061
AsnGlu: 2.22 ± 0.061
AsnPhe: 1.154 ± 0.05
AsnGly: 3.753 ± 0.105
AsnHis: 0.731 ± 0.038
AsnIle: 2.841 ± 0.076
AsnLys: 1.895 ± 0.064
AsnLeu: 3.283 ± 0.079
AsnMet: 1.169 ± 0.044
AsnAsn: 1.559 ± 0.061
AsnPro: 2.008 ± 0.058
AsnGln: 0.84 ± 0.049
AsnArg: 1.99 ± 0.056
AsnSer: 2.281 ± 0.078
AsnThr: 2.177 ± 0.077
AsnVal: 2.918 ± 0.082
AsnTrp: 0.355 ± 0.026
AsnTyr: 1.456 ± 0.06
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.198 ± 0.084
ProCys: 0.51 ± 0.032
ProAsp: 2.902 ± 0.077
ProGlu: 3.616 ± 0.102
ProPhe: 1.58 ± 0.062
ProGly: 2.897 ± 0.068
ProHis: 0.699 ± 0.04
ProIle: 2.253 ± 0.067
ProLys: 2.144 ± 0.07
ProLeu: 3.007 ± 0.079
ProMet: 1.152 ± 0.043
ProAsn: 1.373 ± 0.046
ProPro: 1.202 ± 0.057
ProGln: 0.997 ± 0.046
ProArg: 1.67 ± 0.065
ProSer: 2.499 ± 0.077
ProThr: 1.982 ± 0.067
ProVal: 3.374 ± 0.073
ProTrp: 0.338 ± 0.026
ProTyr: 1.515 ± 0.054
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.937 ± 0.06
GlnCys: 0.374 ± 0.025
GlnAsp: 1.314 ± 0.045
GlnGlu: 1.542 ± 0.053
GlnPhe: 0.931 ± 0.041
GlnGly: 1.528 ± 0.057
GlnHis: 0.371 ± 0.023
GlnIle: 1.677 ± 0.05
GlnLys: 1.648 ± 0.054
GlnLeu: 1.822 ± 0.05
GlnMet: 1.047 ± 0.04
GlnAsn: 1.002 ± 0.042
GlnPro: 0.773 ± 0.035
GlnGln: 0.628 ± 0.037
GlnArg: 1.234 ± 0.049
GlnSer: 1.425 ± 0.055
GlnThr: 1.296 ± 0.053
GlnVal: 1.521 ± 0.05
GlnTrp: 0.28 ± 0.025
GlnTyr: 1.108 ± 0.051
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.353 ± 0.075
ArgCys: 0.865 ± 0.049
ArgAsp: 3.186 ± 0.113
ArgGlu: 3.496 ± 0.099
ArgPhe: 2.262 ± 0.073
ArgGly: 3.036 ± 0.077
ArgHis: 0.837 ± 0.034
ArgIle: 3.715 ± 0.099
ArgLys: 3.476 ± 0.095
ArgLeu: 3.932 ± 0.107
ArgMet: 1.877 ± 0.067
ArgAsn: 2.253 ± 0.064
ArgPro: 1.783 ± 0.072
ArgGln: 1.265 ± 0.045
ArgArg: 2.878 ± 0.108
ArgSer: 3.04 ± 0.089
ArgThr: 2.662 ± 0.067
ArgVal: 2.96 ± 0.082
ArgTrp: 0.473 ± 0.03
ArgTyr: 2.109 ± 0.064
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.178 ± 0.113
SerCys: 0.927 ± 0.047
SerAsp: 4.472 ± 0.096
SerGlu: 4.538 ± 0.1
SerPhe: 2.702 ± 0.084
SerGly: 5.55 ± 0.149
SerHis: 1.018 ± 0.045
SerIle: 4.346 ± 0.109
SerLys: 3.659 ± 0.095
SerLeu: 5.274 ± 0.112
SerMet: 1.971 ± 0.063
SerAsn: 2.33 ± 0.067
SerPro: 2.215 ± 0.063
SerGln: 1.609 ± 0.053
SerArg: 3.003 ± 0.085
SerSer: 4.223 ± 0.123
SerThr: 3.33 ± 0.093
SerVal: 5.448 ± 0.133
SerTrp: 0.68 ± 0.039
SerTyr: 2.245 ± 0.073
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.736 ± 0.121
ThrCys: 0.846 ± 0.045
ThrAsp: 3.887 ± 0.095
ThrGlu: 3.885 ± 0.094
ThrPhe: 2.453 ± 0.086
ThrGly: 4.926 ± 0.113
ThrHis: 0.931 ± 0.04
ThrIle: 3.931 ± 0.134
ThrLys: 2.726 ± 0.077
ThrLeu: 4.82 ± 0.122
ThrMet: 1.528 ± 0.057
ThrAsn: 1.923 ± 0.059
ThrPro: 2.603 ± 0.071
ThrGln: 1.308 ± 0.055
ThrArg: 2.121 ± 0.06
ThrSer: 3.449 ± 0.106
ThrThr: 3.132 ± 0.107
ThrVal: 6.034 ± 0.188
ThrTrp: 0.498 ± 0.03
ThrTyr: 2.372 ± 0.124
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.693 ± 0.117
ValCys: 1.333 ± 0.047
ValAsp: 4.696 ± 0.106
ValGlu: 4.426 ± 0.106
ValPhe: 3.221 ± 0.089
ValGly: 5.037 ± 0.115
ValHis: 1.22 ± 0.045
ValIle: 5.155 ± 0.097
ValLys: 4.667 ± 0.085
ValLeu: 6.372 ± 0.129
ValMet: 2.271 ± 0.079
ValAsn: 2.756 ± 0.08
ValPro: 3.53 ± 0.076
ValGln: 1.662 ± 0.05
ValArg: 3.931 ± 0.092
ValSer: 5.305 ± 0.118
ValThr: 5.013 ± 0.161
ValVal: 5.103 ± 0.114
ValTrp: 0.642 ± 0.035
ValTyr: 2.758 ± 0.1
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.71 ± 0.042
TrpCys: 0.165 ± 0.016
TrpAsp: 0.644 ± 0.038
TrpGlu: 0.614 ± 0.033
TrpPhe: 0.437 ± 0.031
TrpGly: 0.585 ± 0.036
TrpHis: 0.167 ± 0.018
TrpIle: 0.647 ± 0.031
TrpLys: 0.654 ± 0.035
TrpLeu: 0.712 ± 0.037
TrpMet: 0.378 ± 0.027
TrpAsn: 0.552 ± 0.038
TrpPro: 0.228 ± 0.021
TrpGln: 0.252 ± 0.02
TrpArg: 0.402 ± 0.027
TrpSer: 0.564 ± 0.034
TrpThr: 0.585 ± 0.036
TrpVal: 0.68 ± 0.034
TrpTrp: 0.127 ± 0.017
TrpTyr: 0.416 ± 0.033
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.878 ± 0.079
TyrCys: 0.759 ± 0.037
TyrAsp: 2.669 ± 0.086
TyrGlu: 2.161 ± 0.062
TyrPhe: 1.31 ± 0.047
TyrGly: 3.104 ± 0.088
TyrHis: 0.68 ± 0.037
TyrIle: 2.121 ± 0.069
TyrLys: 1.726 ± 0.059
TyrLeu: 3.262 ± 0.076
TyrMet: 0.941 ± 0.048
TyrAsn: 1.587 ± 0.063
TyrPro: 1.456 ± 0.055
TyrGln: 0.955 ± 0.041
TyrArg: 2.173 ± 0.065
TyrSer: 2.765 ± 0.103
TyrThr: 2.512 ± 0.114
TyrVal: 2.502 ± 0.078
TyrTrp: 0.405 ± 0.028
TyrTyr: 1.536 ± 0.063
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1805 proteins (574731 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
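
A protein-level bootstrap of this kind can be sketched as follows. This is an illustration only: the page does not say which statistic the ± value reports, so the sketch assumes it is the standard deviation across resamples, and the function names `pair_permille` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins):
    """Frequency of each overlapping residue pair, in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of one dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample).get(pair, 0.0))
    # Assumption: the reported error is the standard deviation of the estimates.
    return statistics.stdev(estimates)


toy = ["AAAC", "ACAC", "CCCA"]  # toy sequences, not real proteome data
print(bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.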

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski