Amino acid dipepetide frequency for Helicobacter sp. 13S00401-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.804AlaAla: 3.804 ± 0.145
0.959AlaCys: 0.959 ± 0.054
2.562AlaAsp: 2.562 ± 0.071
2.339AlaGlu: 2.339 ± 0.063
3.638AlaPhe: 3.638 ± 0.094
4.119AlaGly: 4.119 ± 0.114
1.192AlaHis: 1.192 ± 0.059
5.455AlaIle: 5.455 ± 0.129
6.855AlaLys: 6.855 ± 0.143
9.848AlaLeu: 9.848 ± 0.136
1.684AlaMet: 1.684 ± 0.059
4.45AlaAsn: 4.45 ± 0.104
2.033AlaPro: 2.033 ± 0.065
2.231AlaGln: 2.231 ± 0.084
3.113AlaArg: 3.113 ± 0.081
6.593AlaSer: 6.593 ± 0.107
3.609AlaThr: 3.609 ± 0.107
3.299AlaVal: 3.299 ± 0.1
0.466AlaTrp: 0.466 ± 0.031
2.728AlaTyr: 2.728 ± 0.073
0.0AlaXaa: 0.0 ± 0.0
Cys
0.783CysAla: 0.783 ± 0.041
0.089CysCys: 0.089 ± 0.014
0.552CysAsp: 0.552 ± 0.035
0.653CysGlu: 0.653 ± 0.035
0.525CysPhe: 0.525 ± 0.034
0.751CysGly: 0.751 ± 0.047
0.219CysHis: 0.219 ± 0.023
0.874CysIle: 0.874 ± 0.039
0.797CysLys: 0.797 ± 0.049
0.899CysLeu: 0.899 ± 0.041
0.279CysMet: 0.279 ± 0.022
0.456CysAsn: 0.456 ± 0.03
0.285CysPro: 0.285 ± 0.026
0.171CysGln: 0.171 ± 0.021
0.216CysArg: 0.216 ± 0.02
0.589CysSer: 0.589 ± 0.032
0.404CysThr: 0.404 ± 0.027
0.785CysVal: 0.785 ± 0.039
0.042CysTrp: 0.042 ± 0.009
0.31CysTyr: 0.31 ± 0.023
0.0CysXaa: 0.0 ± 0.0
Asp
3.852AspAla: 3.852 ± 0.097
0.485AspCys: 0.485 ± 0.028
2.522AspAsp: 2.522 ± 0.068
3.586AspGlu: 3.586 ± 0.096
3.02AspPhe: 3.02 ± 0.082
3.018AspGly: 3.018 ± 0.099
0.618AspHis: 0.618 ± 0.034
5.636AspIle: 5.636 ± 0.118
4.984AspLys: 4.984 ± 0.098
5.965AspLeu: 5.965 ± 0.127
1.526AspMet: 1.526 ± 0.063
2.341AspAsn: 2.341 ± 0.075
1.544AspPro: 1.544 ± 0.06
0.626AspGln: 0.626 ± 0.036
1.765AspArg: 1.765 ± 0.071
5.155AspSer: 5.155 ± 0.119
2.991AspThr: 2.991 ± 0.086
3.38AspVal: 3.38 ± 0.079
0.266AspTrp: 0.266 ± 0.024
2.254AspTyr: 2.254 ± 0.066
0.0AspXaa: 0.0 ± 0.0
Glu
4.601GluAla: 4.601 ± 0.102
0.749GluCys: 0.749 ± 0.043
3.326GluAsp: 3.326 ± 0.109
3.994GluGlu: 3.994 ± 0.12
2.67GluPhe: 2.67 ± 0.085
3.729GluGly: 3.729 ± 0.085
1.089GluHis: 1.089 ± 0.048
4.485GluIle: 4.485 ± 0.097
4.695GluLys: 4.695 ± 0.124
6.429GluLeu: 6.429 ± 0.131
1.178GluMet: 1.178 ± 0.052
2.847GluAsn: 2.847 ± 0.077
1.251GluPro: 1.251 ± 0.053
1.448GluGln: 1.448 ± 0.055
2.2GluArg: 2.2 ± 0.076
4.71GluSer: 4.71 ± 0.112
2.312GluThr: 2.312 ± 0.069
4.658GluVal: 4.658 ± 0.12
0.408GluTrp: 0.408 ± 0.03
2.085GluTyr: 2.085 ± 0.059
0.0GluXaa: 0.0 ± 0.0
Phe
3.032PheAla: 3.032 ± 0.072
0.61PheCys: 0.61 ± 0.042
2.733PheAsp: 2.733 ± 0.075
2.639PheGlu: 2.639 ± 0.078
2.647PhePhe: 2.647 ± 0.097
3.44PheGly: 3.44 ± 0.088
0.741PheHis: 0.741 ± 0.043
4.462PheIle: 4.462 ± 0.137
4.525PheLys: 4.525 ± 0.113
5.361PheLeu: 5.361 ± 0.129
1.263PheMet: 1.263 ± 0.048
2.997PheAsn: 2.997 ± 0.079
1.165PhePro: 1.165 ± 0.053
0.986PheGln: 0.986 ± 0.045
1.455PheArg: 1.455 ± 0.052
4.179PheSer: 4.179 ± 0.092
2.212PheThr: 2.212 ± 0.072
2.735PheVal: 2.735 ± 0.09
0.391PheTrp: 0.391 ± 0.029
2.204PheTyr: 2.204 ± 0.075
0.0PheXaa: 0.0 ± 0.0
Gly
4.604GlyAla: 4.604 ± 0.126
0.597GlyCys: 0.597 ± 0.039
3.02GlyAsp: 3.02 ± 0.094
3.359GlyGlu: 3.359 ± 0.089
4.037GlyPhe: 4.037 ± 0.082
4.585GlyGly: 4.585 ± 0.134
1.12GlyHis: 1.12 ± 0.053
5.565GlyIle: 5.565 ± 0.129
4.522GlyLys: 4.522 ± 0.098
6.477GlyLeu: 6.477 ± 0.127
1.553GlyMet: 1.553 ± 0.067
2.641GlyAsn: 2.641 ± 0.086
1.039GlyPro: 1.039 ± 0.044
1.319GlyGln: 1.319 ± 0.055
2.152GlyArg: 2.152 ± 0.062
4.406GlySer: 4.406 ± 0.104
2.77GlyThr: 2.77 ± 0.085
4.83GlyVal: 4.83 ± 0.112
0.458GlyTrp: 0.458 ± 0.032
2.764GlyTyr: 2.764 ± 0.073
0.0GlyXaa: 0.0 ± 0.0
His
1.253HisAla: 1.253 ± 0.052
0.148HisCys: 0.148 ± 0.016
0.716HisAsp: 0.716 ± 0.038
0.857HisGlu: 0.857 ± 0.04
1.049HisPhe: 1.049 ± 0.044
1.124HisGly: 1.124 ± 0.046
0.383HisHis: 0.383 ± 0.03
1.596HisIle: 1.596 ± 0.053
1.426HisLys: 1.426 ± 0.05
1.663HisLeu: 1.663 ± 0.059
0.529HisMet: 0.529 ± 0.032
0.855HisAsn: 0.855 ± 0.037
0.739HisPro: 0.739 ± 0.039
0.437HisGln: 0.437 ± 0.03
0.547HisArg: 0.547 ± 0.036
1.199HisSer: 1.199 ± 0.052
0.962HisThr: 0.962 ± 0.042
0.905HisVal: 0.905 ± 0.043
0.125HisTrp: 0.125 ± 0.018
0.716HisTyr: 0.716 ± 0.034
0.0HisXaa: 0.0 ± 0.0
Ile
6.51IleAla: 6.51 ± 0.137
0.812IleCys: 0.812 ± 0.046
4.987IleAsp: 4.987 ± 0.115
4.77IleGlu: 4.77 ± 0.11
3.911IlePhe: 3.911 ± 0.118
5.143IleGly: 5.143 ± 0.119
1.08IleHis: 1.08 ± 0.043
5.848IleIle: 5.848 ± 0.155
7.157IleLys: 7.157 ± 0.114
8.185IleLeu: 8.185 ± 0.141
1.534IleMet: 1.534 ± 0.056
4.493IleAsn: 4.493 ± 0.104
2.456IlePro: 2.456 ± 0.076
1.632IleGln: 1.632 ± 0.059
2.389IleArg: 2.389 ± 0.066
5.934IleSer: 5.934 ± 0.121
4.073IleThr: 4.073 ± 0.102
5.134IleVal: 5.134 ± 0.124
0.535IleTrp: 0.535 ± 0.037
3.122IleTyr: 3.122 ± 0.085
0.0IleXaa: 0.0 ± 0.0
Lys
6.901LysAla: 6.901 ± 0.113
0.57LysCys: 0.57 ± 0.033
7.832LysAsp: 7.832 ± 0.16
7.628LysGlu: 7.628 ± 0.178
2.693LysPhe: 2.693 ± 0.074
4.753LysGly: 4.753 ± 0.095
1.675LysHis: 1.675 ± 0.052
6.244LysIle: 6.244 ± 0.104
7.166LysLys: 7.166 ± 0.145
8.194LysLeu: 8.194 ± 0.14
1.919LysMet: 1.919 ± 0.059
4.934LysAsn: 4.934 ± 0.105
2.887LysPro: 2.887 ± 0.086
2.616LysGln: 2.616 ± 0.079
2.974LysArg: 2.974 ± 0.083
5.873LysSer: 5.873 ± 0.11
4.608LysThr: 4.608 ± 0.098
5.478LysVal: 5.478 ± 0.12
0.456LysTrp: 0.456 ± 0.03
2.704LysTyr: 2.704 ± 0.088
0.0LysXaa: 0.0 ± 0.0
Leu
8.148LeuAla: 8.148 ± 0.144
1.095LeuCys: 1.095 ± 0.049
7.315LeuAsp: 7.315 ± 0.138
8.079LeuGlu: 8.079 ± 0.176
4.323LeuPhe: 4.323 ± 0.102
7.37LeuGly: 7.37 ± 0.15
1.869LeuHis: 1.869 ± 0.062
6.893LeuIle: 6.893 ± 0.139
11.245LeuLys: 11.245 ± 0.187
10.177LeuLeu: 10.177 ± 0.189
2.108LeuMet: 2.108 ± 0.073
6.214LeuAsn: 6.214 ± 0.139
3.773LeuPro: 3.773 ± 0.084
3.449LeuGln: 3.449 ± 0.097
3.883LeuArg: 3.883 ± 0.086
9.216LeuSer: 9.216 ± 0.157
4.458LeuThr: 4.458 ± 0.099
6.319LeuVal: 6.319 ± 0.128
0.587LeuTrp: 0.587 ± 0.031
3.309LeuTyr: 3.309 ± 0.079
0.0LeuXaa: 0.0 ± 0.0
Met
1.611MetAla: 1.611 ± 0.064
0.237MetCys: 0.237 ± 0.023
1.313MetAsp: 1.313 ± 0.057
1.226MetGlu: 1.226 ± 0.05
0.945MetPhe: 0.945 ± 0.05
1.351MetGly: 1.351 ± 0.059
0.56MetHis: 0.56 ± 0.04
1.617MetIle: 1.617 ± 0.055
1.952MetLys: 1.952 ± 0.057
2.735MetLeu: 2.735 ± 0.073
0.472MetMet: 0.472 ± 0.034
0.947MetAsn: 0.947 ± 0.04
1.209MetPro: 1.209 ± 0.053
1.207MetGln: 1.207 ± 0.048
0.885MetArg: 0.885 ± 0.045
1.553MetSer: 1.553 ± 0.06
0.86MetThr: 0.86 ± 0.041
1.147MetVal: 1.147 ± 0.046
0.154MetTrp: 0.154 ± 0.018
0.593MetTyr: 0.593 ± 0.038
0.0MetXaa: 0.0 ± 0.0
Asn
4.385AsnAla: 4.385 ± 0.117
0.331AsnCys: 0.331 ± 0.029
2.289AsnAsp: 2.289 ± 0.083
2.818AsnGlu: 2.818 ± 0.087
2.837AsnPhe: 2.837 ± 0.088
2.822AsnGly: 2.822 ± 0.08
0.822AsnHis: 0.822 ± 0.041
5.451AsnIle: 5.451 ± 0.123
3.894AsnLys: 3.894 ± 0.098
6.756AsnLeu: 6.756 ± 0.18
1.272AsnMet: 1.272 ± 0.052
2.572AsnAsn: 2.572 ± 0.112
2.387AsnPro: 2.387 ± 0.076
1.344AsnGln: 1.344 ± 0.059
1.319AsnArg: 1.319 ± 0.052
3.582AsnSer: 3.582 ± 0.118
3.082AsnThr: 3.082 ± 0.103
2.918AsnVal: 2.918 ± 0.08
0.218AsnTrp: 0.218 ± 0.02
2.221AsnTyr: 2.221 ± 0.072
0.0AsnXaa: 0.0 ± 0.0
Pro
1.615ProAla: 1.615 ± 0.055
0.269ProCys: 0.269 ± 0.026
1.322ProAsp: 1.322 ± 0.051
1.517ProGlu: 1.517 ± 0.048
1.792ProPhe: 1.792 ± 0.06
1.397ProGly: 1.397 ± 0.055
0.708ProHis: 0.708 ± 0.04
2.441ProIle: 2.441 ± 0.076
3.005ProLys: 3.005 ± 0.09
3.946ProLeu: 3.946 ± 0.093
0.668ProMet: 0.668 ± 0.037
2.073ProAsn: 2.073 ± 0.075
0.92ProPro: 0.92 ± 0.043
0.866ProGln: 0.866 ± 0.039
1.009ProArg: 1.009 ± 0.045
2.793ProSer: 2.793 ± 0.077
2.012ProThr: 2.012 ± 0.064
1.852ProVal: 1.852 ± 0.067
0.229ProTrp: 0.229 ± 0.023
1.349ProTyr: 1.349 ± 0.058
0.0ProXaa: 0.0 ± 0.0
Gln
2.439GlnAla: 2.439 ± 0.102
0.144GlnCys: 0.144 ± 0.018
2.06GlnAsp: 2.06 ± 0.068
1.79GlnGlu: 1.79 ± 0.068
0.782GlnPhe: 0.782 ± 0.04
1.667GlnGly: 1.667 ± 0.064
0.35GlnHis: 0.35 ± 0.022
1.884GlnIle: 1.884 ± 0.06
2.668GlnLys: 2.668 ± 0.084
1.809GlnLeu: 1.809 ± 0.074
0.487GlnMet: 0.487 ± 0.033
1.954GlnAsn: 1.954 ± 0.061
0.624GlnPro: 0.624 ± 0.035
0.649GlnGln: 0.649 ± 0.039
1.003GlnArg: 1.003 ± 0.048
2.071GlnSer: 2.071 ± 0.066
1.617GlnThr: 1.617 ± 0.057
1.594GlnVal: 1.594 ± 0.049
0.166GlnTrp: 0.166 ± 0.018
0.678GlnTyr: 0.678 ± 0.036
0.0GlnXaa: 0.0 ± 0.0
Arg
2.41ArgAla: 2.41 ± 0.065
0.296ArgCys: 0.296 ± 0.025
1.877ArgAsp: 1.877 ± 0.071
2.094ArgGlu: 2.094 ± 0.065
2.087ArgPhe: 2.087 ± 0.072
1.902ArgGly: 1.902 ± 0.055
0.647ArgHis: 0.647 ± 0.036
2.681ArgIle: 2.681 ± 0.076
2.304ArgLys: 2.304 ± 0.069
4.231ArgLeu: 4.231 ± 0.112
0.835ArgMet: 0.835 ± 0.046
1.58ArgAsn: 1.58 ± 0.056
1.105ArgPro: 1.105 ± 0.052
0.893ArgGln: 0.893 ± 0.039
1.332ArgArg: 1.332 ± 0.055
2.029ArgSer: 2.029 ± 0.067
1.263ArgThr: 1.263 ± 0.056
2.304ArgVal: 2.304 ± 0.072
0.214ArgTrp: 0.214 ± 0.021
1.65ArgTyr: 1.65 ± 0.059
0.0ArgXaa: 0.0 ± 0.0
Ser
4.753SerAla: 4.753 ± 0.097
0.599SerCys: 0.599 ± 0.037
3.469SerAsp: 3.469 ± 0.087
3.7SerGlu: 3.7 ± 0.085
4.878SerPhe: 4.878 ± 0.107
4.626SerGly: 4.626 ± 0.099
1.27SerHis: 1.27 ± 0.051
6.387SerIle: 6.387 ± 0.129
7.782SerLys: 7.782 ± 0.133
9.74SerLeu: 9.74 ± 0.179
1.975SerMet: 1.975 ± 0.067
3.969SerAsn: 3.969 ± 0.103
2.391SerPro: 2.391 ± 0.069
2.071SerGln: 2.071 ± 0.062
2.148SerArg: 2.148 ± 0.061
5.773SerSer: 5.773 ± 0.148
4.142SerThr: 4.142 ± 0.105
4.135SerVal: 4.135 ± 0.107
0.385SerTrp: 0.385 ± 0.027
3.124SerTyr: 3.124 ± 0.077
0.0SerXaa: 0.0 ± 0.0
Thr
2.597ThrAla: 2.597 ± 0.076
0.474ThrCys: 0.474 ± 0.038
1.948ThrAsp: 1.948 ± 0.069
1.68ThrGlu: 1.68 ± 0.062
2.377ThrPhe: 2.377 ± 0.078
2.749ThrGly: 2.749 ± 0.075
1.105ThrHis: 1.105 ± 0.044
3.521ThrIle: 3.521 ± 0.086
4.843ThrLys: 4.843 ± 0.111
6.296ThrLeu: 6.296 ± 0.132
0.897ThrMet: 0.897 ± 0.039
3.095ThrAsn: 3.095 ± 0.121
2.568ThrPro: 2.568 ± 0.078
1.705ThrGln: 1.705 ± 0.061
1.711ThrArg: 1.711 ± 0.066
4.152ThrSer: 4.152 ± 0.12
3.145ThrThr: 3.145 ± 0.11
2.013ThrVal: 2.013 ± 0.071
0.293ThrTrp: 0.293 ± 0.025
1.933ThrTyr: 1.933 ± 0.072
0.0ThrXaa: 0.0 ± 0.0
Val
4.529ValAla: 4.529 ± 0.101
0.789ValCys: 0.789 ± 0.043
3.344ValAsp: 3.344 ± 0.088
3.126ValGlu: 3.126 ± 0.083
3.13ValPhe: 3.13 ± 0.086
4.514ValGly: 4.514 ± 0.106
0.986ValHis: 0.986 ± 0.045
5.034ValIle: 5.034 ± 0.116
4.181ValLys: 4.181 ± 0.105
6.876ValLeu: 6.876 ± 0.146
1.322ValMet: 1.322 ± 0.051
2.818ValAsn: 2.818 ± 0.073
1.963ValPro: 1.963 ± 0.065
1.465ValGln: 1.465 ± 0.06
2.152ValArg: 2.152 ± 0.071
4.778ValSer: 4.778 ± 0.099
2.554ValThr: 2.554 ± 0.067
4.593ValVal: 4.593 ± 0.118
0.354ValTrp: 0.354 ± 0.033
2.092ValTyr: 2.092 ± 0.064
0.0ValXaa: 0.0 ± 0.0
Trp
0.375TrpAla: 0.375 ± 0.03
0.073TrpCys: 0.073 ± 0.012
0.333TrpAsp: 0.333 ± 0.026
0.358TrpGlu: 0.358 ± 0.028
0.26TrpPhe: 0.26 ± 0.026
0.452TrpGly: 0.452 ± 0.03
0.183TrpHis: 0.183 ± 0.018
0.549TrpIle: 0.549 ± 0.031
0.348TrpLys: 0.348 ± 0.027
0.745TrpLeu: 0.745 ± 0.042
0.2TrpMet: 0.2 ± 0.022
0.258TrpAsn: 0.258 ± 0.022
0.142TrpPro: 0.142 ± 0.016
0.264TrpGln: 0.264 ± 0.026
0.264TrpArg: 0.264 ± 0.021
0.346TrpSer: 0.346 ± 0.025
0.173TrpThr: 0.173 ± 0.019
0.422TrpVal: 0.422 ± 0.031
0.083TrpTrp: 0.083 ± 0.015
0.233TrpTyr: 0.233 ± 0.02
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.774TyrAla: 2.774 ± 0.064
0.354TyrCys: 0.354 ± 0.025
2.013TyrAsp: 2.013 ± 0.064
2.653TyrGlu: 2.653 ± 0.081
2.175TyrPhe: 2.175 ± 0.072
2.306TyrGly: 2.306 ± 0.07
0.612TyrHis: 0.612 ± 0.034
3.084TyrIle: 3.084 ± 0.098
3.757TyrLys: 3.757 ± 0.097
3.498TyrLeu: 3.498 ± 0.096
0.87TyrMet: 0.87 ± 0.047
1.827TyrAsn: 1.827 ± 0.069
1.303TyrPro: 1.303 ± 0.054
1.057TyrGln: 1.057 ± 0.048
1.218TyrArg: 1.218 ± 0.052
2.383TyrSer: 2.383 ± 0.072
1.854TyrThr: 1.854 ± 0.064
2.098TyrVal: 2.098 ± 0.067
0.233TyrTrp: 0.233 ± 0.022
1.486TyrTyr: 1.486 ± 0.061
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1584 proteins (519509 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski