Amino acid dipeptide frequency for Halanaerobium salsuginis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
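As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. The function names, the toy sequences, and the use of one-letter codes (the table uses three-letter codes) are assumptions made for this example, and the comparison against the uniform 2.5‰ baseline is only one plausible reading of "more than expected"; the page does not spell out the exact rule used for the colouring.

```python
from collections import Counter

# Uniform expectation for 20 standard amino acids: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / (20 * 20)


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the (assumed) uniform baseline."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


# Hypothetical toy proteome, not taken from Halanaerobium salsuginis.
toy = ["MKLLA", "MLLK"]
freqs = dipeptide_permille(toy)
```

Applied to values from the table, `classify(9.231)` (AlaAla) gives "over" and `classify(0.483)` (AlaCys) gives "under".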

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
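The unit conversion can be sketched as follows, using the AlaAla value from the table as a worked example. The helper names are hypothetical, and the ratio to the uniform 2.5‰ baseline is shown only as one way to read a table value.

```python
PER_MILLE = 1e-3  # 1 per mille = 10^-3


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction."""
    return value_permille * PER_MILLE


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1% = 10 per mille)."""
    return value_permille / 10.0


ala_ala = 9.231                 # AlaAla frequency from the table, in per mille
uniform = 1000.0 / 400          # uniform expectation: 2.5 per mille

fraction = permille_to_fraction(ala_ala)   # about 0.009231
percent = permille_to_percent(ala_ala)     # about 0.9231 %
ratio_to_uniform = ala_ala / uniform       # about 3.69 times the uniform baseline
```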

For more information see sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second; each cell is the frequency ± error in ‰.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 9.231 ± 0.145 | 0.483 ± 0.024 | 5.149 ± 0.086 | 7.335 ± 0.122 | 2.844 ± 0.066 | 6.426 ± 0.101 | 1.067 ± 0.035 | 5.441 ± 0.083 | 5.024 ± 0.091 | 7.166 ± 0.092 | 1.666 ± 0.046 | 3.425 ± 0.067 | 1.724 ± 0.048 | 2.657 ± 0.06 | 3.308 ± 0.065 | 3.816 ± 0.066 | 2.859 ± 0.062 | 6.028 ± 0.09 | 0.51 ± 0.022 | 2.317 ± 0.052 | 0.0 ± 0.0 |
| Cys | 0.386 ± 0.022 | 0.105 ± 0.011 | 0.355 ± 0.02 | 0.387 ± 0.023 | 0.267 ± 0.016 | 0.658 ± 0.029 | 0.162 ± 0.014 | 0.393 ± 0.024 | 0.359 ± 0.021 | 0.674 ± 0.028 | 0.107 ± 0.01 | 0.317 ± 0.018 | 0.401 ± 0.023 | 0.412 ± 0.02 | 0.277 ± 0.021 | 0.469 ± 0.023 | 0.271 ± 0.022 | 0.295 ± 0.018 | 0.069 ± 0.009 | 0.253 ± 0.018 | 0.0 ± 0.0 |
| Asp | 2.388 ± 0.054 | 0.406 ± 0.022 | 2.416 ± 0.067 | 2.968 ± 0.071 | 3.266 ± 0.062 | 2.826 ± 0.06 | 0.919 ± 0.032 | 4.409 ± 0.078 | 4.098 ± 0.082 | 7.093 ± 0.097 | 1.0 ± 0.031 | 2.963 ± 0.058 | 1.931 ± 0.047 | 3.005 ± 0.064 | 2.061 ± 0.057 | 3.192 ± 0.062 | 1.812 ± 0.049 | 2.611 ± 0.06 | 0.61 ± 0.027 | 3.068 ± 0.065 | 0.0 ± 0.0 |
| Glu | 5.027 ± 0.085 | 0.336 ± 0.02 | 3.131 ± 0.065 | 5.124 ± 0.111 | 3.049 ± 0.061 | 2.861 ± 0.073 | 0.981 ± 0.036 | 7.479 ± 0.105 | 6.705 ± 0.098 | 8.51 ± 0.112 | 1.941 ± 0.048 | 4.294 ± 0.081 | 1.569 ± 0.05 | 2.75 ± 0.061 | 2.462 ± 0.058 | 3.309 ± 0.059 | 2.899 ± 0.06 | 4.721 ± 0.093 | 0.469 ± 0.024 | 2.453 ± 0.056 | 0.0 ± 0.0 |
| Phe | 3.54 ± 0.067 | 0.338 ± 0.02 | 2.35 ± 0.051 | 2.479 ± 0.059 | 2.226 ± 0.058 | 2.75 ± 0.068 | 0.581 ± 0.029 | 4.038 ± 0.087 | 3.767 ± 0.067 | 4.813 ± 0.094 | 1.008 ± 0.034 | 2.8 ± 0.066 | 1.406 ± 0.04 | 1.227 ± 0.033 | 1.429 ± 0.043 | 3.527 ± 0.071 | 2.479 ± 0.058 | 2.349 ± 0.059 | 0.428 ± 0.024 | 1.81 ± 0.055 | 0.0 ± 0.0 |
| Gly | 4.291 ± 0.081 | 0.593 ± 0.027 | 3.101 ± 0.068 | 4.118 ± 0.074 | 2.928 ± 0.059 | 4.277 ± 0.071 | 1.112 ± 0.038 | 6.002 ± 0.101 | 4.521 ± 0.079 | 6.397 ± 0.094 | 1.62 ± 0.044 | 2.552 ± 0.052 | 1.656 ± 0.05 | 2.363 ± 0.05 | 2.448 ± 0.056 | 4.033 ± 0.066 | 2.929 ± 0.062 | 4.285 ± 0.072 | 0.638 ± 0.029 | 2.69 ± 0.051 | 0.0 ± 0.0 |
| His | 0.793 ± 0.03 | 0.181 ± 0.013 | 0.779 ± 0.031 | 0.754 ± 0.034 | 0.773 ± 0.029 | 0.977 ± 0.037 | 0.395 ± 0.022 | 1.002 ± 0.033 | 0.964 ± 0.033 | 1.707 ± 0.043 | 0.204 ± 0.017 | 0.904 ± 0.032 | 0.79 ± 0.031 | 0.807 ± 0.031 | 0.624 ± 0.028 | 0.909 ± 0.032 | 0.583 ± 0.028 | 0.681 ± 0.029 | 0.134 ± 0.013 | 0.712 ± 0.027 | 0.0 ± 0.0 |
| Ile | 6.893 ± 0.102 | 0.579 ± 0.024 | 4.878 ± 0.074 | 5.713 ± 0.084 | 4.185 ± 0.092 | 5.15 ± 0.09 | 1.014 ± 0.038 | 8.326 ± 0.127 | 7.352 ± 0.098 | 8.347 ± 0.12 | 1.984 ± 0.051 | 5.236 ± 0.092 | 3.205 ± 0.06 | 2.013 ± 0.051 | 2.798 ± 0.047 | 5.769 ± 0.091 | 4.579 ± 0.067 | 4.74 ± 0.082 | 0.626 ± 0.035 | 3.102 ± 0.062 | 0.0 ± 0.0 |
| Lys | 5.371 ± 0.075 | 0.361 ± 0.024 | 3.778 ± 0.063 | 6.226 ± 0.101 | 3.088 ± 0.06 | 3.446 ± 0.067 | 0.952 ± 0.034 | 7.773 ± 0.095 | 6.986 ± 0.108 | 8.668 ± 0.095 | 1.896 ± 0.048 | 5.25 ± 0.092 | 1.882 ± 0.045 | 2.949 ± 0.062 | 2.734 ± 0.058 | 4.141 ± 0.071 | 3.891 ± 0.07 | 4.668 ± 0.083 | 0.534 ± 0.025 | 3.113 ± 0.07 | 0.0 ± 0.0 |
| Leu | 10.33 ± 0.134 | 0.5 ± 0.027 | 5.446 ± 0.08 | 6.988 ± 0.089 | 4.405 ± 0.095 | 6.423 ± 0.089 | 1.322 ± 0.039 | 9.201 ± 0.128 | 9.3 ± 0.118 | 10.717 ± 0.146 | 2.206 ± 0.052 | 6.321 ± 0.091 | 3.975 ± 0.079 | 3.452 ± 0.066 | 3.459 ± 0.071 | 6.72 ± 0.092 | 6.371 ± 0.086 | 6.182 ± 0.094 | 0.607 ± 0.029 | 3.045 ± 0.071 | 0.0 ± 0.0 |
| Met | 2.005 ± 0.047 | 0.096 ± 0.011 | 1.101 ± 0.037 | 1.358 ± 0.039 | 0.796 ± 0.03 | 1.562 ± 0.046 | 0.287 ± 0.019 | 1.896 ± 0.049 | 1.692 ± 0.04 | 2.202 ± 0.05 | 0.534 ± 0.026 | 1.103 ± 0.038 | 0.856 ± 0.032 | 0.95 ± 0.032 | 0.791 ± 0.032 | 1.372 ± 0.035 | 1.206 ± 0.041 | 1.366 ± 0.043 | 0.138 ± 0.012 | 0.585 ± 0.028 | 0.0 ± 0.0 |
| Asn | 2.367 ± 0.057 | 0.501 ± 0.024 | 2.642 ± 0.055 | 2.925 ± 0.069 | 2.916 ± 0.059 | 2.654 ± 0.063 | 0.805 ± 0.033 | 4.672 ± 0.076 | 4.789 ± 0.09 | 6.816 ± 0.088 | 1.229 ± 0.039 | 3.957 ± 0.09 | 2.177 ± 0.059 | 2.74 ± 0.075 | 1.973 ± 0.052 | 3.92 ± 0.071 | 2.095 ± 0.044 | 2.4 ± 0.059 | 0.653 ± 0.029 | 3.158 ± 0.058 | 0.0 ± 0.0 |
| Pro | 3.298 ± 0.06 | 0.203 ± 0.015 | 2.261 ± 0.055 | 3.275 ± 0.062 | 1.447 ± 0.041 | 2.508 ± 0.057 | 0.554 ± 0.025 | 2.337 ± 0.058 | 1.294 ± 0.037 | 3.15 ± 0.063 | 0.616 ± 0.025 | 1.127 ± 0.036 | 0.834 ± 0.033 | 1.065 ± 0.034 | 1.037 ± 0.037 | 1.318 ± 0.039 | 1.567 ± 0.051 | 2.611 ± 0.062 | 0.33 ± 0.02 | 1.254 ± 0.032 | 0.0 ± 0.0 |
| Gln | 3.49 ± 0.064 | 0.149 ± 0.015 | 1.835 ± 0.046 | 2.995 ± 0.058 | 1.36 ± 0.042 | 2.345 ± 0.058 | 0.575 ± 0.027 | 3.21 ± 0.069 | 3.209 ± 0.066 | 4.595 ± 0.083 | 0.703 ± 0.027 | 2.076 ± 0.05 | 1.208 ± 0.042 | 2.532 ± 0.071 | 1.464 ± 0.049 | 2.185 ± 0.048 | 1.799 ± 0.048 | 2.352 ± 0.057 | 0.216 ± 0.016 | 1.198 ± 0.037 | 0.0 ± 0.0 |
| Arg | 2.558 ± 0.056 | 0.248 ± 0.017 | 1.954 ± 0.047 | 2.916 ± 0.062 | 1.589 ± 0.047 | 2.255 ± 0.059 | 0.596 ± 0.023 | 3.079 ± 0.056 | 2.877 ± 0.06 | 3.802 ± 0.075 | 0.794 ± 0.032 | 1.945 ± 0.048 | 1.135 ± 0.038 | 1.617 ± 0.048 | 1.565 ± 0.05 | 2.072 ± 0.047 | 1.696 ± 0.044 | 2.201 ± 0.054 | 0.297 ± 0.017 | 1.321 ± 0.033 | 0.0 ± 0.0 |
| Ser | 4.394 ± 0.075 | 0.516 ± 0.028 | 3.245 ± 0.071 | 3.963 ± 0.079 | 3.098 ± 0.059 | 4.559 ± 0.074 | 0.945 ± 0.03 | 4.476 ± 0.085 | 4.061 ± 0.07 | 6.486 ± 0.082 | 1.12 ± 0.037 | 3.08 ± 0.065 | 1.835 ± 0.052 | 2.478 ± 0.047 | 2.484 ± 0.057 | 3.971 ± 0.076 | 2.633 ± 0.057 | 3.217 ± 0.055 | 0.661 ± 0.031 | 2.462 ± 0.053 | 0.0 ± 0.0 |
| Thr | 5.088 ± 0.075 | 0.248 ± 0.017 | 2.845 ± 0.057 | 3.558 ± 0.064 | 1.924 ± 0.045 | 4.043 ± 0.073 | 0.695 ± 0.026 | 4.047 ± 0.068 | 3.005 ± 0.049 | 4.042 ± 0.064 | 0.996 ± 0.036 | 2.125 ± 0.048 | 1.73 ± 0.044 | 1.182 ± 0.039 | 1.644 ± 0.041 | 2.364 ± 0.046 | 2.499 ± 0.061 | 3.274 ± 0.064 | 0.32 ± 0.019 | 1.356 ± 0.036 | 0.0 ± 0.0 |
| Val | 4.625 ± 0.073 | 0.399 ± 0.021 | 3.397 ± 0.062 | 4.653 ± 0.081 | 2.655 ± 0.049 | 3.966 ± 0.071 | 0.767 ± 0.032 | 5.581 ± 0.082 | 4.948 ± 0.085 | 5.632 ± 0.08 | 1.53 ± 0.044 | 3.226 ± 0.057 | 1.994 ± 0.045 | 1.559 ± 0.048 | 1.953 ± 0.048 | 3.713 ± 0.071 | 2.763 ± 0.059 | 3.928 ± 0.067 | 0.408 ± 0.021 | 2.025 ± 0.05 | 0.0 ± 0.0 |
| Trp | 0.442 ± 0.022 | 0.044 ± 0.007 | 0.427 ± 0.023 | 0.537 ± 0.023 | 0.35 ± 0.021 | 0.576 ± 0.028 | 0.158 ± 0.015 | 0.531 ± 0.027 | 0.438 ± 0.024 | 0.983 ± 0.034 | 0.176 ± 0.015 | 0.417 ± 0.025 | 0.306 ± 0.019 | 0.739 ± 0.032 | 0.35 ± 0.021 | 0.5 ± 0.027 | 0.402 ± 0.024 | 0.361 ± 0.022 | 0.154 ± 0.013 | 0.28 ± 0.019 | 0.0 ± 0.0 |
| Tyr | 1.959 ± 0.047 | 0.334 ± 0.018 | 1.952 ± 0.047 | 1.798 ± 0.044 | 2.179 ± 0.053 | 2.293 ± 0.051 | 0.787 ± 0.031 | 2.593 ± 0.052 | 2.322 ± 0.06 | 5.169 ± 0.083 | 0.567 ± 0.028 | 2.272 ± 0.059 | 1.469 ± 0.04 | 2.997 ± 0.068 | 1.656 ± 0.047 | 2.383 ± 0.052 | 1.651 ± 0.041 | 1.452 ± 0.036 | 0.369 ± 0.023 | 1.906 ± 0.052 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 2783 proteins (888209 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
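A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. The function names, the fixed seed, the toy sequences, and the choice of the standard deviation as the error measure are assumptions for illustration; the page does not state which estimator was actually used.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation (in per mille) of a dipeptide frequency over
    bootstrap replicates drawn at the protein level."""
    rng = random.Random(seed)
    # Precompute (hits, total pairs) per protein so each replicate is cheap.
    per_protein = []
    for seq in proteins:
        counts = pair_counts(seq)
        per_protein.append((counts[pair], sum(counts.values())))

    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        if total:  # skip degenerate replicates that contain no dipeptides
            estimates.append(1000.0 * hits / total)
    return stdev(estimates)


# Hypothetical toy proteome, not taken from Halanaerobium salsuginis.
toy = ["MKLLA", "MLLK", "AAAA", "GLLG"]
err = bootstrap_error(toy, "LL")
```

Resampling whole proteins rather than individual residue pairs keeps the dependence between neighbouring dipeptides within a protein intact, which is presumably why the error is quoted "at the protein level".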

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski