Amino acid dipepetide frequency for Lysobacter sp. HDW10

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.674AlaAla: 14.674 ± 0.259
1.171AlaCys: 1.171 ± 0.049
6.412AlaAsp: 6.412 ± 0.097
6.69AlaGlu: 6.69 ± 0.142
4.327AlaPhe: 4.327 ± 0.083
9.102AlaGly: 9.102 ± 0.174
2.85AlaHis: 2.85 ± 0.074
5.743AlaIle: 5.743 ± 0.093
4.668AlaLys: 4.668 ± 0.108
14.087AlaLeu: 14.087 ± 0.182
3.471AlaMet: 3.471 ± 0.067
3.522AlaAsn: 3.522 ± 0.098
5.352AlaPro: 5.352 ± 0.116
5.056AlaGln: 5.056 ± 0.104
8.175AlaArg: 8.175 ± 0.14
6.669AlaSer: 6.669 ± 0.133
6.027AlaThr: 6.027 ± 0.158
8.287AlaVal: 8.287 ± 0.119
1.884AlaTrp: 1.884 ± 0.058
2.493AlaTyr: 2.493 ± 0.067
0.0AlaXaa: 0.0 ± 0.0
Cys
1.038CysAla: 1.038 ± 0.048
0.096CysCys: 0.096 ± 0.013
0.506CysAsp: 0.506 ± 0.032
0.489CysGlu: 0.489 ± 0.031
0.311CysPhe: 0.311 ± 0.023
0.84CysGly: 0.84 ± 0.041
0.241CysHis: 0.241 ± 0.019
0.457CysIle: 0.457 ± 0.031
0.325CysLys: 0.325 ± 0.022
0.744CysLeu: 0.744 ± 0.037
0.198CysMet: 0.198 ± 0.019
0.232CysAsn: 0.232 ± 0.016
0.407CysPro: 0.407 ± 0.028
0.203CysGln: 0.203 ± 0.016
0.407CysArg: 0.407 ± 0.024
0.454CysSer: 0.454 ± 0.028
0.501CysThr: 0.501 ± 0.026
0.816CysVal: 0.816 ± 0.039
0.112CysTrp: 0.112 ± 0.015
0.172CysTyr: 0.172 ± 0.016
0.0CysXaa: 0.0 ± 0.0
Asp
8.769AspAla: 8.769 ± 0.128
0.393AspCys: 0.393 ± 0.026
3.06AspAsp: 3.06 ± 0.087
3.181AspGlu: 3.181 ± 0.078
2.119AspPhe: 2.119 ± 0.058
4.599AspGly: 4.599 ± 0.105
1.109AspHis: 1.109 ± 0.042
3.133AspIle: 3.133 ± 0.072
1.934AspLys: 1.934 ± 0.051
5.449AspLeu: 5.449 ± 0.095
1.381AspMet: 1.381 ± 0.041
1.486AspAsn: 1.486 ± 0.054
2.903AspPro: 2.903 ± 0.062
1.721AspGln: 1.721 ± 0.051
3.508AspArg: 3.508 ± 0.086
2.576AspSer: 2.576 ± 0.076
3.036AspThr: 3.036 ± 0.075
4.846AspVal: 4.846 ± 0.09
0.884AspTrp: 0.884 ± 0.037
1.499AspTyr: 1.499 ± 0.054
0.0AspXaa: 0.0 ± 0.0
Glu
6.938GluAla: 6.938 ± 0.126
0.417GluCys: 0.417 ± 0.027
2.934GluAsp: 2.934 ± 0.067
2.551GluGlu: 2.551 ± 0.083
1.845GluPhe: 1.845 ± 0.056
4.333GluGly: 4.333 ± 0.09
1.326GluHis: 1.326 ± 0.049
3.002GluIle: 3.002 ± 0.087
2.217GluLys: 2.217 ± 0.065
5.264GluLeu: 5.264 ± 0.108
1.42GluMet: 1.42 ± 0.051
1.643GluAsn: 1.643 ± 0.053
1.984GluPro: 1.984 ± 0.058
2.15GluGln: 2.15 ± 0.092
4.467GluArg: 4.467 ± 0.11
3.237GluSer: 3.237 ± 0.067
3.08GluThr: 3.08 ± 0.075
4.183GluVal: 4.183 ± 0.087
0.732GluTrp: 0.732 ± 0.04
1.142GluTyr: 1.142 ± 0.039
0.0GluXaa: 0.0 ± 0.0
Phe
4.287PheAla: 4.287 ± 0.094
0.348PheCys: 0.348 ± 0.02
2.804PheAsp: 2.804 ± 0.068
2.486PheGlu: 2.486 ± 0.061
1.281PhePhe: 1.281 ± 0.045
3.206PheGly: 3.206 ± 0.077
0.773PheHis: 0.773 ± 0.039
1.702PheIle: 1.702 ± 0.057
1.508PheLys: 1.508 ± 0.051
3.045PheLeu: 3.045 ± 0.08
0.779PheMet: 0.779 ± 0.037
1.362PheAsn: 1.362 ± 0.053
1.412PhePro: 1.412 ± 0.045
1.023PheGln: 1.023 ± 0.041
1.885PheArg: 1.885 ± 0.058
2.32PheSer: 2.32 ± 0.064
1.828PheThr: 1.828 ± 0.067
2.829PheVal: 2.829 ± 0.075
0.495PheTrp: 0.495 ± 0.031
0.782PheTyr: 0.782 ± 0.04
0.0PheXaa: 0.0 ± 0.0
Gly
8.244GlyAla: 8.244 ± 0.161
0.763GlyCys: 0.763 ± 0.038
4.269GlyAsp: 4.269 ± 0.093
4.453GlyGlu: 4.453 ± 0.097
3.289GlyPhe: 3.289 ± 0.071
6.345GlyGly: 6.345 ± 0.22
1.937GlyHis: 1.937 ± 0.06
4.352GlyIle: 4.352 ± 0.092
3.797GlyLys: 3.797 ± 0.094
7.63GlyLeu: 7.63 ± 0.113
2.465GlyMet: 2.465 ± 0.067
2.65GlyAsn: 2.65 ± 0.11
2.56GlyPro: 2.56 ± 0.062
2.983GlyGln: 2.983 ± 0.08
4.921GlyArg: 4.921 ± 0.108
4.358GlySer: 4.358 ± 0.13
4.519GlyThr: 4.519 ± 0.168
6.314GlyVal: 6.314 ± 0.107
1.289GlyTrp: 1.289 ± 0.052
2.214GlyTyr: 2.214 ± 0.059
0.0GlyXaa: 0.0 ± 0.0
His
3.284HisAla: 3.284 ± 0.075
0.265HisCys: 0.265 ± 0.023
1.18HisAsp: 1.18 ± 0.049
1.125HisGlu: 1.125 ± 0.048
0.96HisPhe: 0.96 ± 0.036
1.927HisGly: 1.927 ± 0.06
0.614HisHis: 0.614 ± 0.033
0.992HisIle: 0.992 ± 0.042
0.605HisLys: 0.605 ± 0.029
2.189HisLeu: 2.189 ± 0.059
0.529HisMet: 0.529 ± 0.028
0.529HisAsn: 0.529 ± 0.032
1.359HisPro: 1.359 ± 0.051
0.654HisGln: 0.654 ± 0.034
1.458HisArg: 1.458 ± 0.049
1.168HisSer: 1.168 ± 0.04
1.179HisThr: 1.179 ± 0.044
1.891HisVal: 1.891 ± 0.049
0.435HisTrp: 0.435 ± 0.024
0.621HisTyr: 0.621 ± 0.032
0.0HisXaa: 0.0 ± 0.0
Ile
6.959IleAla: 6.959 ± 0.123
0.439IleCys: 0.439 ± 0.03
3.645IleAsp: 3.645 ± 0.08
3.722IleGlu: 3.722 ± 0.076
1.489IlePhe: 1.489 ± 0.051
4.351IleGly: 4.351 ± 0.095
0.998IleHis: 0.998 ± 0.034
1.794IleIle: 1.794 ± 0.063
1.612IleLys: 1.612 ± 0.052
3.824IleLeu: 3.824 ± 0.088
0.812IleMet: 0.812 ± 0.034
1.514IleAsn: 1.514 ± 0.054
2.276IlePro: 2.276 ± 0.062
1.653IleGln: 1.653 ± 0.048
3.089IleArg: 3.089 ± 0.066
2.921IleSer: 2.921 ± 0.087
2.592IleThr: 2.592 ± 0.075
3.756IleVal: 3.756 ± 0.067
0.547IleTrp: 0.547 ± 0.031
1.038IleTyr: 1.038 ± 0.036
0.0IleXaa: 0.0 ± 0.0
Lys
4.377LysAla: 4.377 ± 0.113
0.213LysCys: 0.213 ± 0.019
2.221LysAsp: 2.221 ± 0.068
1.746LysGlu: 1.746 ± 0.061
1.198LysPhe: 1.198 ± 0.042
2.716LysGly: 2.716 ± 0.069
1.014LysHis: 1.014 ± 0.041
1.736LysIle: 1.736 ± 0.055
1.727LysLys: 1.727 ± 0.103
3.772LysLeu: 3.772 ± 0.092
0.985LysMet: 0.985 ± 0.042
1.167LysAsn: 1.167 ± 0.042
2.265LysPro: 2.265 ± 0.062
1.712LysGln: 1.712 ± 0.056
2.931LysArg: 2.931 ± 0.075
2.319LysSer: 2.319 ± 0.065
2.192LysThr: 2.192 ± 0.057
2.816LysVal: 2.816 ± 0.078
0.432LysTrp: 0.432 ± 0.023
0.841LysTyr: 0.841 ± 0.041
0.0LysXaa: 0.0 ± 0.0
Leu
11.919LeuAla: 11.919 ± 0.17
0.964LeuCys: 0.964 ± 0.045
5.993LeuAsp: 5.993 ± 0.101
5.332LeuGlu: 5.332 ± 0.124
3.419LeuPhe: 3.419 ± 0.077
7.638LeuGly: 7.638 ± 0.124
2.22LeuHis: 2.22 ± 0.062
4.769LeuIle: 4.769 ± 0.088
4.195LeuLys: 4.195 ± 0.086
9.588LeuLeu: 9.588 ± 0.189
2.557LeuMet: 2.557 ± 0.081
3.159LeuAsn: 3.159 ± 0.064
5.426LeuPro: 5.426 ± 0.087
3.623LeuGln: 3.623 ± 0.078
7.003LeuArg: 7.003 ± 0.123
6.489LeuSer: 6.489 ± 0.101
5.119LeuThr: 5.119 ± 0.107
6.922LeuVal: 6.922 ± 0.111
1.236LeuTrp: 1.236 ± 0.055
2.085LeuTyr: 2.085 ± 0.063
0.0LeuXaa: 0.0 ± 0.0
Met
2.814MetAla: 2.814 ± 0.058
0.174MetCys: 0.174 ± 0.017
1.343MetAsp: 1.343 ± 0.048
1.045MetGlu: 1.045 ± 0.045
0.788MetPhe: 0.788 ± 0.038
1.927MetGly: 1.927 ± 0.058
0.674MetHis: 0.674 ± 0.034
1.158MetIle: 1.158 ± 0.044
1.229MetLys: 1.229 ± 0.042
2.536MetLeu: 2.536 ± 0.061
0.643MetMet: 0.643 ± 0.032
0.938MetAsn: 0.938 ± 0.032
1.633MetPro: 1.633 ± 0.05
1.298MetGln: 1.298 ± 0.044
2.032MetArg: 2.032 ± 0.058
1.853MetSer: 1.853 ± 0.049
1.61MetThr: 1.61 ± 0.049
1.649MetVal: 1.649 ± 0.048
0.22MetTrp: 0.22 ± 0.02
0.444MetTyr: 0.444 ± 0.025
0.0MetXaa: 0.0 ± 0.0
Asn
4.281AsnAla: 4.281 ± 0.113
0.244AsnCys: 0.244 ± 0.021
1.763AsnAsp: 1.763 ± 0.07
1.535AsnGlu: 1.535 ± 0.053
1.094AsnPhe: 1.094 ± 0.044
2.9AsnGly: 2.9 ± 0.102
0.603AsnHis: 0.603 ± 0.029
1.528AsnIle: 1.528 ± 0.057
1.053AsnLys: 1.053 ± 0.045
2.647AsnLeu: 2.647 ± 0.069
0.628AsnMet: 0.628 ± 0.024
0.963AsnAsn: 0.963 ± 0.085
1.89AsnPro: 1.89 ± 0.054
0.926AsnGln: 0.926 ± 0.046
1.857AsnArg: 1.857 ± 0.054
1.588AsnSer: 1.588 ± 0.07
1.775AsnThr: 1.775 ± 0.073
2.433AsnVal: 2.433 ± 0.103
0.461AsnTrp: 0.461 ± 0.026
0.794AsnTyr: 0.794 ± 0.037
0.0AsnXaa: 0.0 ± 0.0
Pro
5.38ProAla: 5.38 ± 0.123
0.311ProCys: 0.311 ± 0.021
3.129ProAsp: 3.129 ± 0.074
3.318ProGlu: 3.318 ± 0.07
1.659ProPhe: 1.659 ± 0.051
3.655ProGly: 3.655 ± 0.078
1.001ProHis: 1.001 ± 0.041
2.298ProIle: 2.298 ± 0.063
1.878ProLys: 1.878 ± 0.059
4.308ProLeu: 4.308 ± 0.084
1.517ProMet: 1.517 ± 0.047
1.655ProAsn: 1.655 ± 0.046
1.995ProPro: 1.995 ± 0.067
1.674ProGln: 1.674 ± 0.055
2.542ProArg: 2.542 ± 0.068
2.768ProSer: 2.768 ± 0.071
2.611ProThr: 2.611 ± 0.07
3.914ProVal: 3.914 ± 0.08
0.692ProTrp: 0.692 ± 0.035
1.127ProTyr: 1.127 ± 0.041
0.0ProXaa: 0.0 ± 0.0
Gln
4.549GlnAla: 4.549 ± 0.114
0.287GlnCys: 0.287 ± 0.022
1.695GlnAsp: 1.695 ± 0.05
1.467GlnGlu: 1.467 ± 0.047
1.341GlnPhe: 1.341 ± 0.048
2.87GlnGly: 2.87 ± 0.063
0.853GlnHis: 0.853 ± 0.039
1.746GlnIle: 1.746 ± 0.052
1.334GlnLys: 1.334 ± 0.043
4.077GlnLeu: 4.077 ± 0.086
0.998GlnMet: 0.998 ± 0.034
1.0GlnAsn: 1.0 ± 0.04
1.686GlnPro: 1.686 ± 0.05
1.57GlnGln: 1.57 ± 0.059
2.96GlnArg: 2.96 ± 0.084
2.286GlnSer: 2.286 ± 0.055
1.909GlnThr: 1.909 ± 0.057
2.887GlnVal: 2.887 ± 0.067
0.617GlnTrp: 0.617 ± 0.029
0.753GlnTyr: 0.753 ± 0.039
0.0GlnXaa: 0.0 ± 0.0
Arg
7.905ArgAla: 7.905 ± 0.132
0.485ArgCys: 0.485 ± 0.031
4.129ArgAsp: 4.129 ± 0.097
4.013ArgGlu: 4.013 ± 0.106
2.672ArgPhe: 2.672 ± 0.066
4.532ArgGly: 4.532 ± 0.089
1.508ArgHis: 1.508 ± 0.049
3.874ArgIle: 3.874 ± 0.087
2.591ArgLys: 2.591 ± 0.079
6.512ArgLeu: 6.512 ± 0.134
2.087ArgMet: 2.087 ± 0.054
2.18ArgAsn: 2.18 ± 0.06
2.746ArgPro: 2.746 ± 0.077
2.227ArgGln: 2.227 ± 0.064
4.129ArgArg: 4.129 ± 0.101
3.324ArgSer: 3.324 ± 0.077
3.397ArgThr: 3.397 ± 0.068
5.331ArgVal: 5.331 ± 0.101
1.069ArgTrp: 1.069 ± 0.043
1.878ArgTyr: 1.878 ± 0.05
0.0ArgXaa: 0.0 ± 0.0
Ser
6.684SerAla: 6.684 ± 0.134
0.438SerCys: 0.438 ± 0.03
3.166SerAsp: 3.166 ± 0.077
3.114SerGlu: 3.114 ± 0.079
2.013SerPhe: 2.013 ± 0.062
5.297SerGly: 5.297 ± 0.131
1.218SerHis: 1.218 ± 0.048
2.847SerIle: 2.847 ± 0.066
2.397SerLys: 2.397 ± 0.072
5.329SerLeu: 5.329 ± 0.115
1.452SerMet: 1.452 ± 0.056
1.995SerAsn: 1.995 ± 0.078
2.66SerPro: 2.66 ± 0.061
2.017SerGln: 2.017 ± 0.064
3.675SerArg: 3.675 ± 0.08
3.139SerSer: 3.139 ± 0.091
3.358SerThr: 3.358 ± 0.114
4.306SerVal: 4.306 ± 0.101
0.754SerTrp: 0.754 ± 0.033
1.297SerTyr: 1.297 ± 0.048
0.0SerXaa: 0.0 ± 0.0
Thr
5.974ThrAla: 5.974 ± 0.157
0.444ThrCys: 0.444 ± 0.027
2.867ThrAsp: 2.867 ± 0.097
2.594ThrGlu: 2.594 ± 0.059
1.914ThrPhe: 1.914 ± 0.078
4.763ThrGly: 4.763 ± 0.117
1.502ThrHis: 1.502 ± 0.049
2.265ThrIle: 2.265 ± 0.087
1.394ThrLys: 1.394 ± 0.052
6.261ThrLeu: 6.261 ± 0.099
1.078ThrMet: 1.078 ± 0.044
1.368ThrAsn: 1.368 ± 0.08
3.318ThrPro: 3.318 ± 0.074
2.197ThrGln: 2.197 ± 0.057
3.732ThrArg: 3.732 ± 0.077
2.952ThrSer: 2.952 ± 0.125
2.978ThrThr: 2.978 ± 0.112
4.254ThrVal: 4.254 ± 0.147
0.773ThrTrp: 0.773 ± 0.034
1.269ThrTyr: 1.269 ± 0.064
0.0ThrXaa: 0.0 ± 0.0
Val
8.532ValAla: 8.532 ± 0.135
0.762ValCys: 0.762 ± 0.032
4.606ValAsp: 4.606 ± 0.081
4.393ValGlu: 4.393 ± 0.093
2.867ValPhe: 2.867 ± 0.065
5.538ValGly: 5.538 ± 0.114
1.711ValHis: 1.711 ± 0.053
4.002ValIle: 4.002 ± 0.082
2.643ValLys: 2.643 ± 0.068
8.126ValLeu: 8.126 ± 0.13
2.014ValMet: 2.014 ± 0.052
2.49ValAsn: 2.49 ± 0.08
3.758ValPro: 3.758 ± 0.071
2.656ValGln: 2.656 ± 0.079
4.976ValArg: 4.976 ± 0.089
4.587ValSer: 4.587 ± 0.096
4.163ValThr: 4.163 ± 0.192
6.302ValVal: 6.302 ± 0.117
0.988ValTrp: 0.988 ± 0.035
1.57ValTyr: 1.57 ± 0.047
0.0ValXaa: 0.0 ± 0.0
Trp
1.198TrpAla: 1.198 ± 0.049
0.151TrpCys: 0.151 ± 0.016
0.577TrpAsp: 0.577 ± 0.03
0.475TrpGlu: 0.475 ± 0.029
0.566TrpPhe: 0.566 ± 0.029
0.835TrpGly: 0.835 ± 0.04
0.389TrpHis: 0.389 ± 0.028
0.76TrpIle: 0.76 ± 0.029
0.519TrpLys: 0.519 ± 0.029
1.984TrpLeu: 1.984 ± 0.063
0.5TrpMet: 0.5 ± 0.027
0.45TrpAsn: 0.45 ± 0.026
0.699TrpPro: 0.699 ± 0.035
0.726TrpGln: 0.726 ± 0.032
1.131TrpArg: 1.131 ± 0.037
0.847TrpSer: 0.847 ± 0.039
0.732TrpThr: 0.732 ± 0.034
1.148TrpVal: 1.148 ± 0.048
0.266TrpTrp: 0.266 ± 0.022
0.333TrpTyr: 0.333 ± 0.021
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.853TyrAla: 2.853 ± 0.071
0.185TyrCys: 0.185 ± 0.015
1.289TyrAsp: 1.289 ± 0.049
1.136TyrGlu: 1.136 ± 0.039
1.004TyrPhe: 1.004 ± 0.037
1.979TyrGly: 1.979 ± 0.05
0.42TyrHis: 0.42 ± 0.027
0.872TyrIle: 0.872 ± 0.038
0.754TyrLys: 0.754 ± 0.036
2.285TyrLeu: 2.285 ± 0.066
0.492TyrMet: 0.492 ± 0.027
0.661TyrAsn: 0.661 ± 0.034
1.105TyrPro: 1.105 ± 0.041
0.815TyrGln: 0.815 ± 0.038
1.711TyrArg: 1.711 ± 0.055
1.301TyrSer: 1.301 ± 0.04
1.292TyrThr: 1.292 ± 0.062
1.8TyrVal: 1.8 ± 0.053
0.398TyrTrp: 0.398 ± 0.027
0.543TyrTyr: 0.543 ± 0.036
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2055 proteins (676245 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski