Amino acid dipepetide frequency for Listeria booriae

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.492AlaAla: 6.492 ± 0.111
0.605AlaCys: 0.605 ± 0.024
5.138AlaAsp: 5.138 ± 0.37
5.381AlaGlu: 5.381 ± 0.115
3.684AlaPhe: 3.684 ± 0.061
5.834AlaGly: 5.834 ± 0.087
1.342AlaHis: 1.342 ± 0.044
6.551AlaIle: 6.551 ± 0.098
5.697AlaLys: 5.697 ± 0.071
7.818AlaLeu: 7.818 ± 0.097
2.262AlaMet: 2.262 ± 0.054
3.653AlaAsn: 3.653 ± 0.067
2.566AlaPro: 2.566 ± 0.061
2.897AlaGln: 2.897 ± 0.07
2.843AlaArg: 2.843 ± 0.064
4.775AlaSer: 4.775 ± 0.078
5.443AlaThr: 5.443 ± 0.114
5.871AlaVal: 5.871 ± 0.084
0.713AlaTrp: 0.713 ± 0.03
2.722AlaTyr: 2.722 ± 0.052
0.0AlaXaa: 0.0 ± 0.0
Cys
0.443CysAla: 0.443 ± 0.022
0.074CysCys: 0.074 ± 0.009
0.345CysAsp: 0.345 ± 0.017
0.36CysGlu: 0.36 ± 0.019
0.348CysPhe: 0.348 ± 0.02
0.552CysGly: 0.552 ± 0.027
0.164CysHis: 0.164 ± 0.012
0.499CysIle: 0.499 ± 0.024
0.25CysLys: 0.25 ± 0.016
0.69CysLeu: 0.69 ± 0.03
0.163CysMet: 0.163 ± 0.014
0.217CysAsn: 0.217 ± 0.015
0.247CysPro: 0.247 ± 0.019
0.204CysGln: 0.204 ± 0.014
0.228CysArg: 0.228 ± 0.016
0.386CysSer: 0.386 ± 0.022
0.352CysThr: 0.352 ± 0.016
0.426CysVal: 0.426 ± 0.024
0.065CysTrp: 0.065 ± 0.009
0.241CysTyr: 0.241 ± 0.016
0.0CysXaa: 0.0 ± 0.0
Asp
5.322AspAla: 5.322 ± 0.375
0.325AspCys: 0.325 ± 0.018
2.747AspAsp: 2.747 ± 0.047
3.772AspGlu: 3.772 ± 0.066
2.803AspPhe: 2.803 ± 0.054
4.258AspGly: 4.258 ± 0.082
0.85AspHis: 0.85 ± 0.029
4.158AspIle: 4.158 ± 0.067
3.63AspLys: 3.63 ± 0.072
4.952AspLeu: 4.952 ± 0.079
1.472AspMet: 1.472 ± 0.05
2.125AspAsn: 2.125 ± 0.051
1.994AspPro: 1.994 ± 0.052
1.724AspGln: 1.724 ± 0.046
1.992AspArg: 1.992 ± 0.05
2.876AspSer: 2.876 ± 0.057
3.333AspThr: 3.333 ± 0.072
4.41AspVal: 4.41 ± 0.082
0.693AspTrp: 0.693 ± 0.029
2.324AspTyr: 2.324 ± 0.052
0.0AspXaa: 0.0 ± 0.0
Glu
5.856GluAla: 5.856 ± 0.111
0.269GluCys: 0.269 ± 0.018
3.398GluAsp: 3.398 ± 0.075
4.972GluGlu: 4.972 ± 0.109
2.249GluPhe: 2.249 ± 0.047
3.499GluGly: 3.499 ± 0.069
1.228GluHis: 1.228 ± 0.043
4.876GluIle: 4.876 ± 0.081
5.453GluLys: 5.453 ± 0.092
6.517GluLeu: 6.517 ± 0.099
2.127GluMet: 2.127 ± 0.053
3.441GluAsn: 3.441 ± 0.067
1.859GluPro: 1.859 ± 0.046
3.206GluGln: 3.206 ± 0.067
3.173GluArg: 3.173 ± 0.07
3.051GluSer: 3.051 ± 0.062
4.073GluThr: 4.073 ± 0.08
4.711GluVal: 4.711 ± 0.091
0.649GluTrp: 0.649 ± 0.03
1.898GluTyr: 1.898 ± 0.046
0.0GluXaa: 0.0 ± 0.0
Phe
3.474PheAla: 3.474 ± 0.071
0.37PheCys: 0.37 ± 0.019
2.673PheAsp: 2.673 ± 0.056
2.613PheGlu: 2.613 ± 0.06
2.325PhePhe: 2.325 ± 0.066
3.356PheGly: 3.356 ± 0.067
0.868PheHis: 0.868 ± 0.028
3.578PheIle: 3.578 ± 0.07
2.148PheLys: 2.148 ± 0.051
4.571PheLeu: 4.571 ± 0.095
1.184PheMet: 1.184 ± 0.037
1.72PheAsn: 1.72 ± 0.041
1.573PhePro: 1.573 ± 0.04
1.796PheGln: 1.796 ± 0.042
1.652PheArg: 1.652 ± 0.037
2.995PheSer: 2.995 ± 0.066
2.834PheThr: 2.834 ± 0.062
3.471PheVal: 3.471 ± 0.061
0.504PheTrp: 0.504 ± 0.025
1.774PheTyr: 1.774 ± 0.046
0.0PheXaa: 0.0 ± 0.0
Gly
5.346GlyAla: 5.346 ± 0.086
0.48GlyCys: 0.48 ± 0.023
3.615GlyAsp: 3.615 ± 0.064
4.146GlyGlu: 4.146 ± 0.074
3.308GlyPhe: 3.308 ± 0.063
4.454GlyGly: 4.454 ± 0.088
1.169GlyHis: 1.169 ± 0.036
5.597GlyIle: 5.597 ± 0.088
4.777GlyLys: 4.777 ± 0.086
6.345GlyLeu: 6.345 ± 0.098
1.924GlyMet: 1.924 ± 0.048
3.069GlyAsn: 3.069 ± 0.07
1.453GlyPro: 1.453 ± 0.041
2.117GlyGln: 2.117 ± 0.043
2.437GlyArg: 2.437 ± 0.055
3.975GlySer: 3.975 ± 0.076
4.777GlyThr: 4.777 ± 0.106
5.118GlyVal: 5.118 ± 0.075
0.794GlyTrp: 0.794 ± 0.027
2.742GlyTyr: 2.742 ± 0.052
0.0GlyXaa: 0.0 ± 0.0
His
1.263HisAla: 1.263 ± 0.039
0.183HisCys: 0.183 ± 0.014
1.094HisAsp: 1.094 ± 0.038
1.197HisGlu: 1.197 ± 0.038
1.04HisPhe: 1.04 ± 0.029
1.307HisGly: 1.307 ± 0.04
0.53HisHis: 0.53 ± 0.026
1.372HisIle: 1.372 ± 0.039
0.85HisLys: 0.85 ± 0.032
1.805HisLeu: 1.805 ± 0.04
0.478HisMet: 0.478 ± 0.022
0.654HisAsn: 0.654 ± 0.022
0.881HisPro: 0.881 ± 0.029
0.704HisGln: 0.704 ± 0.028
0.816HisArg: 0.816 ± 0.032
0.911HisSer: 0.911 ± 0.032
0.996HisThr: 0.996 ± 0.037
1.414HisVal: 1.414 ± 0.047
0.197HisTrp: 0.197 ± 0.015
0.778HisTyr: 0.778 ± 0.031
0.0HisXaa: 0.0 ± 0.0
Ile
6.879IleAla: 6.879 ± 0.09
0.683IleCys: 0.683 ± 0.029
4.175IleAsp: 4.175 ± 0.065
4.596IleGlu: 4.596 ± 0.081
3.366IlePhe: 3.366 ± 0.075
5.709IleGly: 5.709 ± 0.087
1.452IleHis: 1.452 ± 0.045
5.344IleIle: 5.344 ± 0.107
3.823IleLys: 3.823 ± 0.07
6.821IleLeu: 6.821 ± 0.096
1.738IleMet: 1.738 ± 0.045
2.905IleAsn: 2.905 ± 0.06
3.352IlePro: 3.352 ± 0.063
2.651IleGln: 2.651 ± 0.061
3.016IleArg: 3.016 ± 0.065
4.699IleSer: 4.699 ± 0.079
4.902IleThr: 4.902 ± 0.088
5.467IleVal: 5.467 ± 0.086
0.721IleTrp: 0.721 ± 0.032
2.391IleTyr: 2.391 ± 0.054
0.0IleXaa: 0.0 ± 0.0
Lys
5.415LysAla: 5.415 ± 0.095
0.236LysCys: 0.236 ± 0.015
3.595LysAsp: 3.595 ± 0.065
5.256LysGlu: 5.256 ± 0.078
1.85LysPhe: 1.85 ± 0.047
3.636LysGly: 3.636 ± 0.065
1.136LysHis: 1.136 ± 0.037
4.508LysIle: 4.508 ± 0.075
5.192LysLys: 5.192 ± 0.089
5.488LysLeu: 5.488 ± 0.08
2.286LysMet: 2.286 ± 0.048
3.299LysAsn: 3.299 ± 0.055
2.159LysPro: 2.159 ± 0.052
3.267LysGln: 3.267 ± 0.064
2.939LysArg: 2.939 ± 0.047
3.13LysSer: 3.13 ± 0.056
4.125LysThr: 4.125 ± 0.074
4.379LysVal: 4.379 ± 0.07
0.725LysTrp: 0.725 ± 0.028
2.264LysTyr: 2.264 ± 0.047
0.0LysXaa: 0.0 ± 0.0
Leu
8.374LeuAla: 8.374 ± 0.114
0.665LeuCys: 0.665 ± 0.028
5.214LeuAsp: 5.214 ± 0.078
6.298LeuGlu: 6.298 ± 0.114
4.6LeuPhe: 4.6 ± 0.081
6.226LeuGly: 6.226 ± 0.091
1.81LeuHis: 1.81 ± 0.041
6.485LeuIle: 6.485 ± 0.099
5.531LeuLys: 5.531 ± 0.081
9.355LeuLeu: 9.355 ± 0.14
2.388LeuMet: 2.388 ± 0.05
3.812LeuAsn: 3.812 ± 0.075
3.85LeuPro: 3.85 ± 0.068
3.779LeuGln: 3.779 ± 0.074
3.659LeuArg: 3.659 ± 0.062
5.692LeuSer: 5.692 ± 0.079
6.063LeuThr: 6.063 ± 0.101
6.73LeuVal: 6.73 ± 0.092
0.764LeuTrp: 0.764 ± 0.031
2.918LeuTyr: 2.918 ± 0.063
0.0LeuXaa: 0.0 ± 0.0
Met
2.254MetAla: 2.254 ± 0.058
0.152MetCys: 0.152 ± 0.013
1.522MetAsp: 1.522 ± 0.045
1.896MetGlu: 1.896 ± 0.045
1.026MetPhe: 1.026 ± 0.035
1.612MetGly: 1.612 ± 0.042
0.451MetHis: 0.451 ± 0.021
1.945MetIle: 1.945 ± 0.048
2.303MetLys: 2.303 ± 0.05
2.514MetLeu: 2.514 ± 0.055
0.89MetMet: 0.89 ± 0.034
1.309MetAsn: 1.309 ± 0.034
1.11MetPro: 1.11 ± 0.038
1.092MetGln: 1.092 ± 0.034
1.224MetArg: 1.224 ± 0.039
1.619MetSer: 1.619 ± 0.04
1.93MetThr: 1.93 ± 0.046
1.624MetVal: 1.624 ± 0.041
0.18MetTrp: 0.18 ± 0.014
0.824MetTyr: 0.824 ± 0.031
0.0MetXaa: 0.0 ± 0.0
Asn
3.239AsnAla: 3.239 ± 0.07
0.231AsnCys: 0.231 ± 0.016
2.365AsnAsp: 2.365 ± 0.064
2.798AsnGlu: 2.798 ± 0.056
1.85AsnPhe: 1.85 ± 0.046
3.467AsnGly: 3.467 ± 0.078
0.868AsnHis: 0.868 ± 0.03
3.321AsnIle: 3.321 ± 0.059
2.846AsnLys: 2.846 ± 0.053
3.792AsnLeu: 3.792 ± 0.072
1.226AsnMet: 1.226 ± 0.035
2.142AsnAsn: 2.142 ± 0.055
2.153AsnPro: 2.153 ± 0.051
1.948AsnGln: 1.948 ± 0.051
1.794AsnArg: 1.794 ± 0.045
2.107AsnSer: 2.107 ± 0.051
2.616AsnThr: 2.616 ± 0.07
3.217AsnVal: 3.217 ± 0.057
0.617AsnTrp: 0.617 ± 0.03
1.737AsnTyr: 1.737 ± 0.051
0.0AsnXaa: 0.0 ± 0.0
Pro
2.811ProAla: 2.811 ± 0.058
0.168ProCys: 0.168 ± 0.014
2.206ProAsp: 2.206 ± 0.053
3.03ProGlu: 3.03 ± 0.065
1.764ProPhe: 1.764 ± 0.04
2.12ProGly: 2.12 ± 0.055
0.663ProHis: 0.663 ± 0.028
2.649ProIle: 2.649 ± 0.063
2.119ProLys: 2.119 ± 0.041
3.086ProLeu: 3.086 ± 0.057
0.892ProMet: 0.892 ± 0.031
1.845ProAsn: 1.845 ± 0.045
0.798ProPro: 0.798 ± 0.027
1.085ProGln: 1.085 ± 0.036
1.065ProArg: 1.065 ± 0.038
2.055ProSer: 2.055 ± 0.047
2.379ProThr: 2.379 ± 0.06
2.943ProVal: 2.943 ± 0.064
0.321ProTrp: 0.321 ± 0.018
1.312ProTyr: 1.312 ± 0.034
0.0ProXaa: 0.0 ± 0.0
Gln
3.357GlnAla: 3.357 ± 0.077
0.14GlnCys: 0.14 ± 0.012
1.918GlnAsp: 1.918 ± 0.047
2.811GlnGlu: 2.811 ± 0.061
1.673GlnPhe: 1.673 ± 0.042
2.206GlnGly: 2.206 ± 0.05
0.726GlnHis: 0.726 ± 0.028
2.811GlnIle: 2.811 ± 0.053
2.819GlnLys: 2.819 ± 0.066
3.981GlnLeu: 3.981 ± 0.082
1.122GlnMet: 1.122 ± 0.033
1.772GlnAsn: 1.772 ± 0.047
1.197GlnPro: 1.197 ± 0.037
1.862GlnGln: 1.862 ± 0.048
1.308GlnArg: 1.308 ± 0.039
1.9GlnSer: 1.9 ± 0.044
2.323GlnThr: 2.323 ± 0.059
2.858GlnVal: 2.858 ± 0.055
0.328GlnTrp: 0.328 ± 0.017
1.227GlnTyr: 1.227 ± 0.037
0.0GlnXaa: 0.0 ± 0.0
Arg
2.712ArgAla: 2.712 ± 0.063
0.199ArgCys: 0.199 ± 0.015
2.261ArgAsp: 2.261 ± 0.047
3.112ArgGlu: 3.112 ± 0.067
1.943ArgPhe: 1.943 ± 0.045
2.184ArgGly: 2.184 ± 0.051
0.77ArgHis: 0.77 ± 0.032
2.909ArgIle: 2.909 ± 0.056
2.82ArgLys: 2.82 ± 0.061
3.725ArgLeu: 3.725 ± 0.065
1.177ArgMet: 1.177 ± 0.036
1.77ArgAsn: 1.77 ± 0.044
1.142ArgPro: 1.142 ± 0.039
1.521ArgGln: 1.521 ± 0.044
1.696ArgArg: 1.696 ± 0.051
1.876ArgSer: 1.876 ± 0.046
2.057ArgThr: 2.057 ± 0.046
2.829ArgVal: 2.829 ± 0.061
0.366ArgTrp: 0.366 ± 0.021
1.557ArgTyr: 1.557 ± 0.041
0.0ArgXaa: 0.0 ± 0.0
Ser
4.08SerAla: 4.08 ± 0.07
0.319SerCys: 0.319 ± 0.016
3.197SerAsp: 3.197 ± 0.074
3.488SerGlu: 3.488 ± 0.065
2.877SerPhe: 2.877 ± 0.056
4.479SerGly: 4.479 ± 0.08
1.06SerHis: 1.06 ± 0.036
4.281SerIle: 4.281 ± 0.076
3.533SerLys: 3.533 ± 0.058
5.396SerLeu: 5.396 ± 0.083
1.503SerMet: 1.503 ± 0.043
2.56SerAsn: 2.56 ± 0.065
1.927SerPro: 1.927 ± 0.046
1.889SerGln: 1.889 ± 0.047
2.045SerArg: 2.045 ± 0.044
3.375SerSer: 3.375 ± 0.086
3.241SerThr: 3.241 ± 0.076
4.152SerVal: 4.152 ± 0.073
0.599SerTrp: 0.599 ± 0.027
2.2SerTyr: 2.2 ± 0.051
0.0SerXaa: 0.0 ± 0.0
Thr
5.2ThrAla: 5.2 ± 0.105
0.311ThrCys: 0.311 ± 0.018
3.594ThrAsp: 3.594 ± 0.095
3.743ThrGlu: 3.743 ± 0.063
3.03ThrPhe: 3.03 ± 0.058
4.819ThrGly: 4.819 ± 0.094
1.106ThrHis: 1.106 ± 0.033
5.09ThrIle: 5.09 ± 0.098
4.124ThrLys: 4.124 ± 0.078
5.85ThrLeu: 5.85 ± 0.087
1.541ThrMet: 1.541 ± 0.041
2.864ThrAsn: 2.864 ± 0.074
2.749ThrPro: 2.749 ± 0.066
2.028ThrGln: 2.028 ± 0.051
1.962ThrArg: 1.962 ± 0.045
3.578ThrSer: 3.578 ± 0.075
4.528ThrThr: 4.528 ± 0.113
5.148ThrVal: 5.148 ± 0.12
0.672ThrTrp: 0.672 ± 0.026
2.367ThrTyr: 2.367 ± 0.059
0.0ThrXaa: 0.0 ± 0.0
Val
6.59ValAla: 6.59 ± 0.088
0.535ValCys: 0.535 ± 0.024
3.999ValAsp: 3.999 ± 0.066
4.25ValGlu: 4.25 ± 0.074
3.293ValPhe: 3.293 ± 0.065
4.896ValGly: 4.896 ± 0.075
1.206ValHis: 1.206 ± 0.037
5.655ValIle: 5.655 ± 0.079
4.352ValLys: 4.352 ± 0.066
7.049ValLeu: 7.049 ± 0.095
1.902ValMet: 1.902 ± 0.052
3.072ValAsn: 3.072 ± 0.064
2.759ValPro: 2.759 ± 0.054
2.419ValGln: 2.419 ± 0.05
2.681ValArg: 2.681 ± 0.063
4.581ValSer: 4.581 ± 0.082
5.54ValThr: 5.54 ± 0.137
5.569ValVal: 5.569 ± 0.091
0.674ValTrp: 0.674 ± 0.027
2.449ValTyr: 2.449 ± 0.057
0.0ValXaa: 0.0 ± 0.0
Trp
0.589TrpAla: 0.589 ± 0.029
0.055TrpCys: 0.055 ± 0.007
0.587TrpAsp: 0.587 ± 0.025
0.6TrpGlu: 0.6 ± 0.025
0.48TrpPhe: 0.48 ± 0.023
0.677TrpGly: 0.677 ± 0.028
0.253TrpHis: 0.253 ± 0.016
0.728TrpIle: 0.728 ± 0.031
0.734TrpLys: 0.734 ± 0.027
1.061TrpLeu: 1.061 ± 0.034
0.328TrpMet: 0.328 ± 0.019
0.542TrpAsn: 0.542 ± 0.025
0.238TrpPro: 0.238 ± 0.017
0.524TrpGln: 0.524 ± 0.024
0.471TrpArg: 0.471 ± 0.025
0.584TrpSer: 0.584 ± 0.028
0.54TrpThr: 0.54 ± 0.023
0.613TrpVal: 0.613 ± 0.027
0.144TrpTrp: 0.144 ± 0.013
0.364TrpTyr: 0.364 ± 0.018
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.589TyrAla: 2.589 ± 0.066
0.255TyrCys: 0.255 ± 0.017
2.119TyrAsp: 2.119 ± 0.051
2.185TyrGlu: 2.185 ± 0.054
1.967TyrPhe: 1.967 ± 0.053
2.353TyrGly: 2.353 ± 0.053
0.814TyrHis: 0.814 ± 0.039
2.333TyrIle: 2.333 ± 0.057
1.883TyrLys: 1.883 ± 0.045
3.399TyrLeu: 3.399 ± 0.059
0.873TyrMet: 0.873 ± 0.032
1.562TyrAsn: 1.562 ± 0.046
1.303TyrPro: 1.303 ± 0.04
1.627TyrGln: 1.627 ± 0.046
1.627TyrArg: 1.627 ± 0.037
2.039TyrSer: 2.039 ± 0.053
2.252TyrThr: 2.252 ± 0.052
2.522TyrVal: 2.522 ± 0.05
0.387TyrTrp: 0.387 ± 0.021
1.56TyrTyr: 1.56 ± 0.048
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3104 proteins (990240 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski