Amino acid dipepetide frequency for Lactobacillus aviarius subsp. araffinosus DSM 20653

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.955AlaAla: 5.955 ± 0.151
0.479AlaCys: 0.479 ± 0.033
4.289AlaAsp: 4.289 ± 0.125
4.173AlaGlu: 4.173 ± 0.128
3.264AlaPhe: 3.264 ± 0.119
4.962AlaGly: 4.962 ± 0.113
1.493AlaHis: 1.493 ± 0.053
5.774AlaIle: 5.774 ± 0.122
5.236AlaLys: 5.236 ± 0.128
6.925AlaLeu: 6.925 ± 0.135
2.005AlaMet: 2.005 ± 0.077
3.331AlaAsn: 3.331 ± 0.101
2.068AlaPro: 2.068 ± 0.066
3.552AlaGln: 3.552 ± 0.115
2.568AlaArg: 2.568 ± 0.08
3.722AlaSer: 3.722 ± 0.113
4.029AlaThr: 4.029 ± 0.098
5.057AlaVal: 5.057 ± 0.112
0.642AlaTrp: 0.642 ± 0.042
2.263AlaTyr: 2.263 ± 0.082
0.0AlaXaa: 0.0 ± 0.0
Cys
0.486CysAla: 0.486 ± 0.037
0.067CysCys: 0.067 ± 0.01
0.412CysAsp: 0.412 ± 0.03
0.319CysGlu: 0.319 ± 0.024
0.326CysPhe: 0.326 ± 0.03
0.616CysGly: 0.616 ± 0.044
0.163CysHis: 0.163 ± 0.021
0.475CysIle: 0.475 ± 0.035
0.305CysLys: 0.305 ± 0.027
0.709CysLeu: 0.709 ± 0.044
0.123CysMet: 0.123 ± 0.016
0.251CysAsn: 0.251 ± 0.026
0.268CysPro: 0.268 ± 0.029
0.219CysGln: 0.219 ± 0.024
0.249CysArg: 0.249 ± 0.025
0.444CysSer: 0.444 ± 0.031
0.258CysThr: 0.258 ± 0.026
0.465CysVal: 0.465 ± 0.04
0.121CysTrp: 0.121 ± 0.017
0.309CysTyr: 0.309 ± 0.028
0.0CysXaa: 0.0 ± 0.0
Asp
3.887AspAla: 3.887 ± 0.105
0.349AspCys: 0.349 ± 0.032
3.696AspAsp: 3.696 ± 0.115
4.529AspGlu: 4.529 ± 0.127
2.838AspPhe: 2.838 ± 0.089
3.724AspGly: 3.724 ± 0.089
1.484AspHis: 1.484 ± 0.064
3.792AspIle: 3.792 ± 0.1
3.573AspLys: 3.573 ± 0.107
5.45AspLeu: 5.45 ± 0.121
1.454AspMet: 1.454 ± 0.054
2.743AspAsn: 2.743 ± 0.087
2.256AspPro: 2.256 ± 0.083
3.324AspGln: 3.324 ± 0.126
2.154AspArg: 2.154 ± 0.082
2.875AspSer: 2.875 ± 0.086
2.475AspThr: 2.475 ± 0.088
3.878AspVal: 3.878 ± 0.105
0.621AspTrp: 0.621 ± 0.043
2.524AspTyr: 2.524 ± 0.083
0.0AspXaa: 0.0 ± 0.0
Glu
3.903GluAla: 3.903 ± 0.117
0.416GluCys: 0.416 ± 0.03
3.238GluAsp: 3.238 ± 0.087
3.938GluGlu: 3.938 ± 0.134
2.326GluPhe: 2.326 ± 0.078
2.78GluGly: 2.78 ± 0.096
1.177GluHis: 1.177 ± 0.057
4.85GluIle: 4.85 ± 0.137
5.594GluLys: 5.594 ± 0.14
5.825GluLeu: 5.825 ± 0.159
2.056GluMet: 2.056 ± 0.077
3.771GluAsn: 3.771 ± 0.108
1.621GluPro: 1.621 ± 0.066
2.673GluGln: 2.673 ± 0.085
2.661GluArg: 2.661 ± 0.095
2.659GluSer: 2.659 ± 0.082
2.908GluThr: 2.908 ± 0.089
3.896GluVal: 3.896 ± 0.105
0.572GluTrp: 0.572 ± 0.036
2.091GluTyr: 2.091 ± 0.07
0.0GluXaa: 0.0 ± 0.0
Phe
3.001PheAla: 3.001 ± 0.104
0.391PheCys: 0.391 ± 0.033
2.945PheAsp: 2.945 ± 0.097
2.668PheGlu: 2.668 ± 0.077
2.145PhePhe: 2.145 ± 0.087
3.324PheGly: 3.324 ± 0.103
0.87PheHis: 0.87 ± 0.044
3.494PheIle: 3.494 ± 0.123
3.092PheLys: 3.092 ± 0.099
4.113PheLeu: 4.113 ± 0.115
1.228PheMet: 1.228 ± 0.053
2.733PheAsn: 2.733 ± 0.088
1.424PhePro: 1.424 ± 0.066
1.447PheGln: 1.447 ± 0.056
1.275PheArg: 1.275 ± 0.062
2.736PheSer: 2.736 ± 0.104
2.384PheThr: 2.384 ± 0.077
3.017PheVal: 3.017 ± 0.094
0.533PheTrp: 0.533 ± 0.042
1.714PheTyr: 1.714 ± 0.063
0.0PheXaa: 0.0 ± 0.0
Gly
4.355GlyAla: 4.355 ± 0.129
0.486GlyCys: 0.486 ± 0.036
3.245GlyAsp: 3.245 ± 0.1
3.299GlyGlu: 3.299 ± 0.087
3.05GlyPhe: 3.05 ± 0.084
4.136GlyGly: 4.136 ± 0.119
1.403GlyHis: 1.403 ± 0.062
6.234GlyIle: 6.234 ± 0.158
5.367GlyLys: 5.367 ± 0.119
5.997GlyLeu: 5.997 ± 0.147
2.173GlyMet: 2.173 ± 0.086
3.147GlyAsn: 3.147 ± 0.106
1.484GlyPro: 1.484 ± 0.059
2.691GlyGln: 2.691 ± 0.098
2.375GlyArg: 2.375 ± 0.088
3.696GlySer: 3.696 ± 0.105
4.059GlyThr: 4.059 ± 0.099
4.457GlyVal: 4.457 ± 0.114
0.721GlyTrp: 0.721 ± 0.053
2.815GlyTyr: 2.815 ± 0.082
0.0GlyXaa: 0.0 ± 0.0
His
1.428HisAla: 1.428 ± 0.052
0.184HisCys: 0.184 ± 0.021
1.238HisAsp: 1.238 ± 0.059
1.298HisGlu: 1.298 ± 0.06
1.21HisPhe: 1.21 ± 0.056
1.698HisGly: 1.698 ± 0.07
0.835HisHis: 0.835 ± 0.048
1.317HisIle: 1.317 ± 0.061
1.126HisLys: 1.126 ± 0.057
2.28HisLeu: 2.28 ± 0.085
0.488HisMet: 0.488 ± 0.038
1.017HisAsn: 1.017 ± 0.051
1.114HisPro: 1.114 ± 0.054
1.433HisGln: 1.433 ± 0.06
1.024HisArg: 1.024 ± 0.049
1.086HisSer: 1.086 ± 0.058
0.914HisThr: 0.914 ± 0.051
1.41HisVal: 1.41 ± 0.066
0.247HisTrp: 0.247 ± 0.025
0.905HisTyr: 0.905 ± 0.045
0.0HisXaa: 0.0 ± 0.0
Ile
6.139IleAla: 6.139 ± 0.133
0.682IleCys: 0.682 ± 0.042
4.745IleAsp: 4.745 ± 0.121
4.334IleGlu: 4.334 ± 0.122
3.485IlePhe: 3.485 ± 0.126
5.627IleGly: 5.627 ± 0.141
1.552IleHis: 1.552 ± 0.064
6.05IleIle: 6.05 ± 0.159
5.362IleLys: 5.362 ± 0.111
7.341IleLeu: 7.341 ± 0.195
1.882IleMet: 1.882 ± 0.069
4.155IleAsn: 4.155 ± 0.118
3.008IlePro: 3.008 ± 0.086
3.173IleGln: 3.173 ± 0.098
2.687IleArg: 2.687 ± 0.095
4.648IleSer: 4.648 ± 0.119
4.473IleThr: 4.473 ± 0.108
5.425IleVal: 5.425 ± 0.106
0.658IleTrp: 0.658 ± 0.044
2.459IleTyr: 2.459 ± 0.087
0.0IleXaa: 0.0 ± 0.0
Lys
4.866LysAla: 4.866 ± 0.106
0.295LysCys: 0.295 ± 0.027
4.229LysAsp: 4.229 ± 0.12
4.85LysGlu: 4.85 ± 0.129
2.64LysPhe: 2.64 ± 0.084
3.678LysGly: 3.678 ± 0.109
1.447LysHis: 1.447 ± 0.064
5.836LysIle: 5.836 ± 0.138
6.511LysLys: 6.511 ± 0.135
6.457LysLeu: 6.457 ± 0.147
2.784LysMet: 2.784 ± 0.081
4.21LysAsn: 4.21 ± 0.115
2.27LysPro: 2.27 ± 0.076
3.529LysGln: 3.529 ± 0.104
3.317LysArg: 3.317 ± 0.108
3.457LysSer: 3.457 ± 0.095
3.945LysThr: 3.945 ± 0.096
5.104LysVal: 5.104 ± 0.112
0.714LysTrp: 0.714 ± 0.049
2.794LysTyr: 2.794 ± 0.083
0.002LysXaa: 0.002 ± 0.002
Leu
7.402LeuAla: 7.402 ± 0.159
0.593LeuCys: 0.593 ± 0.04
5.397LeuAsp: 5.397 ± 0.132
4.918LeuGlu: 4.918 ± 0.134
4.159LeuPhe: 4.159 ± 0.112
6.374LeuGly: 6.374 ± 0.14
1.954LeuHis: 1.954 ± 0.067
7.7LeuIle: 7.7 ± 0.172
7.355LeuLys: 7.355 ± 0.128
8.509LeuLeu: 8.509 ± 0.177
2.705LeuMet: 2.705 ± 0.093
5.315LeuAsn: 5.315 ± 0.133
3.622LeuPro: 3.622 ± 0.092
3.675LeuGln: 3.675 ± 0.117
3.426LeuArg: 3.426 ± 0.099
5.792LeuSer: 5.792 ± 0.138
5.913LeuThr: 5.913 ± 0.116
6.032LeuVal: 6.032 ± 0.137
0.835LeuTrp: 0.835 ± 0.047
2.782LeuTyr: 2.782 ± 0.088
0.0LeuXaa: 0.0 ± 0.0
Met
2.249MetAla: 2.249 ± 0.079
0.181MetCys: 0.181 ± 0.022
1.614MetAsp: 1.614 ± 0.065
1.482MetGlu: 1.482 ± 0.068
1.049MetPhe: 1.049 ± 0.052
1.84MetGly: 1.84 ± 0.078
0.579MetHis: 0.579 ± 0.033
2.522MetIle: 2.522 ± 0.079
2.363MetLys: 2.363 ± 0.069
2.703MetLeu: 2.703 ± 0.081
0.933MetMet: 0.933 ± 0.05
1.577MetAsn: 1.577 ± 0.059
1.082MetPro: 1.082 ± 0.056
1.226MetGln: 1.226 ± 0.053
1.1MetArg: 1.1 ± 0.05
1.512MetSer: 1.512 ± 0.066
1.726MetThr: 1.726 ± 0.064
1.903MetVal: 1.903 ± 0.079
0.216MetTrp: 0.216 ± 0.025
0.758MetTyr: 0.758 ± 0.045
0.0MetXaa: 0.0 ± 0.0
Asn
3.627AsnAla: 3.627 ± 0.113
0.309AsnCys: 0.309 ± 0.027
3.373AsnAsp: 3.373 ± 0.107
3.424AsnGlu: 3.424 ± 0.092
2.312AsnPhe: 2.312 ± 0.084
4.02AsnGly: 4.02 ± 0.137
1.459AsnHis: 1.459 ± 0.068
3.287AsnIle: 3.287 ± 0.094
3.375AsnLys: 3.375 ± 0.102
4.692AsnLeu: 4.692 ± 0.107
1.289AsnMet: 1.289 ± 0.057
3.057AsnAsn: 3.057 ± 0.129
2.152AsnPro: 2.152 ± 0.07
3.303AsnGln: 3.303 ± 0.126
2.056AsnArg: 2.056 ± 0.067
2.917AsnSer: 2.917 ± 0.098
2.429AsnThr: 2.429 ± 0.075
3.482AsnVal: 3.482 ± 0.087
0.654AsnTrp: 0.654 ± 0.044
2.222AsnTyr: 2.222 ± 0.073
0.0AsnXaa: 0.0 ± 0.0
Pro
2.559ProAla: 2.559 ± 0.079
0.188ProCys: 0.188 ± 0.023
2.189ProAsp: 2.189 ± 0.078
2.694ProGlu: 2.694 ± 0.09
1.631ProPhe: 1.631 ± 0.069
2.068ProGly: 2.068 ± 0.075
0.919ProHis: 0.919 ± 0.053
2.489ProIle: 2.489 ± 0.082
2.047ProLys: 2.047 ± 0.073
3.071ProLeu: 3.071 ± 0.099
0.816ProMet: 0.816 ± 0.046
1.807ProAsn: 1.807 ± 0.065
0.623ProPro: 0.623 ± 0.041
1.942ProGln: 1.942 ± 0.069
1.168ProArg: 1.168 ± 0.049
1.842ProSer: 1.842 ± 0.071
2.052ProThr: 2.052 ± 0.071
2.684ProVal: 2.684 ± 0.085
0.34ProTrp: 0.34 ± 0.028
1.391ProTyr: 1.391 ± 0.053
0.0ProXaa: 0.0 ± 0.0
Gln
3.61GlnAla: 3.61 ± 0.113
0.158GlnCys: 0.158 ± 0.02
2.121GlnAsp: 2.121 ± 0.075
2.447GlnGlu: 2.447 ± 0.087
1.735GlnPhe: 1.735 ± 0.068
2.333GlnGly: 2.333 ± 0.081
1.035GlnHis: 1.035 ± 0.054
3.815GlnIle: 3.815 ± 0.099
3.841GlnLys: 3.841 ± 0.114
4.878GlnLeu: 4.878 ± 0.136
1.449GlnMet: 1.449 ± 0.058
2.764GlnAsn: 2.764 ± 0.094
1.589GlnPro: 1.589 ± 0.065
2.752GlnGln: 2.752 ± 0.117
2.256GlnArg: 2.256 ± 0.093
2.454GlnSer: 2.454 ± 0.095
2.789GlnThr: 2.789 ± 0.082
3.071GlnVal: 3.071 ± 0.096
0.472GlnTrp: 0.472 ± 0.039
1.659GlnTyr: 1.659 ± 0.079
0.0GlnXaa: 0.0 ± 0.0
Arg
2.282ArgAla: 2.282 ± 0.069
0.209ArgCys: 0.209 ± 0.021
2.082ArgAsp: 2.082 ± 0.078
2.594ArgGlu: 2.594 ± 0.09
1.882ArgPhe: 1.882 ± 0.071
2.208ArgGly: 2.208 ± 0.079
0.933ArgHis: 0.933 ± 0.048
2.984ArgIle: 2.984 ± 0.086
3.373ArgLys: 3.373 ± 0.095
3.647ArgLeu: 3.647 ± 0.089
1.179ArgMet: 1.179 ± 0.057
2.038ArgAsn: 2.038 ± 0.075
1.41ArgPro: 1.41 ± 0.061
2.031ArgGln: 2.031 ± 0.071
1.961ArgArg: 1.961 ± 0.073
1.894ArgSer: 1.894 ± 0.071
1.903ArgThr: 1.903 ± 0.068
2.361ArgVal: 2.361 ± 0.09
0.316ArgTrp: 0.316 ± 0.029
1.561ArgTyr: 1.561 ± 0.065
0.0ArgXaa: 0.0 ± 0.0
Ser
3.745SerAla: 3.745 ± 0.105
0.316SerCys: 0.316 ± 0.027
3.026SerAsp: 3.026 ± 0.099
2.903SerGlu: 2.903 ± 0.1
2.761SerPhe: 2.761 ± 0.089
3.803SerGly: 3.803 ± 0.098
1.179SerHis: 1.179 ± 0.055
4.055SerIle: 4.055 ± 0.116
3.98SerLys: 3.98 ± 0.115
5.397SerLeu: 5.397 ± 0.116
1.677SerMet: 1.677 ± 0.069
2.91SerAsn: 2.91 ± 0.098
1.777SerPro: 1.777 ± 0.065
2.617SerGln: 2.617 ± 0.096
2.07SerArg: 2.07 ± 0.073
4.152SerSer: 4.152 ± 0.254
3.073SerThr: 3.073 ± 0.087
3.678SerVal: 3.678 ± 0.083
0.607SerTrp: 0.607 ± 0.041
1.947SerTyr: 1.947 ± 0.064
0.0SerXaa: 0.0 ± 0.0
Thr
4.155ThrAla: 4.155 ± 0.111
0.342ThrCys: 0.342 ± 0.033
3.124ThrAsp: 3.124 ± 0.096
2.922ThrGlu: 2.922 ± 0.092
2.575ThrPhe: 2.575 ± 0.082
4.106ThrGly: 4.106 ± 0.112
1.17ThrHis: 1.17 ± 0.056
4.243ThrIle: 4.243 ± 0.119
3.345ThrLys: 3.345 ± 0.109
5.292ThrLeu: 5.292 ± 0.107
1.345ThrMet: 1.345 ± 0.063
2.777ThrAsn: 2.777 ± 0.088
2.447ThrPro: 2.447 ± 0.072
2.089ThrGln: 2.089 ± 0.076
1.733ThrArg: 1.733 ± 0.061
3.222ThrSer: 3.222 ± 0.1
3.299ThrThr: 3.299 ± 0.103
4.448ThrVal: 4.448 ± 0.108
0.465ThrTrp: 0.465 ± 0.034
1.945ThrTyr: 1.945 ± 0.08
0.0ThrXaa: 0.0 ± 0.0
Val
5.26ValAla: 5.26 ± 0.108
0.577ValCys: 0.577 ± 0.043
3.901ValAsp: 3.901 ± 0.1
3.885ValGlu: 3.885 ± 0.109
2.673ValPhe: 2.673 ± 0.096
4.529ValGly: 4.529 ± 0.106
1.314ValHis: 1.314 ± 0.062
5.713ValIle: 5.713 ± 0.114
4.866ValLys: 4.866 ± 0.112
6.134ValLeu: 6.134 ± 0.137
1.793ValMet: 1.793 ± 0.069
3.575ValAsn: 3.575 ± 0.095
2.624ValPro: 2.624 ± 0.069
2.694ValGln: 2.694 ± 0.076
2.529ValArg: 2.529 ± 0.082
4.085ValSer: 4.085 ± 0.124
4.185ValThr: 4.185 ± 0.108
4.752ValVal: 4.752 ± 0.127
0.633ValTrp: 0.633 ± 0.042
2.363ValTyr: 2.363 ± 0.1
0.0ValXaa: 0.0 ± 0.0
Trp
0.596TrpAla: 0.596 ± 0.043
0.067TrpCys: 0.067 ± 0.012
0.535TrpAsp: 0.535 ± 0.04
0.416TrpGlu: 0.416 ± 0.03
0.458TrpPhe: 0.458 ± 0.041
0.686TrpGly: 0.686 ± 0.042
0.298TrpHis: 0.298 ± 0.028
0.896TrpIle: 0.896 ± 0.059
0.586TrpLys: 0.586 ± 0.038
1.142TrpLeu: 1.142 ± 0.064
0.363TrpMet: 0.363 ± 0.03
0.616TrpAsn: 0.616 ± 0.04
0.209TrpPro: 0.209 ± 0.023
0.53TrpGln: 0.53 ± 0.044
0.423TrpArg: 0.423 ± 0.036
0.477TrpSer: 0.477 ± 0.034
0.451TrpThr: 0.451 ± 0.038
0.602TrpVal: 0.602 ± 0.039
0.123TrpTrp: 0.123 ± 0.02
0.354TrpTyr: 0.354 ± 0.037
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.284TyrAla: 2.284 ± 0.079
0.293TyrCys: 0.293 ± 0.029
2.226TyrAsp: 2.226 ± 0.074
1.866TyrGlu: 1.866 ± 0.065
2.014TyrPhe: 2.014 ± 0.074
2.677TyrGly: 2.677 ± 0.084
0.986TyrHis: 0.986 ± 0.051
2.359TyrIle: 2.359 ± 0.074
1.64TyrLys: 1.64 ± 0.061
3.896TyrLeu: 3.896 ± 0.097
0.868TyrMet: 0.868 ± 0.049
1.645TyrAsn: 1.645 ± 0.063
1.472TyrPro: 1.472 ± 0.059
2.329TyrGln: 2.329 ± 0.076
1.814TyrArg: 1.814 ± 0.067
2.005TyrSer: 2.005 ± 0.065
1.777TyrThr: 1.777 ± 0.079
2.363TyrVal: 2.363 ± 0.073
0.34TyrTrp: 0.34 ± 0.031
1.642TyrTyr: 1.642 ± 0.096
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.002XaaMet: 0.002 ± 0.002
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.007XaaXaa: 0.007 ± 0.007
Statistics based on 1386 proteins (429889 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski