Amino acid dipepetide frequency for Pediococcus pentosaceus (strain ATCC 25745 / CCUG 21536 / LMG 10740 / 183-1w)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.667AlaAla: 5.667 ± 0.151
0.306AlaCys: 0.306 ± 0.023
4.322AlaAsp: 4.322 ± 0.105
4.548AlaGlu: 4.548 ± 0.125
3.079AlaPhe: 3.079 ± 0.087
5.551AlaGly: 5.551 ± 0.102
1.342AlaHis: 1.342 ± 0.051
5.91AlaIle: 5.91 ± 0.123
5.331AlaLys: 5.331 ± 0.114
7.098AlaLeu: 7.098 ± 0.133
2.119AlaMet: 2.119 ± 0.058
3.505AlaAsn: 3.505 ± 0.083
2.07AlaPro: 2.07 ± 0.071
3.025AlaGln: 3.025 ± 0.078
2.743AlaArg: 2.743 ± 0.084
4.4AlaSer: 4.4 ± 0.143
4.585AlaThr: 4.585 ± 0.117
5.359AlaVal: 5.359 ± 0.116
0.658AlaTrp: 0.658 ± 0.036
2.285AlaTyr: 2.285 ± 0.07
0.0AlaXaa: 0.0 ± 0.0
Cys
0.288CysAla: 0.288 ± 0.022
0.045CysCys: 0.045 ± 0.008
0.217CysAsp: 0.217 ± 0.02
0.217CysGlu: 0.217 ± 0.023
0.204CysPhe: 0.204 ± 0.019
0.437CysGly: 0.437 ± 0.033
0.112CysHis: 0.112 ± 0.014
0.297CysIle: 0.297 ± 0.024
0.163CysLys: 0.163 ± 0.017
0.415CysLeu: 0.415 ± 0.03
0.101CysMet: 0.101 ± 0.013
0.144CysAsn: 0.144 ± 0.017
0.185CysPro: 0.185 ± 0.02
0.176CysGln: 0.176 ± 0.02
0.14CysArg: 0.14 ± 0.014
0.252CysSer: 0.252 ± 0.021
0.256CysThr: 0.256 ± 0.022
0.235CysVal: 0.235 ± 0.02
0.058CysTrp: 0.058 ± 0.011
0.149CysTyr: 0.149 ± 0.016
0.0CysXaa: 0.0 ± 0.0
Asp
4.07AspAla: 4.07 ± 0.102
0.191AspCys: 0.191 ± 0.018
3.634AspAsp: 3.634 ± 0.144
4.466AspGlu: 4.466 ± 0.105
2.812AspPhe: 2.812 ± 0.074
3.894AspGly: 3.894 ± 0.143
1.3AspHis: 1.3 ± 0.05
4.118AspIle: 4.118 ± 0.097
3.279AspLys: 3.279 ± 0.099
5.362AspLeu: 5.362 ± 0.109
1.407AspMet: 1.407 ± 0.054
2.73AspAsn: 2.73 ± 0.088
2.192AspPro: 2.192 ± 0.088
3.126AspGln: 3.126 ± 0.095
2.425AspArg: 2.425 ± 0.069
3.47AspSer: 3.47 ± 0.367
3.053AspThr: 3.053 ± 0.143
4.198AspVal: 4.198 ± 0.09
0.701AspTrp: 0.701 ± 0.04
2.451AspTyr: 2.451 ± 0.075
0.0AspXaa: 0.0 ± 0.0
Glu
4.634GluAla: 4.634 ± 0.118
0.232GluCys: 0.232 ± 0.023
3.5GluAsp: 3.5 ± 0.088
4.191GluGlu: 4.191 ± 0.107
2.457GluPhe: 2.457 ± 0.073
3.092GluGly: 3.092 ± 0.084
1.287GluHis: 1.287 ± 0.052
4.858GluIle: 4.858 ± 0.11
4.968GluLys: 4.968 ± 0.113
6.336GluLeu: 6.336 ± 0.132
1.882GluMet: 1.882 ± 0.061
3.464GluAsn: 3.464 ± 0.084
1.624GluPro: 1.624 ± 0.059
3.053GluGln: 3.053 ± 0.091
2.788GluArg: 2.788 ± 0.081
2.926GluSer: 2.926 ± 0.077
3.199GluThr: 3.199 ± 0.081
4.247GluVal: 4.247 ± 0.108
0.59GluTrp: 0.59 ± 0.036
2.033GluTyr: 2.033 ± 0.07
0.0GluXaa: 0.0 ± 0.0
Phe
2.967PheAla: 2.967 ± 0.085
0.232PheCys: 0.232 ± 0.025
2.904PheAsp: 2.904 ± 0.074
2.707PheGlu: 2.707 ± 0.078
2.067PhePhe: 2.067 ± 0.073
3.225PheGly: 3.225 ± 0.098
0.686PheHis: 0.686 ± 0.039
3.32PheIle: 3.32 ± 0.096
2.883PheLys: 2.883 ± 0.081
3.754PheLeu: 3.754 ± 0.101
1.104PheMet: 1.104 ± 0.048
2.556PheAsn: 2.556 ± 0.07
1.495PhePro: 1.495 ± 0.052
1.373PheGln: 1.373 ± 0.056
1.366PheArg: 1.366 ± 0.052
2.95PheSer: 2.95 ± 0.086
2.418PheThr: 2.418 ± 0.068
3.021PheVal: 3.021 ± 0.097
0.536PheTrp: 0.536 ± 0.034
1.635PheTyr: 1.635 ± 0.061
0.0PheXaa: 0.0 ± 0.0
Gly
4.643GlyAla: 4.643 ± 0.102
0.327GlyCys: 0.327 ± 0.027
3.445GlyAsp: 3.445 ± 0.106
3.578GlyGlu: 3.578 ± 0.099
3.062GlyPhe: 3.062 ± 0.09
4.396GlyGly: 4.396 ± 0.123
1.336GlyHis: 1.336 ± 0.051
6.134GlyIle: 6.134 ± 0.135
4.815GlyLys: 4.815 ± 0.103
6.276GlyLeu: 6.276 ± 0.129
2.194GlyMet: 2.194 ± 0.066
3.034GlyAsn: 3.034 ± 0.075
1.527GlyPro: 1.527 ± 0.055
2.543GlyGln: 2.543 ± 0.077
2.618GlyArg: 2.618 ± 0.079
4.339GlySer: 4.339 ± 0.114
4.131GlyThr: 4.131 ± 0.107
5.009GlyVal: 5.009 ± 0.104
0.74GlyTrp: 0.74 ± 0.044
2.642GlyTyr: 2.642 ± 0.081
0.0GlyXaa: 0.0 ± 0.0
His
1.33HisAla: 1.33 ± 0.059
0.116HisCys: 0.116 ± 0.014
1.082HisAsp: 1.082 ± 0.055
1.108HisGlu: 1.108 ± 0.052
1.033HisPhe: 1.033 ± 0.05
1.461HisGly: 1.461 ± 0.061
0.66HisHis: 0.66 ± 0.039
1.282HisIle: 1.282 ± 0.047
0.889HisLys: 0.889 ± 0.043
2.009HisLeu: 2.009 ± 0.066
0.417HisMet: 0.417 ± 0.026
0.833HisAsn: 0.833 ± 0.044
1.035HisPro: 1.035 ± 0.049
1.142HisGln: 1.142 ± 0.051
0.895HisArg: 0.895 ± 0.041
1.102HisSer: 1.102 ± 0.055
0.936HisThr: 0.936 ± 0.046
1.278HisVal: 1.278 ± 0.047
0.23HisTrp: 0.23 ± 0.02
0.895HisTyr: 0.895 ± 0.065
0.0HisXaa: 0.0 ± 0.0
Ile
6.082IleAla: 6.082 ± 0.121
0.435IleCys: 0.435 ± 0.031
4.959IleAsp: 4.959 ± 0.113
4.664IleGlu: 4.664 ± 0.114
3.429IlePhe: 3.429 ± 0.1
5.583IleGly: 5.583 ± 0.13
1.379IleHis: 1.379 ± 0.059
5.716IleIle: 5.716 ± 0.141
5.316IleLys: 5.316 ± 0.113
7.093IleLeu: 7.093 ± 0.149
1.925IleMet: 1.925 ± 0.064
4.195IleAsn: 4.195 ± 0.099
2.876IlePro: 2.876 ± 0.065
2.82IleGln: 2.82 ± 0.077
2.749IleArg: 2.749 ± 0.075
5.189IleSer: 5.189 ± 0.134
4.507IleThr: 4.507 ± 0.086
5.473IleVal: 5.473 ± 0.133
0.618IleTrp: 0.618 ± 0.035
2.38IleTyr: 2.38 ± 0.085
0.0IleXaa: 0.0 ± 0.0
Lys
4.996LysAla: 4.996 ± 0.097
0.155LysCys: 0.155 ± 0.018
3.789LysAsp: 3.789 ± 0.097
4.372LysGlu: 4.372 ± 0.11
2.375LysPhe: 2.375 ± 0.079
3.651LysGly: 3.651 ± 0.088
1.286LysHis: 1.286 ± 0.052
5.465LysIle: 5.465 ± 0.116
5.667LysLys: 5.667 ± 0.121
6.074LysLeu: 6.074 ± 0.116
2.459LysMet: 2.459 ± 0.062
4.346LysAsn: 4.346 ± 0.104
2.093LysPro: 2.093 ± 0.073
3.193LysGln: 3.193 ± 0.095
3.021LysArg: 3.021 ± 0.09
3.821LysSer: 3.821 ± 0.196
4.081LysThr: 4.081 ± 0.097
4.858LysVal: 4.858 ± 0.118
0.628LysTrp: 0.628 ± 0.031
2.683LysTyr: 2.683 ± 0.078
0.0LysXaa: 0.0 ± 0.0
Leu
7.433LeuAla: 7.433 ± 0.145
0.396LeuCys: 0.396 ± 0.03
5.32LeuAsp: 5.32 ± 0.121
5.458LeuGlu: 5.458 ± 0.119
3.894LeuPhe: 3.894 ± 0.099
6.47LeuGly: 6.47 ± 0.135
1.564LeuHis: 1.564 ± 0.056
7.36LeuIle: 7.36 ± 0.156
6.883LeuLys: 6.883 ± 0.116
8.447LeuLeu: 8.447 ± 0.17
2.547LeuMet: 2.547 ± 0.07
5.118LeuAsn: 5.118 ± 0.109
3.604LeuPro: 3.604 ± 0.093
3.391LeuGln: 3.391 ± 0.099
3.412LeuArg: 3.412 ± 0.087
6.267LeuSer: 6.267 ± 0.133
5.775LeuThr: 5.775 ± 0.115
6.605LeuVal: 6.605 ± 0.125
0.734LeuTrp: 0.734 ± 0.04
2.59LeuTyr: 2.59 ± 0.065
0.0LeuXaa: 0.0 ± 0.0
Met
2.304MetAla: 2.304 ± 0.062
0.103MetCys: 0.103 ± 0.014
1.622MetAsp: 1.622 ± 0.059
1.5MetGlu: 1.5 ± 0.052
1.048MetPhe: 1.048 ± 0.047
1.925MetGly: 1.925 ± 0.062
0.534MetHis: 0.534 ± 0.033
2.08MetIle: 2.08 ± 0.073
1.9MetLys: 1.9 ± 0.062
2.468MetLeu: 2.468 ± 0.077
0.848MetMet: 0.848 ± 0.041
1.618MetAsn: 1.618 ± 0.053
1.157MetPro: 1.157 ± 0.044
1.183MetGln: 1.183 ± 0.043
1.007MetArg: 1.007 ± 0.044
1.663MetSer: 1.663 ± 0.058
1.642MetThr: 1.642 ± 0.055
2.024MetVal: 2.024 ± 0.067
0.2MetTrp: 0.2 ± 0.019
0.714MetTyr: 0.714 ± 0.036
0.0MetXaa: 0.0 ± 0.0
Asn
3.668AsnAla: 3.668 ± 0.092
0.224AsnCys: 0.224 ± 0.02
3.15AsnAsp: 3.15 ± 0.081
3.283AsnGlu: 3.283 ± 0.084
2.289AsnPhe: 2.289 ± 0.07
3.847AsnGly: 3.847 ± 0.103
1.241AsnHis: 1.241 ± 0.051
3.651AsnIle: 3.651 ± 0.091
3.18AsnLys: 3.18 ± 0.092
4.686AsnLeu: 4.686 ± 0.1
1.241AsnMet: 1.241 ± 0.053
2.943AsnAsn: 2.943 ± 0.1
2.328AsnPro: 2.328 ± 0.069
2.915AsnGln: 2.915 ± 0.08
2.093AsnArg: 2.093 ± 0.075
3.077AsnSer: 3.077 ± 0.154
2.618AsnThr: 2.618 ± 0.079
3.416AsnVal: 3.416 ± 0.068
0.587AsnTrp: 0.587 ± 0.036
1.96AsnTyr: 1.96 ± 0.066
0.0AsnXaa: 0.0 ± 0.0
Pro
2.504ProAla: 2.504 ± 0.07
0.082ProCys: 0.082 ± 0.015
2.24ProAsp: 2.24 ± 0.095
2.913ProGlu: 2.913 ± 0.084
1.566ProPhe: 1.566 ± 0.058
1.971ProGly: 1.971 ± 0.063
0.661ProHis: 0.661 ± 0.037
2.801ProIle: 2.801 ± 0.078
2.283ProLys: 2.283 ± 0.068
2.948ProLeu: 2.948 ± 0.083
0.811ProMet: 0.811 ± 0.036
1.654ProAsn: 1.654 ± 0.051
0.575ProPro: 0.575 ± 0.034
1.409ProGln: 1.409 ± 0.053
1.114ProArg: 1.114 ± 0.047
2.044ProSer: 2.044 ± 0.073
2.265ProThr: 2.265 ± 0.074
2.678ProVal: 2.678 ± 0.065
0.355ProTrp: 0.355 ± 0.026
1.22ProTyr: 1.22 ± 0.048
0.0ProXaa: 0.0 ± 0.0
Gln
3.464GlnAla: 3.464 ± 0.092
0.09GlnCys: 0.09 ± 0.014
1.986GlnAsp: 1.986 ± 0.071
2.098GlnGlu: 2.098 ± 0.064
1.712GlnPhe: 1.712 ± 0.059
2.246GlnGly: 2.246 ± 0.067
0.929GlnHis: 0.929 ± 0.047
3.445GlnIle: 3.445 ± 0.085
3.346GlnLys: 3.346 ± 0.083
4.634GlnLeu: 4.634 ± 0.132
1.186GlnMet: 1.186 ± 0.048
2.261GlnAsn: 2.261 ± 0.068
1.39GlnPro: 1.39 ± 0.059
2.048GlnGln: 2.048 ± 0.088
1.969GlnArg: 1.969 ± 0.068
2.365GlnSer: 2.365 ± 0.072
2.537GlnThr: 2.537 ± 0.07
3.132GlnVal: 3.132 ± 0.099
0.409GlnTrp: 0.409 ± 0.028
1.504GlnTyr: 1.504 ± 0.059
0.0GlnXaa: 0.0 ± 0.0
Arg
2.577ArgAla: 2.577 ± 0.076
0.163ArgCys: 0.163 ± 0.019
2.065ArgAsp: 2.065 ± 0.066
2.487ArgGlu: 2.487 ± 0.098
1.816ArgPhe: 1.816 ± 0.057
2.281ArgGly: 2.281 ± 0.071
0.858ArgHis: 0.858 ± 0.046
2.928ArgIle: 2.928 ± 0.074
2.846ArgLys: 2.846 ± 0.082
3.757ArgLeu: 3.757 ± 0.098
1.218ArgMet: 1.218 ± 0.049
2.052ArgAsn: 2.052 ± 0.06
1.45ArgPro: 1.45 ± 0.057
1.844ArgGln: 1.844 ± 0.06
2.076ArgArg: 2.076 ± 0.075
2.081ArgSer: 2.081 ± 0.074
2.115ArgThr: 2.115 ± 0.061
2.756ArgVal: 2.756 ± 0.079
0.383ArgTrp: 0.383 ± 0.026
1.525ArgTyr: 1.525 ± 0.058
0.0ArgXaa: 0.0 ± 0.0
Ser
4.29SerAla: 4.29 ± 0.166
0.243SerCys: 0.243 ± 0.024
4.002SerAsp: 4.002 ± 0.37
3.701SerGlu: 3.701 ± 0.087
2.95SerPhe: 2.95 ± 0.086
4.432SerGly: 4.432 ± 0.104
1.155SerHis: 1.155 ± 0.056
4.619SerIle: 4.619 ± 0.137
4.413SerLys: 4.413 ± 0.2
5.164SerLeu: 5.164 ± 0.109
1.584SerMet: 1.584 ± 0.049
3.257SerAsn: 3.257 ± 0.137
1.796SerPro: 1.796 ± 0.069
2.47SerGln: 2.47 ± 0.086
2.324SerArg: 2.324 ± 0.082
4.417SerSer: 4.417 ± 0.213
4.03SerThr: 4.03 ± 0.443
4.197SerVal: 4.197 ± 0.159
0.594SerTrp: 0.594 ± 0.038
2.109SerTyr: 2.109 ± 0.056
0.0SerXaa: 0.0 ± 0.0
Thr
4.318ThrAla: 4.318 ± 0.096
0.191ThrCys: 0.191 ± 0.019
3.552ThrAsp: 3.552 ± 0.119
3.217ThrGlu: 3.217 ± 0.075
2.39ThrPhe: 2.39 ± 0.067
4.271ThrGly: 4.271 ± 0.094
1.13ThrHis: 1.13 ± 0.049
4.615ThrIle: 4.615 ± 0.092
3.95ThrLys: 3.95 ± 0.09
5.383ThrLeu: 5.383 ± 0.119
1.499ThrMet: 1.499 ± 0.06
2.993ThrAsn: 2.993 ± 0.081
2.534ThrPro: 2.534 ± 0.075
2.042ThrGln: 2.042 ± 0.058
1.925ThrArg: 1.925 ± 0.059
4.103ThrSer: 4.103 ± 0.477
3.898ThrThr: 3.898 ± 0.159
4.406ThrVal: 4.406 ± 0.163
0.577ThrTrp: 0.577 ± 0.031
1.902ThrTyr: 1.902 ± 0.072
0.0ThrXaa: 0.0 ± 0.0
Val
5.762ValAla: 5.762 ± 0.104
0.338ValCys: 0.338 ± 0.027
4.413ValAsp: 4.413 ± 0.118
4.552ValGlu: 4.552 ± 0.11
2.777ValPhe: 2.777 ± 0.084
4.987ValGly: 4.987 ± 0.102
1.278ValHis: 1.278 ± 0.053
5.706ValIle: 5.706 ± 0.105
4.58ValLys: 4.58 ± 0.1
6.5ValLeu: 6.5 ± 0.128
1.865ValMet: 1.865 ± 0.059
3.459ValAsn: 3.459 ± 0.076
2.625ValPro: 2.625 ± 0.073
2.577ValGln: 2.577 ± 0.076
2.586ValArg: 2.586 ± 0.083
4.671ValSer: 4.671 ± 0.15
4.492ValThr: 4.492 ± 0.135
5.419ValVal: 5.419 ± 0.116
0.581ValTrp: 0.581 ± 0.032
2.124ValTyr: 2.124 ± 0.066
0.0ValXaa: 0.0 ± 0.0
Trp
0.59TrpAla: 0.59 ± 0.03
0.043TrpCys: 0.043 ± 0.009
0.499TrpAsp: 0.499 ± 0.032
0.42TrpGlu: 0.42 ± 0.027
0.475TrpPhe: 0.475 ± 0.038
0.661TrpGly: 0.661 ± 0.04
0.237TrpHis: 0.237 ± 0.023
0.826TrpIle: 0.826 ± 0.044
0.609TrpLys: 0.609 ± 0.036
1.099TrpLeu: 1.099 ± 0.05
0.303TrpMet: 0.303 ± 0.025
0.602TrpAsn: 0.602 ± 0.038
0.249TrpPro: 0.249 ± 0.023
0.493TrpGln: 0.493 ± 0.03
0.372TrpArg: 0.372 ± 0.028
0.592TrpSer: 0.592 ± 0.041
0.516TrpThr: 0.516 ± 0.039
0.643TrpVal: 0.643 ± 0.036
0.163TrpTrp: 0.163 ± 0.017
0.346TrpTyr: 0.346 ± 0.024
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.317TyrAla: 2.317 ± 0.067
0.179TyrCys: 0.179 ± 0.02
2.179TyrAsp: 2.179 ± 0.069
1.88TyrGlu: 1.88 ± 0.057
1.766TyrPhe: 1.766 ± 0.059
2.407TyrGly: 2.407 ± 0.073
0.759TyrHis: 0.759 ± 0.041
2.253TyrIle: 2.253 ± 0.067
1.712TyrLys: 1.712 ± 0.072
3.763TyrLeu: 3.763 ± 0.092
0.83TyrMet: 0.83 ± 0.041
1.635TyrAsn: 1.635 ± 0.059
1.291TyrPro: 1.291 ± 0.042
1.93TyrGln: 1.93 ± 0.065
1.655TyrArg: 1.655 ± 0.057
2.018TyrSer: 2.018 ± 0.071
1.842TyrThr: 1.842 ± 0.072
2.339TyrVal: 2.339 ± 0.082
0.394TyrTrp: 0.394 ± 0.029
1.5TyrTyr: 1.5 ± 0.063
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1755 proteins (535200 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski