Amino acid dipepetide frequency for Oenococcus oeni (strain ATCC BAA-331 / PSU-1)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.586AlaAla: 6.586 ± 0.132
0.249AlaCys: 0.249 ± 0.023
4.425AlaAsp: 4.425 ± 0.118
3.882AlaGlu: 3.882 ± 0.11
3.446AlaPhe: 3.446 ± 0.088
5.574AlaGly: 5.574 ± 0.125
1.232AlaHis: 1.232 ± 0.056
6.195AlaIle: 6.195 ± 0.136
5.414AlaLys: 5.414 ± 0.118
7.433AlaLeu: 7.433 ± 0.151
1.717AlaMet: 1.717 ± 0.058
3.485AlaAsn: 3.485 ± 0.089
1.96AlaPro: 1.96 ± 0.076
2.661AlaGln: 2.661 ± 0.092
2.831AlaArg: 2.831 ± 0.083
4.882AlaSer: 4.882 ± 0.129
3.674AlaThr: 3.674 ± 0.089
5.26AlaVal: 5.26 ± 0.121
0.679AlaTrp: 0.679 ± 0.037
2.41AlaTyr: 2.41 ± 0.084
0.0AlaXaa: 0.0 ± 0.0
Cys
0.241CysAla: 0.241 ± 0.027
0.021CysCys: 0.021 ± 0.007
0.138CysAsp: 0.138 ± 0.016
0.111CysGlu: 0.111 ± 0.016
0.212CysPhe: 0.212 ± 0.024
0.341CysGly: 0.341 ± 0.032
0.111CysHis: 0.111 ± 0.016
0.222CysIle: 0.222 ± 0.021
0.125CysLys: 0.125 ± 0.019
0.424CysLeu: 0.424 ± 0.032
0.062CysMet: 0.062 ± 0.013
0.132CysAsn: 0.132 ± 0.017
0.167CysPro: 0.167 ± 0.019
0.154CysGln: 0.154 ± 0.021
0.132CysArg: 0.132 ± 0.017
0.251CysSer: 0.251 ± 0.025
0.13CysThr: 0.13 ± 0.017
0.171CysVal: 0.171 ± 0.019
0.051CysTrp: 0.051 ± 0.011
0.13CysTyr: 0.13 ± 0.015
0.0CysXaa: 0.0 ± 0.0
Asp
3.5AspAla: 3.5 ± 0.088
0.169AspCys: 0.169 ± 0.021
3.269AspAsp: 3.269 ± 0.101
3.42AspGlu: 3.42 ± 0.09
3.376AspPhe: 3.376 ± 0.092
3.448AspGly: 3.448 ± 0.104
1.236AspHis: 1.236 ± 0.053
3.781AspIle: 3.781 ± 0.096
3.971AspLys: 3.971 ± 0.11
5.885AspLeu: 5.885 ± 0.135
1.209AspMet: 1.209 ± 0.055
2.696AspAsn: 2.696 ± 0.086
2.299AspPro: 2.299 ± 0.075
3.162AspGln: 3.162 ± 0.093
2.513AspArg: 2.513 ± 0.078
3.656AspSer: 3.656 ± 0.095
2.433AspThr: 2.433 ± 0.075
3.136AspVal: 3.136 ± 0.087
0.769AspTrp: 0.769 ± 0.043
2.502AspTyr: 2.502 ± 0.068
0.0AspXaa: 0.0 ± 0.0
Glu
3.547GluAla: 3.547 ± 0.109
0.134GluCys: 0.134 ± 0.015
2.739GluAsp: 2.739 ± 0.087
3.109GluGlu: 3.109 ± 0.08
2.295GluPhe: 2.295 ± 0.073
2.412GluGly: 2.412 ± 0.069
1.075GluHis: 1.075 ± 0.049
4.953GluIle: 4.953 ± 0.116
5.663GluLys: 5.663 ± 0.123
5.33GluLeu: 5.33 ± 0.129
1.452GluMet: 1.452 ± 0.055
3.874GluAsn: 3.874 ± 0.108
1.388GluPro: 1.388 ± 0.061
2.229GluGln: 2.229 ± 0.065
2.085GluArg: 2.085 ± 0.072
3.029GluSer: 3.029 ± 0.079
2.842GluThr: 2.842 ± 0.073
3.064GluVal: 3.064 ± 0.093
0.391GluTrp: 0.391 ± 0.025
1.75GluTyr: 1.75 ± 0.058
0.0GluXaa: 0.0 ± 0.0
Phe
3.691PheAla: 3.691 ± 0.093
0.253PheCys: 0.253 ± 0.022
3.109PheAsp: 3.109 ± 0.09
2.272PheGlu: 2.272 ± 0.08
2.945PhePhe: 2.945 ± 0.097
3.804PheGly: 3.804 ± 0.108
0.878PheHis: 0.878 ± 0.043
3.701PheIle: 3.701 ± 0.114
3.218PheLys: 3.218 ± 0.088
5.268PheLeu: 5.268 ± 0.156
1.018PheMet: 1.018 ± 0.046
2.521PheAsn: 2.521 ± 0.089
1.772PhePro: 1.772 ± 0.061
1.929PheGln: 1.929 ± 0.061
1.555PheArg: 1.555 ± 0.058
4.335PheSer: 4.335 ± 0.103
2.428PheThr: 2.428 ± 0.074
3.463PheVal: 3.463 ± 0.096
0.615PheTrp: 0.615 ± 0.039
1.886PheTyr: 1.886 ± 0.074
0.0PheXaa: 0.0 ± 0.0
Gly
4.376GlyAla: 4.376 ± 0.13
0.226GlyCys: 0.226 ± 0.026
3.175GlyAsp: 3.175 ± 0.108
3.156GlyGlu: 3.156 ± 0.086
3.313GlyPhe: 3.313 ± 0.096
3.952GlyGly: 3.952 ± 0.096
1.355GlyHis: 1.355 ± 0.054
5.934GlyIle: 5.934 ± 0.136
5.028GlyLys: 5.028 ± 0.112
6.572GlyLeu: 6.572 ± 0.12
1.631GlyMet: 1.631 ± 0.061
3.039GlyAsn: 3.039 ± 0.089
1.61GlyPro: 1.61 ± 0.061
2.708GlyGln: 2.708 ± 0.089
2.774GlyArg: 2.774 ± 0.08
4.54GlySer: 4.54 ± 0.101
3.526GlyThr: 3.526 ± 0.096
4.343GlyVal: 4.343 ± 0.104
0.726GlyTrp: 0.726 ± 0.042
2.408GlyTyr: 2.408 ± 0.064
0.0GlyXaa: 0.0 ± 0.0
His
1.205HisAla: 1.205 ± 0.054
0.049HisCys: 0.049 ± 0.011
1.141HisAsp: 1.141 ± 0.056
1.071HisGlu: 1.071 ± 0.05
1.112HisPhe: 1.112 ± 0.048
1.357HisGly: 1.357 ± 0.048
0.489HisHis: 0.489 ± 0.033
1.219HisIle: 1.219 ± 0.048
1.034HisLys: 1.034 ± 0.045
1.945HisLeu: 1.945 ± 0.062
0.422HisMet: 0.422 ± 0.031
0.802HisAsn: 0.802 ± 0.041
0.931HisPro: 0.931 ± 0.047
0.796HisGln: 0.796 ± 0.039
0.866HisArg: 0.866 ± 0.045
1.151HisSer: 1.151 ± 0.043
0.802HisThr: 0.802 ± 0.045
1.123HisVal: 1.123 ± 0.048
0.255HisTrp: 0.255 ± 0.023
0.822HisTyr: 0.822 ± 0.049
0.0HisXaa: 0.0 ± 0.0
Ile
6.459IleAla: 6.459 ± 0.143
0.385IleCys: 0.385 ± 0.032
5.194IleAsp: 5.194 ± 0.119
4.487IleGlu: 4.487 ± 0.108
4.577IlePhe: 4.577 ± 0.144
5.967IleGly: 5.967 ± 0.138
1.402IleHis: 1.402 ± 0.048
6.341IleIle: 6.341 ± 0.153
5.681IleLys: 5.681 ± 0.118
7.657IleLeu: 7.657 ± 0.153
1.639IleMet: 1.639 ± 0.064
4.789IleAsn: 4.789 ± 0.115
2.953IlePro: 2.953 ± 0.074
2.494IleGln: 2.494 ± 0.072
2.718IleArg: 2.718 ± 0.084
6.541IleSer: 6.541 ± 0.149
4.067IleThr: 4.067 ± 0.102
5.823IleVal: 5.823 ± 0.113
0.788IleTrp: 0.788 ± 0.046
2.605IleTyr: 2.605 ± 0.074
0.0IleXaa: 0.0 ± 0.0
Lys
5.067LysAla: 5.067 ± 0.108
0.162LysCys: 0.162 ± 0.022
3.911LysAsp: 3.911 ± 0.091
4.824LysGlu: 4.824 ± 0.115
2.714LysPhe: 2.714 ± 0.072
3.629LysGly: 3.629 ± 0.087
1.351LysHis: 1.351 ± 0.05
7.15LysIle: 7.15 ± 0.129
7.602LysLys: 7.602 ± 0.145
6.343LysLeu: 6.343 ± 0.135
2.192LysMet: 2.192 ± 0.06
5.346LysAsn: 5.346 ± 0.126
1.949LysPro: 1.949 ± 0.068
3.348LysGln: 3.348 ± 0.088
3.14LysArg: 3.14 ± 0.082
4.557LysSer: 4.557 ± 0.106
4.423LysThr: 4.423 ± 0.111
4.431LysVal: 4.431 ± 0.105
0.596LysTrp: 0.596 ± 0.036
2.683LysTyr: 2.683 ± 0.081
0.0LysXaa: 0.0 ± 0.0
Leu
8.231LeuAla: 8.231 ± 0.152
0.29LeuCys: 0.29 ± 0.024
5.513LeuAsp: 5.513 ± 0.126
4.674LeuGlu: 4.674 ± 0.099
4.875LeuPhe: 4.875 ± 0.147
5.973LeuGly: 5.973 ± 0.109
1.61LeuHis: 1.61 ± 0.055
8.98LeuIle: 8.98 ± 0.193
7.602LeuLys: 7.602 ± 0.135
10.113LeuLeu: 10.113 ± 0.229
2.356LeuMet: 2.356 ± 0.073
5.194LeuAsn: 5.194 ± 0.111
4.026LeuPro: 4.026 ± 0.105
3.173LeuGln: 3.173 ± 0.083
3.584LeuArg: 3.584 ± 0.098
7.723LeuSer: 7.723 ± 0.151
6.072LeuThr: 6.072 ± 0.116
6.066LeuVal: 6.066 ± 0.133
0.798LeuTrp: 0.798 ± 0.048
2.63LeuTyr: 2.63 ± 0.083
0.0LeuXaa: 0.0 ± 0.0
Met
2.155MetAla: 2.155 ± 0.074
0.064MetCys: 0.064 ± 0.01
1.119MetAsp: 1.119 ± 0.049
1.067MetGlu: 1.067 ± 0.048
0.956MetPhe: 0.956 ± 0.051
1.328MetGly: 1.328 ± 0.054
0.372MetHis: 0.372 ± 0.03
2.064MetIle: 2.064 ± 0.068
1.785MetLys: 1.785 ± 0.059
2.04MetLeu: 2.04 ± 0.068
0.607MetMet: 0.607 ± 0.038
1.376MetAsn: 1.376 ± 0.054
1.028MetPro: 1.028 ± 0.052
0.855MetGln: 0.855 ± 0.047
0.87MetArg: 0.87 ± 0.04
1.661MetSer: 1.661 ± 0.059
1.746MetThr: 1.746 ± 0.056
1.497MetVal: 1.497 ± 0.059
0.142MetTrp: 0.142 ± 0.016
0.584MetTyr: 0.584 ± 0.033
0.0MetXaa: 0.0 ± 0.0
Asn
3.158AsnAla: 3.158 ± 0.094
0.226AsnCys: 0.226 ± 0.022
3.051AsnAsp: 3.051 ± 0.073
2.669AsnGlu: 2.669 ± 0.085
3.049AsnPhe: 3.049 ± 0.092
3.613AsnGly: 3.613 ± 0.1
1.193AsnHis: 1.193 ± 0.051
3.646AsnIle: 3.646 ± 0.081
3.786AsnLys: 3.786 ± 0.1
5.35AsnLeu: 5.35 ± 0.118
1.232AsnMet: 1.232 ± 0.049
3.004AsnAsn: 3.004 ± 0.096
2.301AsnPro: 2.301 ± 0.072
2.899AsnGln: 2.899 ± 0.099
2.336AsnArg: 2.336 ± 0.072
3.823AsnSer: 3.823 ± 0.108
2.272AsnThr: 2.272 ± 0.068
2.994AsnVal: 2.994 ± 0.076
0.794AsnTrp: 0.794 ± 0.047
2.213AsnTyr: 2.213 ± 0.074
0.0AsnXaa: 0.0 ± 0.0
Pro
2.317ProAla: 2.317 ± 0.077
0.086ProCys: 0.086 ± 0.014
2.169ProAsp: 2.169 ± 0.079
2.363ProGlu: 2.363 ± 0.074
1.713ProPhe: 1.713 ± 0.063
2.126ProGly: 2.126 ± 0.075
0.596ProHis: 0.596 ± 0.036
2.924ProIle: 2.924 ± 0.076
2.675ProLys: 2.675 ± 0.077
3.148ProLeu: 3.148 ± 0.091
0.74ProMet: 0.74 ± 0.036
1.851ProAsn: 1.851 ± 0.062
0.465ProPro: 0.465 ± 0.031
1.086ProGln: 1.086 ± 0.049
1.092ProArg: 1.092 ± 0.041
2.079ProSer: 2.079 ± 0.06
1.951ProThr: 1.951 ± 0.063
2.398ProVal: 2.398 ± 0.077
0.337ProTrp: 0.337 ± 0.029
1.164ProTyr: 1.164 ± 0.054
0.0ProXaa: 0.0 ± 0.0
Gln
3.559GlnAla: 3.559 ± 0.113
0.086GlnCys: 0.086 ± 0.014
1.67GlnAsp: 1.67 ± 0.063
2.219GlnGlu: 2.219 ± 0.064
1.68GlnPhe: 1.68 ± 0.057
2.025GlnGly: 2.025 ± 0.074
0.646GlnHis: 0.646 ± 0.036
3.65GlnIle: 3.65 ± 0.096
3.072GlnLys: 3.072 ± 0.088
4.481GlnLeu: 4.481 ± 0.11
1.073GlnMet: 1.073 ± 0.045
1.962GlnAsn: 1.962 ± 0.077
1.145GlnPro: 1.145 ± 0.045
1.902GlnGln: 1.902 ± 0.096
1.832GlnArg: 1.832 ± 0.068
2.552GlnSer: 2.552 ± 0.085
2.484GlnThr: 2.484 ± 0.073
2.539GlnVal: 2.539 ± 0.078
0.362GlnTrp: 0.362 ± 0.029
1.195GlnTyr: 1.195 ± 0.049
0.0GlnXaa: 0.0 ± 0.0
Arg
2.646ArgAla: 2.646 ± 0.088
0.121ArgCys: 0.121 ± 0.017
2.085ArgAsp: 2.085 ± 0.063
2.198ArgGlu: 2.198 ± 0.073
2.06ArgPhe: 2.06 ± 0.065
2.258ArgGly: 2.258 ± 0.083
0.812ArgHis: 0.812 ± 0.038
3.286ArgIle: 3.286 ± 0.074
2.77ArgLys: 2.77 ± 0.07
3.995ArgLeu: 3.995 ± 0.096
0.964ArgMet: 0.964 ± 0.05
1.933ArgAsn: 1.933 ± 0.077
1.446ArgPro: 1.446 ± 0.05
1.968ArgGln: 1.968 ± 0.065
1.916ArgArg: 1.916 ± 0.066
2.377ArgSer: 2.377 ± 0.07
1.929ArgThr: 1.929 ± 0.065
2.482ArgVal: 2.482 ± 0.079
0.323ArgTrp: 0.323 ± 0.026
1.423ArgTyr: 1.423 ± 0.061
0.0ArgXaa: 0.0 ± 0.0
Ser
5.056SerAla: 5.056 ± 0.134
0.204SerCys: 0.204 ± 0.024
4.195SerAsp: 4.195 ± 0.098
3.703SerGlu: 3.703 ± 0.086
3.851SerPhe: 3.851 ± 0.109
5.313SerGly: 5.313 ± 0.112
1.151SerHis: 1.151 ± 0.044
5.319SerIle: 5.319 ± 0.113
5.289SerLys: 5.289 ± 0.112
7.096SerLeu: 7.096 ± 0.144
1.577SerMet: 1.577 ± 0.059
3.609SerAsn: 3.609 ± 0.087
1.83SerPro: 1.83 ± 0.063
2.776SerGln: 2.776 ± 0.083
2.679SerArg: 2.679 ± 0.076
6.027SerSer: 6.027 ± 0.25
3.699SerThr: 3.699 ± 0.111
4.306SerVal: 4.306 ± 0.103
0.825SerTrp: 0.825 ± 0.045
2.393SerTyr: 2.393 ± 0.068
0.0SerXaa: 0.0 ± 0.0
Thr
4.316ThrAla: 4.316 ± 0.098
0.121ThrCys: 0.121 ± 0.018
3.074ThrAsp: 3.074 ± 0.09
2.733ThrGlu: 2.733 ± 0.075
2.492ThrPhe: 2.492 ± 0.074
3.95ThrGly: 3.95 ± 0.1
0.979ThrHis: 0.979 ± 0.049
4.795ThrIle: 4.795 ± 0.102
3.792ThrLys: 3.792 ± 0.097
4.902ThrLeu: 4.902 ± 0.12
1.026ThrMet: 1.026 ± 0.048
2.836ThrAsn: 2.836 ± 0.083
1.997ThrPro: 1.997 ± 0.068
1.585ThrGln: 1.585 ± 0.058
1.873ThrArg: 1.873 ± 0.056
3.687ThrSer: 3.687 ± 0.109
3.088ThrThr: 3.088 ± 0.1
3.888ThrVal: 3.888 ± 0.089
0.508ThrTrp: 0.508 ± 0.033
1.733ThrTyr: 1.733 ± 0.065
0.0ThrXaa: 0.0 ± 0.0
Val
5.079ValAla: 5.079 ± 0.105
0.286ValCys: 0.286 ± 0.025
4.022ValAsp: 4.022 ± 0.104
3.337ValGlu: 3.337 ± 0.094
3.333ValPhe: 3.333 ± 0.097
4.363ValGly: 4.363 ± 0.103
1.034ValHis: 1.034 ± 0.05
5.315ValIle: 5.315 ± 0.118
4.318ValLys: 4.318 ± 0.098
6.191ValLeu: 6.191 ± 0.123
1.456ValMet: 1.456 ± 0.056
3.099ValAsn: 3.099 ± 0.09
2.389ValPro: 2.389 ± 0.078
1.933ValGln: 1.933 ± 0.065
2.101ValArg: 2.101 ± 0.082
4.982ValSer: 4.982 ± 0.096
3.613ValThr: 3.613 ± 0.095
4.421ValVal: 4.421 ± 0.109
0.547ValTrp: 0.547 ± 0.038
2.064ValTyr: 2.064 ± 0.054
0.0ValXaa: 0.0 ± 0.0
Trp
0.617TrpAla: 0.617 ± 0.043
0.035TrpCys: 0.035 ± 0.009
0.475TrpAsp: 0.475 ± 0.036
0.376TrpGlu: 0.376 ± 0.031
0.545TrpPhe: 0.545 ± 0.035
0.629TrpGly: 0.629 ± 0.035
0.257TrpHis: 0.257 ± 0.026
0.931TrpIle: 0.931 ± 0.049
0.58TrpLys: 0.58 ± 0.035
1.256TrpLeu: 1.256 ± 0.049
0.273TrpMet: 0.273 ± 0.026
0.539TrpAsn: 0.539 ± 0.032
0.321TrpPro: 0.321 ± 0.026
0.623TrpGln: 0.623 ± 0.036
0.446TrpArg: 0.446 ± 0.032
0.674TrpSer: 0.674 ± 0.037
0.459TrpThr: 0.459 ± 0.032
0.541TrpVal: 0.541 ± 0.036
0.154TrpTrp: 0.154 ± 0.021
0.335TrpTyr: 0.335 ± 0.029
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.19TyrAla: 2.19 ± 0.071
0.156TyrCys: 0.156 ± 0.02
1.955TyrAsp: 1.955 ± 0.072
1.688TyrGlu: 1.688 ± 0.058
2.116TyrPhe: 2.116 ± 0.082
2.326TyrGly: 2.326 ± 0.071
0.814TyrHis: 0.814 ± 0.042
2.159TyrIle: 2.159 ± 0.065
2.015TyrLys: 2.015 ± 0.074
4.053TyrLeu: 4.053 ± 0.095
0.668TyrMet: 0.668 ± 0.033
1.524TyrAsn: 1.524 ± 0.056
1.304TyrPro: 1.304 ± 0.052
1.853TyrGln: 1.853 ± 0.068
1.649TyrArg: 1.649 ± 0.052
2.373TyrSer: 2.373 ± 0.073
1.738TyrThr: 1.738 ± 0.067
1.941TyrVal: 1.941 ± 0.062
0.409TyrTrp: 0.409 ± 0.029
1.289TyrTyr: 1.289 ± 0.07
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1682 proteins (486326 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski