Amino acid dipepetide frequency for Oenococcus alcoholitolerans

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.29AlaAla: 7.29 ± 0.189
0.338AlaCys: 0.338 ± 0.032
5.135AlaAsp: 5.135 ± 0.156
3.79AlaGlu: 3.79 ± 0.126
3.594AlaPhe: 3.594 ± 0.123
6.05AlaGly: 6.05 ± 0.144
1.314AlaHis: 1.314 ± 0.066
5.824AlaIle: 5.824 ± 0.163
5.29AlaLys: 5.29 ± 0.166
6.993AlaLeu: 6.993 ± 0.15
1.615AlaMet: 1.615 ± 0.078
3.152AlaAsn: 3.152 ± 0.11
1.986AlaPro: 1.986 ± 0.084
2.922AlaGln: 2.922 ± 0.101
2.946AlaArg: 2.946 ± 0.103
4.733AlaSer: 4.733 ± 0.142
3.176AlaThr: 3.176 ± 0.096
5.76AlaVal: 5.76 ± 0.172
0.649AlaTrp: 0.649 ± 0.046
2.486AlaTyr: 2.486 ± 0.104
0.003AlaXaa: 0.003 ± 0.003
Cys
0.24CysAla: 0.24 ± 0.03
0.051CysCys: 0.051 ± 0.013
0.189CysAsp: 0.189 ± 0.024
0.152CysGlu: 0.152 ± 0.023
0.294CysPhe: 0.294 ± 0.035
0.348CysGly: 0.348 ± 0.039
0.122CysHis: 0.122 ± 0.02
0.209CysIle: 0.209 ± 0.027
0.122CysLys: 0.122 ± 0.021
0.557CysLeu: 0.557 ± 0.042
0.125CysMet: 0.125 ± 0.02
0.132CysAsn: 0.132 ± 0.021
0.22CysPro: 0.22 ± 0.033
0.216CysGln: 0.216 ± 0.028
0.27CysArg: 0.27 ± 0.031
0.334CysSer: 0.334 ± 0.036
0.162CysThr: 0.162 ± 0.025
0.209CysVal: 0.209 ± 0.027
0.095CysTrp: 0.095 ± 0.018
0.155CysTyr: 0.155 ± 0.027
0.0CysXaa: 0.0 ± 0.0
Asp
3.594AspAla: 3.594 ± 0.129
0.22AspCys: 0.22 ± 0.027
4.098AspAsp: 4.098 ± 0.14
3.179AspGlu: 3.179 ± 0.117
3.77AspPhe: 3.77 ± 0.131
3.415AspGly: 3.415 ± 0.106
1.811AspHis: 1.811 ± 0.086
4.192AspIle: 4.192 ± 0.122
4.415AspLys: 4.415 ± 0.138
6.827AspLeu: 6.827 ± 0.166
1.101AspMet: 1.101 ± 0.061
2.811AspAsn: 2.811 ± 0.103
3.0AspPro: 3.0 ± 0.118
4.044AspGln: 4.044 ± 0.14
3.152AspArg: 3.152 ± 0.095
3.784AspSer: 3.784 ± 0.116
2.385AspThr: 2.385 ± 0.103
3.233AspVal: 3.233 ± 0.116
0.878AspTrp: 0.878 ± 0.052
2.74AspTyr: 2.74 ± 0.09
0.0AspXaa: 0.0 ± 0.0
Glu
3.517GluAla: 3.517 ± 0.116
0.152GluCys: 0.152 ± 0.024
2.922GluAsp: 2.922 ± 0.101
2.905GluGlu: 2.905 ± 0.121
2.169GluPhe: 2.169 ± 0.09
2.334GluGly: 2.334 ± 0.088
0.946GluHis: 0.946 ± 0.06
4.105GluIle: 4.105 ± 0.13
5.118GluLys: 5.118 ± 0.146
5.176GluLeu: 5.176 ± 0.14
1.304GluMet: 1.304 ± 0.073
3.544GluAsn: 3.544 ± 0.112
1.436GluPro: 1.436 ± 0.067
2.317GluGln: 2.317 ± 0.091
2.236GluArg: 2.236 ± 0.088
2.709GluSer: 2.709 ± 0.101
2.294GluThr: 2.294 ± 0.091
2.834GluVal: 2.834 ± 0.108
0.389GluTrp: 0.389 ± 0.035
1.821GluTyr: 1.821 ± 0.071
0.003GluXaa: 0.003 ± 0.003
Phe
3.882PheAla: 3.882 ± 0.113
0.351PheCys: 0.351 ± 0.033
3.557PheAsp: 3.557 ± 0.117
2.169PheGlu: 2.169 ± 0.093
3.142PhePhe: 3.142 ± 0.139
3.659PheGly: 3.659 ± 0.113
0.834PheHis: 0.834 ± 0.055
3.926PheIle: 3.926 ± 0.132
3.203PheLys: 3.203 ± 0.104
5.04PheLeu: 5.04 ± 0.164
1.203PheMet: 1.203 ± 0.065
2.473PheAsn: 2.473 ± 0.092
1.818PhePro: 1.818 ± 0.079
1.79PheGln: 1.79 ± 0.073
1.73PheArg: 1.73 ± 0.073
4.503PheSer: 4.503 ± 0.14
2.27PheThr: 2.27 ± 0.09
3.223PheVal: 3.223 ± 0.134
0.696PheTrp: 0.696 ± 0.049
2.128PheTyr: 2.128 ± 0.101
0.0PheXaa: 0.0 ± 0.0
Gly
4.405GlyAla: 4.405 ± 0.137
0.28GlyCys: 0.28 ± 0.033
3.49GlyAsp: 3.49 ± 0.116
2.611GlyGlu: 2.611 ± 0.095
3.361GlyPhe: 3.361 ± 0.115
4.165GlyGly: 4.165 ± 0.14
1.375GlyHis: 1.375 ± 0.071
5.608GlyIle: 5.608 ± 0.153
4.888GlyLys: 4.888 ± 0.137
6.513GlyLeu: 6.513 ± 0.147
1.669GlyMet: 1.669 ± 0.079
2.851GlyAsn: 2.851 ± 0.101
1.723GlyPro: 1.723 ± 0.075
3.091GlyGln: 3.091 ± 0.113
3.199GlyArg: 3.199 ± 0.113
4.628GlySer: 4.628 ± 0.118
3.392GlyThr: 3.392 ± 0.122
3.784GlyVal: 3.784 ± 0.132
0.787GlyTrp: 0.787 ± 0.053
2.527GlyTyr: 2.527 ± 0.096
0.0GlyXaa: 0.0 ± 0.0
His
1.145HisAla: 1.145 ± 0.064
0.071HisCys: 0.071 ± 0.017
1.122HisAsp: 1.122 ± 0.064
0.99HisGlu: 0.99 ± 0.063
1.132HisPhe: 1.132 ± 0.054
1.443HisGly: 1.443 ± 0.064
0.51HisHis: 0.51 ± 0.044
1.284HisIle: 1.284 ± 0.07
0.966HisLys: 0.966 ± 0.052
1.963HisLeu: 1.963 ± 0.09
0.361HisMet: 0.361 ± 0.034
0.841HisAsn: 0.841 ± 0.043
0.888HisPro: 0.888 ± 0.053
1.03HisGln: 1.03 ± 0.067
1.003HisArg: 1.003 ± 0.052
1.186HisSer: 1.186 ± 0.076
0.821HisThr: 0.821 ± 0.057
1.051HisVal: 1.051 ± 0.062
0.226HisTrp: 0.226 ± 0.027
0.72HisTyr: 0.72 ± 0.055
0.0HisXaa: 0.0 ± 0.0
Ile
5.946IleAla: 5.946 ± 0.161
0.412IleCys: 0.412 ± 0.037
5.807IleAsp: 5.807 ± 0.16
4.169IleGlu: 4.169 ± 0.124
4.642IlePhe: 4.642 ± 0.162
5.696IleGly: 5.696 ± 0.165
1.26IleHis: 1.26 ± 0.065
5.949IleIle: 5.949 ± 0.166
5.77IleLys: 5.77 ± 0.143
7.03IleLeu: 7.03 ± 0.175
1.649IleMet: 1.649 ± 0.079
4.044IleAsn: 4.044 ± 0.117
3.067IlePro: 3.067 ± 0.109
2.544IleGln: 2.544 ± 0.092
2.98IleArg: 2.98 ± 0.099
6.287IleSer: 6.287 ± 0.153
3.787IleThr: 3.787 ± 0.119
5.189IleVal: 5.189 ± 0.134
0.666IleTrp: 0.666 ± 0.043
2.544IleTyr: 2.544 ± 0.094
0.0IleXaa: 0.0 ± 0.0
Lys
4.888LysAla: 4.888 ± 0.138
0.209LysCys: 0.209 ± 0.026
4.513LysAsp: 4.513 ± 0.135
4.672LysGlu: 4.672 ± 0.143
2.652LysPhe: 2.652 ± 0.099
3.598LysGly: 3.598 ± 0.106
1.186LysHis: 1.186 ± 0.063
6.733LysIle: 6.733 ± 0.162
7.361LysLys: 7.361 ± 0.215
6.162LysLeu: 6.162 ± 0.144
2.03LysMet: 2.03 ± 0.081
5.358LysAsn: 5.358 ± 0.145
2.0LysPro: 2.0 ± 0.083
3.04LysGln: 3.04 ± 0.109
3.378LysArg: 3.378 ± 0.113
4.415LysSer: 4.415 ± 0.134
4.165LysThr: 4.165 ± 0.115
4.128LysVal: 4.128 ± 0.115
0.578LysTrp: 0.578 ± 0.042
2.422LysTyr: 2.422 ± 0.091
0.0LysXaa: 0.0 ± 0.0
Leu
8.273LeuAla: 8.273 ± 0.172
0.422LeuCys: 0.422 ± 0.036
5.78LeuAsp: 5.78 ± 0.151
4.395LeuGlu: 4.395 ± 0.128
5.03LeuPhe: 5.03 ± 0.151
5.453LeuGly: 5.453 ± 0.153
1.581LeuHis: 1.581 ± 0.091
8.361LeuIle: 8.361 ± 0.212
7.557LeuLys: 7.557 ± 0.154
9.5LeuLeu: 9.5 ± 0.229
2.263LeuMet: 2.263 ± 0.085
5.037LeuAsn: 5.037 ± 0.138
4.176LeuPro: 4.176 ± 0.117
3.355LeuGln: 3.355 ± 0.122
3.723LeuArg: 3.723 ± 0.106
7.534LeuSer: 7.534 ± 0.176
5.486LeuThr: 5.486 ± 0.135
6.246LeuVal: 6.246 ± 0.139
0.787LeuTrp: 0.787 ± 0.048
2.72LeuTyr: 2.72 ± 0.082
0.003LeuXaa: 0.003 ± 0.003
Met
2.051MetAla: 2.051 ± 0.085
0.057MetCys: 0.057 ± 0.014
1.125MetAsp: 1.125 ± 0.066
1.003MetGlu: 1.003 ± 0.068
0.905MetPhe: 0.905 ± 0.052
1.334MetGly: 1.334 ± 0.066
0.405MetHis: 0.405 ± 0.034
2.253MetIle: 2.253 ± 0.091
1.581MetLys: 1.581 ± 0.079
1.899MetLeu: 1.899 ± 0.076
0.578MetMet: 0.578 ± 0.054
1.263MetAsn: 1.263 ± 0.065
0.98MetPro: 0.98 ± 0.055
0.905MetGln: 0.905 ± 0.051
0.966MetArg: 0.966 ± 0.053
1.682MetSer: 1.682 ± 0.077
1.733MetThr: 1.733 ± 0.08
1.568MetVal: 1.568 ± 0.066
0.122MetTrp: 0.122 ± 0.023
0.588MetTyr: 0.588 ± 0.042
0.0MetXaa: 0.0 ± 0.0
Asn
3.203AsnAla: 3.203 ± 0.107
0.209AsnCys: 0.209 ± 0.027
3.375AsnAsp: 3.375 ± 0.107
2.622AsnGlu: 2.622 ± 0.094
2.72AsnPhe: 2.72 ± 0.108
3.703AsnGly: 3.703 ± 0.119
0.976AsnHis: 0.976 ± 0.055
3.73AsnIle: 3.73 ± 0.113
3.564AsnLys: 3.564 ± 0.126
4.648AsnLeu: 4.648 ± 0.124
1.024AsnMet: 1.024 ± 0.058
2.814AsnAsn: 2.814 ± 0.112
1.922AsnPro: 1.922 ± 0.083
2.247AsnGln: 2.247 ± 0.094
2.449AsnArg: 2.449 ± 0.102
3.628AsnSer: 3.628 ± 0.109
2.115AsnThr: 2.115 ± 0.089
2.75AsnVal: 2.75 ± 0.111
0.689AsnTrp: 0.689 ± 0.049
2.152AsnTyr: 2.152 ± 0.088
0.0AsnXaa: 0.0 ± 0.0
Pro
2.649ProAla: 2.649 ± 0.1
0.139ProCys: 0.139 ± 0.02
2.676ProAsp: 2.676 ± 0.093
2.513ProGlu: 2.513 ± 0.095
1.868ProPhe: 1.868 ± 0.075
2.064ProGly: 2.064 ± 0.072
0.568ProHis: 0.568 ± 0.045
2.622ProIle: 2.622 ± 0.086
2.52ProLys: 2.52 ± 0.077
3.338ProLeu: 3.338 ± 0.101
0.787ProMet: 0.787 ± 0.054
1.763ProAsn: 1.763 ± 0.071
0.591ProPro: 0.591 ± 0.044
1.324ProGln: 1.324 ± 0.062
1.294ProArg: 1.294 ± 0.066
2.226ProSer: 2.226 ± 0.089
1.811ProThr: 1.811 ± 0.069
2.564ProVal: 2.564 ± 0.092
0.395ProTrp: 0.395 ± 0.039
1.304ProTyr: 1.304 ± 0.057
0.0ProXaa: 0.0 ± 0.0
Gln
4.054GlnAla: 4.054 ± 0.133
0.098GlnCys: 0.098 ± 0.018
2.064GlnAsp: 2.064 ± 0.08
2.216GlnGlu: 2.216 ± 0.088
1.527GlnPhe: 1.527 ± 0.072
2.432GlnGly: 2.432 ± 0.094
0.76GlnHis: 0.76 ± 0.056
3.645GlnIle: 3.645 ± 0.104
3.53GlnLys: 3.53 ± 0.115
4.412GlnLeu: 4.412 ± 0.14
1.182GlnMet: 1.182 ± 0.069
2.155GlnAsn: 2.155 ± 0.094
1.236GlnPro: 1.236 ± 0.065
2.236GlnGln: 2.236 ± 0.11
2.263GlnArg: 2.263 ± 0.102
2.618GlnSer: 2.618 ± 0.095
2.476GlnThr: 2.476 ± 0.104
2.736GlnVal: 2.736 ± 0.103
0.392GlnTrp: 0.392 ± 0.037
1.311GlnTyr: 1.311 ± 0.067
0.0GlnXaa: 0.0 ± 0.0
Arg
2.844ArgAla: 2.844 ± 0.101
0.172ArgCys: 0.172 ± 0.023
2.084ArgAsp: 2.084 ± 0.076
2.095ArgGlu: 2.095 ± 0.076
2.361ArgPhe: 2.361 ± 0.096
2.358ArgGly: 2.358 ± 0.094
0.818ArgHis: 0.818 ± 0.05
3.524ArgIle: 3.524 ± 0.112
3.003ArgLys: 3.003 ± 0.096
4.453ArgLeu: 4.453 ± 0.139
1.101ArgMet: 1.101 ± 0.071
2.017ArgAsn: 2.017 ± 0.082
1.882ArgPro: 1.882 ± 0.083
2.493ArgGln: 2.493 ± 0.1
2.442ArgArg: 2.442 ± 0.095
3.213ArgSer: 3.213 ± 0.112
2.105ArgThr: 2.105 ± 0.073
2.679ArgVal: 2.679 ± 0.094
0.361ArgTrp: 0.361 ± 0.034
1.74ArgTyr: 1.74 ± 0.084
0.0ArgXaa: 0.0 ± 0.0
Ser
5.192SerAla: 5.192 ± 0.128
0.27SerCys: 0.27 ± 0.029
4.716SerAsp: 4.716 ± 0.139
3.219SerGlu: 3.219 ± 0.098
4.047SerPhe: 4.047 ± 0.137
5.3SerGly: 5.3 ± 0.133
1.287SerHis: 1.287 ± 0.059
5.013SerIle: 5.013 ± 0.127
4.736SerLys: 4.736 ± 0.127
7.348SerLeu: 7.348 ± 0.162
1.622SerMet: 1.622 ± 0.078
3.149SerAsn: 3.149 ± 0.09
2.047SerPro: 2.047 ± 0.08
3.118SerGln: 3.118 ± 0.101
3.152SerArg: 3.152 ± 0.113
5.476SerSer: 5.476 ± 0.211
3.152SerThr: 3.152 ± 0.105
4.246SerVal: 4.246 ± 0.121
0.865SerTrp: 0.865 ± 0.058
2.25SerTyr: 2.25 ± 0.088
0.0SerXaa: 0.0 ± 0.0
Thr
4.338ThrAla: 4.338 ± 0.126
0.166ThrCys: 0.166 ± 0.022
3.23ThrAsp: 3.23 ± 0.104
2.415ThrGlu: 2.415 ± 0.088
2.463ThrPhe: 2.463 ± 0.091
3.946ThrGly: 3.946 ± 0.113
0.899ThrHis: 0.899 ± 0.055
4.186ThrIle: 4.186 ± 0.122
3.311ThrLys: 3.311 ± 0.112
4.449ThrLeu: 4.449 ± 0.136
1.041ThrMet: 1.041 ± 0.057
2.378ThrAsn: 2.378 ± 0.098
1.878ThrPro: 1.878 ± 0.071
1.571ThrGln: 1.571 ± 0.079
1.699ThrArg: 1.699 ± 0.083
3.317ThrSer: 3.317 ± 0.113
2.645ThrThr: 2.645 ± 0.094
3.557ThrVal: 3.557 ± 0.119
0.466ThrTrp: 0.466 ± 0.037
1.493ThrTyr: 1.493 ± 0.08
0.0ThrXaa: 0.0 ± 0.0
Val
4.703ValAla: 4.703 ± 0.135
0.328ValCys: 0.328 ± 0.033
4.101ValAsp: 4.101 ± 0.134
3.182ValGlu: 3.182 ± 0.097
3.236ValPhe: 3.236 ± 0.126
4.226ValGly: 4.226 ± 0.147
0.976ValHis: 0.976 ± 0.053
5.189ValIle: 5.189 ± 0.132
4.081ValLys: 4.081 ± 0.13
5.986ValLeu: 5.986 ± 0.134
1.392ValMet: 1.392 ± 0.063
2.801ValAsn: 2.801 ± 0.094
2.341ValPro: 2.341 ± 0.093
2.182ValGln: 2.182 ± 0.101
2.328ValArg: 2.328 ± 0.096
4.841ValSer: 4.841 ± 0.128
3.338ValThr: 3.338 ± 0.122
4.027ValVal: 4.027 ± 0.126
0.682ValTrp: 0.682 ± 0.045
2.128ValTyr: 2.128 ± 0.1
0.0ValXaa: 0.0 ± 0.0
Trp
0.574TrpAla: 0.574 ± 0.037
0.054TrpCys: 0.054 ± 0.012
0.52TrpAsp: 0.52 ± 0.044
0.436TrpGlu: 0.436 ± 0.037
0.534TrpPhe: 0.534 ± 0.045
0.547TrpGly: 0.547 ± 0.047
0.264TrpHis: 0.264 ± 0.03
0.946TrpIle: 0.946 ± 0.069
0.588TrpLys: 0.588 ± 0.043
1.253TrpLeu: 1.253 ± 0.075
0.26TrpMet: 0.26 ± 0.029
0.483TrpAsn: 0.483 ± 0.042
0.334TrpPro: 0.334 ± 0.035
0.686TrpGln: 0.686 ± 0.052
0.534TrpArg: 0.534 ± 0.044
0.706TrpSer: 0.706 ± 0.05
0.493TrpThr: 0.493 ± 0.052
0.5TrpVal: 0.5 ± 0.042
0.145TrpTrp: 0.145 ± 0.02
0.351TrpTyr: 0.351 ± 0.035
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.26TyrAla: 2.26 ± 0.085
0.203TyrCys: 0.203 ± 0.026
2.145TyrAsp: 2.145 ± 0.089
1.497TyrGlu: 1.497 ± 0.071
2.149TyrPhe: 2.149 ± 0.093
2.274TyrGly: 2.274 ± 0.094
0.861TyrHis: 0.861 ± 0.054
2.138TyrIle: 2.138 ± 0.087
1.885TyrLys: 1.885 ± 0.092
4.135TyrLeu: 4.135 ± 0.121
0.693TyrMet: 0.693 ± 0.051
1.476TyrAsn: 1.476 ± 0.079
1.49TyrPro: 1.49 ± 0.065
2.233TyrGln: 2.233 ± 0.081
1.997TyrArg: 1.997 ± 0.081
2.382TyrSer: 2.382 ± 0.09
1.642TyrThr: 1.642 ± 0.061
1.78TyrVal: 1.78 ± 0.078
0.341TyrTrp: 0.341 ± 0.037
1.392TyrTyr: 1.392 ± 0.067
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.003XaaArg: 0.003 ± 0.003
0.003XaaSer: 0.003 ± 0.003
0.0XaaThr: 0.0 ± 0.0
0.003XaaVal: 0.003 ± 0.003
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1597 proteins (296011 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski