Amino acid dipepetide frequency for Furfurilactobacillus siliginis

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.899AlaAla: 9.899 ± 0.404
0.319AlaCys: 0.319 ± 0.028
6.259AlaAsp: 6.259 ± 0.131
4.751AlaGlu: 4.751 ± 0.108
3.541AlaPhe: 3.541 ± 0.099
6.866AlaGly: 6.866 ± 0.14
1.923AlaHis: 1.923 ± 0.064
6.024AlaIle: 6.024 ± 0.133
5.327AlaLys: 5.327 ± 0.108
8.705AlaLeu: 8.705 ± 0.137
2.466AlaMet: 2.466 ± 0.062
4.196AlaAsn: 4.196 ± 0.173
2.737AlaPro: 2.737 ± 0.089
4.274AlaGln: 4.274 ± 0.157
3.322AlaArg: 3.322 ± 0.081
5.325AlaSer: 5.325 ± 0.345
6.717AlaThr: 6.717 ± 0.2
6.831AlaVal: 6.831 ± 0.141
0.888AlaTrp: 0.888 ± 0.046
2.725AlaTyr: 2.725 ± 0.068
0.0AlaXaa: 0.0 ± 0.0
Cys
0.296CysAla: 0.296 ± 0.023
0.035CysCys: 0.035 ± 0.008
0.194CysAsp: 0.194 ± 0.017
0.189CysGlu: 0.189 ± 0.018
0.196CysPhe: 0.196 ± 0.02
0.373CysGly: 0.373 ± 0.028
0.107CysHis: 0.107 ± 0.016
0.219CysIle: 0.219 ± 0.017
0.107CysLys: 0.107 ± 0.015
0.443CysLeu: 0.443 ± 0.029
0.096CysMet: 0.096 ± 0.013
0.105CysAsn: 0.105 ± 0.013
0.152CysPro: 0.152 ± 0.019
0.145CysGln: 0.145 ± 0.017
0.121CysArg: 0.121 ± 0.015
0.175CysSer: 0.175 ± 0.016
0.191CysThr: 0.191 ± 0.02
0.247CysVal: 0.247 ± 0.022
0.061CysTrp: 0.061 ± 0.01
0.152CysTyr: 0.152 ± 0.017
0.0CysXaa: 0.0 ± 0.0
Asp
5.719AspAla: 5.719 ± 0.123
0.184AspCys: 0.184 ± 0.019
4.085AspAsp: 4.085 ± 0.125
4.147AspGlu: 4.147 ± 0.108
2.937AspPhe: 2.937 ± 0.089
4.334AspGly: 4.334 ± 0.118
1.615AspHis: 1.615 ± 0.051
3.29AspIle: 3.29 ± 0.094
3.099AspLys: 3.099 ± 0.09
5.576AspLeu: 5.576 ± 0.117
1.604AspMet: 1.604 ± 0.053
2.488AspAsn: 2.488 ± 0.084
2.476AspPro: 2.476 ± 0.085
3.101AspGln: 3.101 ± 0.082
2.574AspArg: 2.574 ± 0.068
2.912AspSer: 2.912 ± 0.082
3.075AspThr: 3.075 ± 0.078
5.047AspVal: 5.047 ± 0.107
0.821AspTrp: 0.821 ± 0.046
2.311AspTyr: 2.311 ± 0.07
0.0AspXaa: 0.0 ± 0.0
Glu
4.675GluAla: 4.675 ± 0.099
0.147GluCys: 0.147 ± 0.014
2.776GluAsp: 2.776 ± 0.078
2.783GluGlu: 2.783 ± 0.092
1.858GluPhe: 1.858 ± 0.062
2.401GluGly: 2.401 ± 0.069
1.378GluHis: 1.378 ± 0.053
3.268GluIle: 3.268 ± 0.086
2.959GluLys: 2.959 ± 0.085
5.567GluLeu: 5.567 ± 0.113
1.564GluMet: 1.564 ± 0.058
2.462GluAsn: 2.462 ± 0.072
1.846GluPro: 1.846 ± 0.064
3.255GluGln: 3.255 ± 0.089
2.77GluArg: 2.77 ± 0.091
2.446GluSer: 2.446 ± 0.077
3.472GluThr: 3.472 ± 0.084
3.451GluVal: 3.451 ± 0.098
0.485GluTrp: 0.485 ± 0.029
1.432GluTyr: 1.432 ± 0.048
0.0GluXaa: 0.0 ± 0.0
Phe
3.646PheAla: 3.646 ± 0.095
0.205PheCys: 0.205 ± 0.021
3.052PheAsp: 3.052 ± 0.079
2.07PheGlu: 2.07 ± 0.054
1.853PhePhe: 1.853 ± 0.078
3.406PheGly: 3.406 ± 0.087
0.842PheHis: 0.842 ± 0.04
2.607PheIle: 2.607 ± 0.084
2.119PheLys: 2.119 ± 0.067
3.551PheLeu: 3.551 ± 0.099
1.101PheMet: 1.101 ± 0.044
2.058PheAsn: 2.058 ± 0.063
1.45PhePro: 1.45 ± 0.058
1.266PheGln: 1.266 ± 0.051
1.45PheArg: 1.45 ± 0.06
2.762PheSer: 2.762 ± 0.078
2.858PheThr: 2.858 ± 0.076
3.205PheVal: 3.205 ± 0.086
0.518PheTrp: 0.518 ± 0.034
1.464PheTyr: 1.464 ± 0.055
0.0PheXaa: 0.0 ± 0.0
Gly
5.474GlyAla: 5.474 ± 0.115
0.27GlyCys: 0.27 ± 0.023
3.865GlyAsp: 3.865 ± 0.094
3.268GlyGlu: 3.268 ± 0.08
3.122GlyPhe: 3.122 ± 0.085
4.352GlyGly: 4.352 ± 0.113
1.704GlyHis: 1.704 ± 0.056
4.968GlyIle: 4.968 ± 0.107
4.11GlyLys: 4.11 ± 0.107
7.134GlyLeu: 7.134 ± 0.125
1.982GlyMet: 1.982 ± 0.065
2.872GlyAsn: 2.872 ± 0.076
1.641GlyPro: 1.641 ± 0.049
3.22GlyGln: 3.22 ± 0.085
2.942GlyArg: 2.942 ± 0.079
3.951GlySer: 3.951 ± 0.084
4.562GlyThr: 4.562 ± 0.113
5.154GlyVal: 5.154 ± 0.112
0.961GlyTrp: 0.961 ± 0.067
2.45GlyTyr: 2.45 ± 0.073
0.002GlyXaa: 0.002 ± 0.002
His
1.748HisAla: 1.748 ± 0.062
0.105HisCys: 0.105 ± 0.015
1.539HisAsp: 1.539 ± 0.052
1.362HisGlu: 1.362 ± 0.053
1.143HisPhe: 1.143 ± 0.044
1.702HisGly: 1.702 ± 0.058
0.727HisHis: 0.727 ± 0.039
1.387HisIle: 1.387 ± 0.046
0.937HisLys: 0.937 ± 0.041
2.252HisLeu: 2.252 ± 0.071
0.601HisMet: 0.601 ± 0.035
0.918HisAsn: 0.918 ± 0.041
1.133HisPro: 1.133 ± 0.041
1.254HisGln: 1.254 ± 0.05
1.01HisArg: 1.01 ± 0.041
1.114HisSer: 1.114 ± 0.045
1.235HisThr: 1.235 ± 0.048
1.811HisVal: 1.811 ± 0.058
0.282HisTrp: 0.282 ± 0.023
0.981HisTyr: 0.981 ± 0.046
0.0HisXaa: 0.0 ± 0.0
Ile
5.91IleAla: 5.91 ± 0.129
0.362IleCys: 0.362 ± 0.027
4.206IleAsp: 4.206 ± 0.091
3.212IleGlu: 3.212 ± 0.083
2.471IlePhe: 2.471 ± 0.089
4.852IleGly: 4.852 ± 0.118
1.329IleHis: 1.329 ± 0.052
4.232IleIle: 4.232 ± 0.105
3.311IleLys: 3.311 ± 0.082
5.267IleLeu: 5.267 ± 0.12
1.62IleMet: 1.62 ± 0.066
3.178IleAsn: 3.178 ± 0.072
2.441IlePro: 2.441 ± 0.067
2.166IleGln: 2.166 ± 0.062
2.389IleArg: 2.389 ± 0.071
4.047IleSer: 4.047 ± 0.088
4.453IleThr: 4.453 ± 0.143
4.858IleVal: 4.858 ± 0.107
0.597IleTrp: 0.597 ± 0.033
1.706IleTyr: 1.706 ± 0.059
0.0IleXaa: 0.0 ± 0.0
Lys
4.469LysAla: 4.469 ± 0.123
0.077LysCys: 0.077 ± 0.013
3.063LysAsp: 3.063 ± 0.086
2.802LysGlu: 2.802 ± 0.082
1.501LysPhe: 1.501 ± 0.053
2.919LysGly: 2.919 ± 0.084
1.145LysHis: 1.145 ± 0.048
3.063LysIle: 3.063 ± 0.079
2.931LysLys: 2.931 ± 0.092
4.844LysLeu: 4.844 ± 0.096
1.926LysMet: 1.926 ± 0.063
2.462LysAsn: 2.462 ± 0.067
2.182LysPro: 2.182 ± 0.08
3.42LysGln: 3.42 ± 0.082
2.881LysArg: 2.881 ± 0.078
2.681LysSer: 2.681 ± 0.082
3.644LysThr: 3.644 ± 0.096
3.772LysVal: 3.772 ± 0.095
0.588LysTrp: 0.588 ± 0.036
1.737LysTyr: 1.737 ± 0.066
0.002LysXaa: 0.002 ± 0.002
Leu
10.127LeuAla: 10.127 ± 0.186
0.412LeuCys: 0.412 ± 0.028
5.323LeuAsp: 5.323 ± 0.101
3.954LeuGlu: 3.954 ± 0.098
4.11LeuPhe: 4.11 ± 0.105
6.663LeuGly: 6.663 ± 0.155
2.052LeuHis: 2.052 ± 0.069
6.273LeuIle: 6.273 ± 0.15
4.826LeuLys: 4.826 ± 0.108
9.854LeuLeu: 9.854 ± 0.21
2.635LeuMet: 2.635 ± 0.067
4.504LeuAsn: 4.504 ± 0.102
4.273LeuPro: 4.273 ± 0.105
4.38LeuGln: 4.38 ± 0.1
4.094LeuArg: 4.094 ± 0.111
6.46LeuSer: 6.46 ± 0.136
7.864LeuThr: 7.864 ± 0.144
7.162LeuVal: 7.162 ± 0.143
0.965LeuTrp: 0.965 ± 0.044
2.432LeuTyr: 2.432 ± 0.077
0.0LeuXaa: 0.0 ± 0.0
Met
2.741MetAla: 2.741 ± 0.074
0.102MetCys: 0.102 ± 0.013
1.436MetAsp: 1.436 ± 0.058
1.135MetGlu: 1.135 ± 0.046
0.972MetPhe: 0.972 ± 0.051
1.679MetGly: 1.679 ± 0.059
0.559MetHis: 0.559 ± 0.031
1.923MetIle: 1.923 ± 0.056
1.622MetLys: 1.622 ± 0.051
2.508MetLeu: 2.508 ± 0.073
0.858MetMet: 0.858 ± 0.042
1.352MetAsn: 1.352 ± 0.044
1.098MetPro: 1.098 ± 0.049
1.376MetGln: 1.376 ± 0.062
1.145MetArg: 1.145 ± 0.051
1.716MetSer: 1.716 ± 0.056
2.175MetThr: 2.175 ± 0.068
1.772MetVal: 1.772 ± 0.063
0.271MetTrp: 0.271 ± 0.025
0.646MetTyr: 0.646 ± 0.031
0.0MetXaa: 0.0 ± 0.0
Asn
3.863AsnAla: 3.863 ± 0.1
0.137AsnCys: 0.137 ± 0.016
2.994AsnAsp: 2.994 ± 0.075
2.553AsnGlu: 2.553 ± 0.075
1.772AsnPhe: 1.772 ± 0.057
3.516AsnGly: 3.516 ± 0.096
1.166AsnHis: 1.166 ± 0.047
2.469AsnIle: 2.469 ± 0.059
2.276AsnLys: 2.276 ± 0.081
3.942AsnLeu: 3.942 ± 0.101
1.173AsnMet: 1.173 ± 0.046
2.115AsnAsn: 2.115 ± 0.068
1.991AsnPro: 1.991 ± 0.073
2.238AsnGln: 2.238 ± 0.072
2.101AsnArg: 2.101 ± 0.067
2.252AsnSer: 2.252 ± 0.097
2.311AsnThr: 2.311 ± 0.067
3.529AsnVal: 3.529 ± 0.084
0.66AsnTrp: 0.66 ± 0.04
1.707AsnTyr: 1.707 ± 0.082
0.0AsnXaa: 0.0 ± 0.0
Pro
3.63ProAla: 3.63 ± 0.1
0.093ProCys: 0.093 ± 0.012
2.576ProAsp: 2.576 ± 0.074
2.606ProGlu: 2.606 ± 0.073
1.711ProPhe: 1.711 ± 0.054
2.329ProGly: 2.329 ± 0.074
0.788ProHis: 0.788 ± 0.039
2.229ProIle: 2.229 ± 0.059
1.898ProLys: 1.898 ± 0.061
3.583ProLeu: 3.583 ± 0.088
0.853ProMet: 0.853 ± 0.044
1.644ProAsn: 1.644 ± 0.071
0.602ProPro: 0.602 ± 0.033
1.576ProGln: 1.576 ± 0.054
1.284ProArg: 1.284 ± 0.048
2.028ProSer: 2.028 ± 0.068
2.888ProThr: 2.888 ± 0.109
3.245ProVal: 3.245 ± 0.093
0.361ProTrp: 0.361 ± 0.032
1.2ProTyr: 1.2 ± 0.046
0.0ProXaa: 0.0 ± 0.0
Gln
4.817GlnAla: 4.817 ± 0.219
0.11GlnCys: 0.11 ± 0.016
2.122GlnAsp: 2.122 ± 0.068
2.075GlnGlu: 2.075 ± 0.062
1.863GlnPhe: 1.863 ± 0.059
2.394GlnGly: 2.394 ± 0.071
1.233GlnHis: 1.233 ± 0.051
2.73GlnIle: 2.73 ± 0.081
2.348GlnLys: 2.348 ± 0.068
5.866GlnLeu: 5.866 ± 0.135
1.268GlnMet: 1.268 ± 0.05
1.996GlnAsn: 1.996 ± 0.062
2.028GlnPro: 2.028 ± 0.062
3.453GlnGln: 3.453 ± 0.121
2.539GlnArg: 2.539 ± 0.074
2.609GlnSer: 2.609 ± 0.116
3.453GlnThr: 3.453 ± 0.091
3.474GlnVal: 3.474 ± 0.082
0.595GlnTrp: 0.595 ± 0.035
1.446GlnTyr: 1.446 ± 0.048
0.0GlnXaa: 0.0 ± 0.0
Arg
3.275ArgAla: 3.275 ± 0.098
0.135ArgCys: 0.135 ± 0.015
2.52ArgAsp: 2.52 ± 0.069
2.518ArgGlu: 2.518 ± 0.079
2.063ArgPhe: 2.063 ± 0.067
2.452ArgGly: 2.452 ± 0.084
1.245ArgHis: 1.245 ± 0.05
2.585ArgIle: 2.585 ± 0.078
2.259ArgLys: 2.259 ± 0.073
4.493ArgLeu: 4.493 ± 0.115
1.205ArgMet: 1.205 ± 0.048
1.849ArgAsn: 1.849 ± 0.055
1.639ArgPro: 1.639 ± 0.057
2.488ArgGln: 2.488 ± 0.08
2.436ArgArg: 2.436 ± 0.085
2.133ArgSer: 2.133 ± 0.065
2.332ArgThr: 2.332 ± 0.071
3.056ArgVal: 3.056 ± 0.086
0.494ArgTrp: 0.494 ± 0.034
1.615ArgTyr: 1.615 ± 0.055
0.0ArgXaa: 0.0 ± 0.0
Ser
5.325SerAla: 5.325 ± 0.315
0.175SerCys: 0.175 ± 0.016
3.557SerAsp: 3.557 ± 0.077
3.021SerGlu: 3.021 ± 0.076
2.6SerPhe: 2.6 ± 0.079
4.371SerGly: 4.371 ± 0.104
1.292SerHis: 1.292 ± 0.047
3.325SerIle: 3.325 ± 0.082
2.833SerLys: 2.833 ± 0.088
5.866SerLeu: 5.866 ± 0.178
1.574SerMet: 1.574 ± 0.063
2.32SerAsn: 2.32 ± 0.073
1.69SerPro: 1.69 ± 0.058
2.725SerGln: 2.725 ± 0.152
2.467SerArg: 2.467 ± 0.066
3.31SerSer: 3.31 ± 0.1
3.529SerThr: 3.529 ± 0.123
4.371SerVal: 4.371 ± 0.097
0.753SerTrp: 0.753 ± 0.039
1.833SerTyr: 1.833 ± 0.056
0.0SerXaa: 0.0 ± 0.0
Thr
6.553ThrAla: 6.553 ± 0.189
0.2ThrCys: 0.2 ± 0.02
4.406ThrAsp: 4.406 ± 0.118
3.136ThrGlu: 3.136 ± 0.073
2.793ThrPhe: 2.793 ± 0.073
5.173ThrGly: 5.173 ± 0.124
1.466ThrHis: 1.466 ± 0.052
4.479ThrIle: 4.479 ± 0.119
3.569ThrLys: 3.569 ± 0.109
6.595ThrLeu: 6.595 ± 0.137
1.592ThrMet: 1.592 ± 0.055
3.096ThrAsn: 3.096 ± 0.087
3.187ThrPro: 3.187 ± 0.121
2.569ThrGln: 2.569 ± 0.075
2.276ThrArg: 2.276 ± 0.061
3.886ThrSer: 3.886 ± 0.179
4.999ThrThr: 4.999 ± 0.156
5.616ThrVal: 5.616 ± 0.149
0.644ThrTrp: 0.644 ± 0.037
2.164ThrTyr: 2.164 ± 0.074
0.002ThrXaa: 0.002 ± 0.002
Val
7.579ValAla: 7.579 ± 0.172
0.329ValCys: 0.329 ± 0.026
4.723ValAsp: 4.723 ± 0.098
3.444ValGlu: 3.444 ± 0.093
2.814ValPhe: 2.814 ± 0.078
5.281ValGly: 5.281 ± 0.112
1.471ValHis: 1.471 ± 0.051
5.227ValIle: 5.227 ± 0.119
3.931ValLys: 3.931 ± 0.103
7.19ValLeu: 7.19 ± 0.141
1.951ValMet: 1.951 ± 0.062
3.441ValAsn: 3.441 ± 0.097
2.986ValPro: 2.986 ± 0.092
2.679ValGln: 2.679 ± 0.078
2.711ValArg: 2.711 ± 0.079
4.854ValSer: 4.854 ± 0.108
6.125ValThr: 6.125 ± 0.167
6.045ValVal: 6.045 ± 0.121
0.7ValTrp: 0.7 ± 0.039
2.247ValTyr: 2.247 ± 0.069
0.0ValXaa: 0.0 ± 0.0
Trp
0.714TrpAla: 0.714 ± 0.042
0.06TrpCys: 0.06 ± 0.01
0.562TrpAsp: 0.562 ± 0.036
0.382TrpGlu: 0.382 ± 0.024
0.576TrpPhe: 0.576 ± 0.036
0.685TrpGly: 0.685 ± 0.044
0.317TrpHis: 0.317 ± 0.024
0.657TrpIle: 0.657 ± 0.041
0.366TrpLys: 0.366 ± 0.028
1.629TrpLeu: 1.629 ± 0.057
0.254TrpMet: 0.254 ± 0.02
0.461TrpAsn: 0.461 ± 0.03
0.313TrpPro: 0.313 ± 0.025
0.87TrpGln: 0.87 ± 0.046
0.658TrpArg: 0.658 ± 0.032
0.692TrpSer: 0.692 ± 0.046
0.615TrpThr: 0.615 ± 0.033
0.765TrpVal: 0.765 ± 0.05
0.266TrpTrp: 0.266 ± 0.027
0.501TrpTyr: 0.501 ± 0.058
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.634TyrAla: 2.634 ± 0.058
0.151TyrCys: 0.151 ± 0.016
2.138TyrAsp: 2.138 ± 0.083
1.59TyrGlu: 1.59 ± 0.052
1.587TyrPhe: 1.587 ± 0.063
2.254TyrGly: 2.254 ± 0.07
0.879TyrHis: 0.879 ± 0.047
1.653TyrIle: 1.653 ± 0.061
1.278TyrLys: 1.278 ± 0.048
3.497TyrLeu: 3.497 ± 0.083
0.76TyrMet: 0.76 ± 0.04
1.252TyrAsn: 1.252 ± 0.05
1.257TyrPro: 1.257 ± 0.047
1.965TyrGln: 1.965 ± 0.067
1.62TyrArg: 1.62 ± 0.056
1.585TyrSer: 1.585 ± 0.057
1.902TyrThr: 1.902 ± 0.093
2.32TyrVal: 2.32 ± 0.063
0.429TyrTrp: 0.429 ± 0.031
1.164TyrTyr: 1.164 ± 0.06
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.002XaaHis: 0.002 ± 0.002
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.002XaaLeu: 0.002 ± 0.002
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.002XaaSer: 0.002 ± 0.002
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.594XaaXaa: 0.594 ± 0.4
Statistics based on 1922 proteins (571064 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski