Amino acid dipepetide frequency for Furfurilactobacillus rossiae DSM 15814

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.526AlaAla: 8.526 ± 0.294
0.317AlaCys: 0.317 ± 0.021
5.559AlaAsp: 5.559 ± 0.099
4.222AlaGlu: 4.222 ± 0.086
3.583AlaPhe: 3.583 ± 0.087
6.382AlaGly: 6.382 ± 0.105
1.866AlaHis: 1.866 ± 0.048
5.936AlaIle: 5.936 ± 0.087
5.067AlaLys: 5.067 ± 0.094
8.118AlaLeu: 8.118 ± 0.114
2.364AlaMet: 2.364 ± 0.055
4.393AlaAsn: 4.393 ± 0.118
2.525AlaPro: 2.525 ± 0.07
3.789AlaGln: 3.789 ± 0.103
3.011AlaArg: 3.011 ± 0.064
5.501AlaSer: 5.501 ± 0.275
6.167AlaThr: 6.167 ± 0.181
6.335AlaVal: 6.335 ± 0.096
0.77AlaTrp: 0.77 ± 0.034
2.539AlaTyr: 2.539 ± 0.06
0.0AlaXaa: 0.0 ± 0.0
Cys
0.308CysAla: 0.308 ± 0.02
0.035CysCys: 0.035 ± 0.007
0.25CysAsp: 0.25 ± 0.017
0.221CysGlu: 0.221 ± 0.016
0.2CysPhe: 0.2 ± 0.016
0.446CysGly: 0.446 ± 0.028
0.119CysHis: 0.119 ± 0.013
0.234CysIle: 0.234 ± 0.019
0.134CysLys: 0.134 ± 0.014
0.484CysLeu: 0.484 ± 0.023
0.091CysMet: 0.091 ± 0.01
0.134CysAsn: 0.134 ± 0.013
0.196CysPro: 0.196 ± 0.016
0.174CysGln: 0.174 ± 0.015
0.167CysArg: 0.167 ± 0.015
0.215CysSer: 0.215 ± 0.016
0.2CysThr: 0.2 ± 0.016
0.27CysVal: 0.27 ± 0.017
0.05CysTrp: 0.05 ± 0.009
0.145CysTyr: 0.145 ± 0.014
0.0CysXaa: 0.0 ± 0.0
Asp
5.321AspAla: 5.321 ± 0.105
0.232AspCys: 0.232 ± 0.018
4.095AspAsp: 4.095 ± 0.1
4.076AspGlu: 4.076 ± 0.097
2.645AspPhe: 2.645 ± 0.066
4.219AspGly: 4.219 ± 0.114
1.719AspHis: 1.719 ± 0.05
3.453AspIle: 3.453 ± 0.074
3.282AspLys: 3.282 ± 0.068
5.267AspLeu: 5.267 ± 0.086
1.523AspMet: 1.523 ± 0.047
2.673AspAsn: 2.673 ± 0.063
2.474AspPro: 2.474 ± 0.083
3.133AspGln: 3.133 ± 0.068
2.621AspArg: 2.621 ± 0.062
3.39AspSer: 3.39 ± 0.091
3.159AspThr: 3.159 ± 0.073
4.536AspVal: 4.536 ± 0.085
0.794AspTrp: 0.794 ± 0.033
2.449AspTyr: 2.449 ± 0.067
0.0AspXaa: 0.0 ± 0.0
Glu
4.161GluAla: 4.161 ± 0.091
0.14GluCys: 0.14 ± 0.013
2.816GluAsp: 2.816 ± 0.072
2.47GluGlu: 2.47 ± 0.074
1.803GluPhe: 1.803 ± 0.052
2.438GluGly: 2.438 ± 0.058
1.32GluHis: 1.32 ± 0.039
3.215GluIle: 3.215 ± 0.067
2.955GluLys: 2.955 ± 0.065
5.097GluLeu: 5.097 ± 0.107
1.607GluMet: 1.607 ± 0.045
2.545GluAsn: 2.545 ± 0.062
1.755GluPro: 1.755 ± 0.059
2.76GluGln: 2.76 ± 0.072
2.607GluArg: 2.607 ± 0.065
2.75GluSer: 2.75 ± 0.067
3.312GluThr: 3.312 ± 0.066
3.322GluVal: 3.322 ± 0.073
0.52GluTrp: 0.52 ± 0.024
1.451GluTyr: 1.451 ± 0.047
0.0GluXaa: 0.0 ± 0.0
Phe
3.64PheAla: 3.64 ± 0.078
0.25PheCys: 0.25 ± 0.018
3.038PheAsp: 3.038 ± 0.063
2.123PheGlu: 2.123 ± 0.057
1.939PhePhe: 1.939 ± 0.061
3.431PheGly: 3.431 ± 0.07
0.899PheHis: 0.899 ± 0.039
2.751PheIle: 2.751 ± 0.079
2.223PheLys: 2.223 ± 0.057
3.611PheLeu: 3.611 ± 0.104
1.089PheMet: 1.089 ± 0.041
2.178PheAsn: 2.178 ± 0.054
1.437PhePro: 1.437 ± 0.049
1.341PheGln: 1.341 ± 0.04
1.394PheArg: 1.394 ± 0.045
2.894PheSer: 2.894 ± 0.072
2.626PheThr: 2.626 ± 0.068
3.253PheVal: 3.253 ± 0.079
0.584PheTrp: 0.584 ± 0.034
1.535PheTyr: 1.535 ± 0.048
0.0PheXaa: 0.0 ± 0.0
Gly
5.205GlyAla: 5.205 ± 0.095
0.306GlyCys: 0.306 ± 0.019
3.715GlyAsp: 3.715 ± 0.074
3.161GlyGlu: 3.161 ± 0.076
3.057GlyPhe: 3.057 ± 0.077
4.39GlyGly: 4.39 ± 0.075
1.785GlyHis: 1.785 ± 0.056
5.041GlyIle: 5.041 ± 0.109
4.083GlyLys: 4.083 ± 0.08
6.832GlyLeu: 6.832 ± 0.107
1.954GlyMet: 1.954 ± 0.056
3.003GlyAsn: 3.003 ± 0.073
1.689GlyPro: 1.689 ± 0.051
3.04GlyGln: 3.04 ± 0.076
2.823GlyArg: 2.823 ± 0.068
4.249GlySer: 4.249 ± 0.117
4.695GlyThr: 4.695 ± 0.124
5.113GlyVal: 5.113 ± 0.084
0.886GlyTrp: 0.886 ± 0.043
2.66GlyTyr: 2.66 ± 0.059
0.0GlyXaa: 0.0 ± 0.0
His
1.667HisAla: 1.667 ± 0.042
0.111HisCys: 0.111 ± 0.01
1.568HisAsp: 1.568 ± 0.047
1.358HisGlu: 1.358 ± 0.045
1.213HisPhe: 1.213 ± 0.041
1.755HisGly: 1.755 ± 0.057
0.814HisHis: 0.814 ± 0.033
1.44HisIle: 1.44 ± 0.045
0.971HisLys: 0.971 ± 0.036
2.241HisLeu: 2.241 ± 0.06
0.598HisMet: 0.598 ± 0.026
0.996HisAsn: 0.996 ± 0.033
1.095HisPro: 1.095 ± 0.038
1.404HisGln: 1.404 ± 0.041
1.133HisArg: 1.133 ± 0.041
1.223HisSer: 1.223 ± 0.071
1.249HisThr: 1.249 ± 0.04
1.84HisVal: 1.84 ± 0.048
0.338HisTrp: 0.338 ± 0.021
1.013HisTyr: 1.013 ± 0.034
0.0HisXaa: 0.0 ± 0.0
Ile
5.973IleAla: 5.973 ± 0.088
0.39IleCys: 0.39 ± 0.022
4.247IleAsp: 4.247 ± 0.078
3.108IleGlu: 3.108 ± 0.068
2.744IlePhe: 2.744 ± 0.085
4.996IleGly: 4.996 ± 0.112
1.418IleHis: 1.418 ± 0.049
4.444IleIle: 4.444 ± 0.116
3.357IleLys: 3.357 ± 0.068
5.733IleLeu: 5.733 ± 0.117
1.629IleMet: 1.629 ± 0.055
3.325IleAsn: 3.325 ± 0.07
2.556IlePro: 2.556 ± 0.06
2.3IleGln: 2.3 ± 0.051
2.463IleArg: 2.463 ± 0.066
4.313IleSer: 4.313 ± 0.084
4.194IleThr: 4.194 ± 0.065
5.055IleVal: 5.055 ± 0.104
0.622IleTrp: 0.622 ± 0.031
1.847IleTyr: 1.847 ± 0.053
0.0IleXaa: 0.0 ± 0.0
Lys
4.367LysAla: 4.367 ± 0.092
0.117LysCys: 0.117 ± 0.014
3.224LysAsp: 3.224 ± 0.071
2.705LysGlu: 2.705 ± 0.067
1.716LysPhe: 1.716 ± 0.049
2.887LysGly: 2.887 ± 0.067
1.267LysHis: 1.267 ± 0.039
3.335LysIle: 3.335 ± 0.065
3.236LysLys: 3.236 ± 0.082
4.836LysLeu: 4.836 ± 0.08
1.959LysMet: 1.959 ± 0.054
2.658LysAsn: 2.658 ± 0.068
2.151LysPro: 2.151 ± 0.056
3.337LysGln: 3.337 ± 0.079
3.077LysArg: 3.077 ± 0.061
3.044LysSer: 3.044 ± 0.069
3.983LysThr: 3.983 ± 0.08
3.772LysVal: 3.772 ± 0.079
0.597LysTrp: 0.597 ± 0.025
1.724LysTyr: 1.724 ± 0.051
0.0LysXaa: 0.0 ± 0.0
Leu
8.92LeuAla: 8.92 ± 0.126
0.469LeuCys: 0.469 ± 0.026
5.227LeuAsp: 5.227 ± 0.103
3.541LeuGlu: 3.541 ± 0.064
4.138LeuPhe: 4.138 ± 0.102
6.505LeuGly: 6.505 ± 0.107
1.98LeuHis: 1.98 ± 0.055
6.611LeuIle: 6.611 ± 0.128
5.109LeuLys: 5.109 ± 0.084
9.094LeuLeu: 9.094 ± 0.169
2.688LeuMet: 2.688 ± 0.062
4.871LeuAsn: 4.871 ± 0.069
4.124LeuPro: 4.124 ± 0.072
3.789LeuGln: 3.789 ± 0.08
3.889LeuArg: 3.889 ± 0.088
6.904LeuSer: 6.904 ± 0.102
7.621LeuThr: 7.621 ± 0.132
6.73LeuVal: 6.73 ± 0.115
0.947LeuTrp: 0.947 ± 0.038
2.486LeuTyr: 2.486 ± 0.053
0.0LeuXaa: 0.0 ± 0.0
Met
2.648MetAla: 2.648 ± 0.055
0.093MetCys: 0.093 ± 0.01
1.435MetAsp: 1.435 ± 0.044
1.058MetGlu: 1.058 ± 0.04
1.11MetPhe: 1.11 ± 0.038
1.745MetGly: 1.745 ± 0.051
0.553MetHis: 0.553 ± 0.029
1.901MetIle: 1.901 ± 0.056
1.716MetLys: 1.716 ± 0.056
2.407MetLeu: 2.407 ± 0.068
0.859MetMet: 0.859 ± 0.037
1.451MetAsn: 1.451 ± 0.04
1.103MetPro: 1.103 ± 0.035
1.232MetGln: 1.232 ± 0.04
1.099MetArg: 1.099 ± 0.042
1.806MetSer: 1.806 ± 0.05
2.199MetThr: 2.199 ± 0.055
1.863MetVal: 1.863 ± 0.05
0.288MetTrp: 0.288 ± 0.021
0.658MetTyr: 0.658 ± 0.031
0.0MetXaa: 0.0 ± 0.0
Asn
4.04AsnAla: 4.04 ± 0.089
0.189AsnCys: 0.189 ± 0.016
3.161AsnAsp: 3.161 ± 0.081
2.64AsnGlu: 2.64 ± 0.057
1.906AsnPhe: 1.906 ± 0.054
3.877AsnGly: 3.877 ± 0.1
1.294AsnHis: 1.294 ± 0.04
2.746AsnIle: 2.746 ± 0.068
2.366AsnLys: 2.366 ± 0.064
4.048AsnLeu: 4.048 ± 0.07
1.314AsnMet: 1.314 ± 0.039
2.494AsnAsn: 2.494 ± 0.068
2.057AsnPro: 2.057 ± 0.062
2.52AsnGln: 2.52 ± 0.065
2.166AsnArg: 2.166 ± 0.051
2.591AsnSer: 2.591 ± 0.099
2.686AsnThr: 2.686 ± 0.09
3.748AsnVal: 3.748 ± 0.079
0.65AsnTrp: 0.65 ± 0.027
1.661AsnTyr: 1.661 ± 0.05
0.0AsnXaa: 0.0 ± 0.0
Pro
3.253ProAla: 3.253 ± 0.077
0.104ProCys: 0.104 ± 0.012
2.645ProAsp: 2.645 ± 0.064
2.524ProGlu: 2.524 ± 0.068
1.666ProPhe: 1.666 ± 0.048
2.251ProGly: 2.251 ± 0.058
0.838ProHis: 0.838 ± 0.035
2.353ProIle: 2.353 ± 0.058
2.038ProLys: 2.038 ± 0.058
3.391ProLeu: 3.391 ± 0.078
0.833ProMet: 0.833 ± 0.031
1.902ProAsn: 1.902 ± 0.06
0.548ProPro: 0.548 ± 0.027
1.579ProGln: 1.579 ± 0.048
1.285ProArg: 1.285 ± 0.036
2.224ProSer: 2.224 ± 0.061
2.761ProThr: 2.761 ± 0.089
3.032ProVal: 3.032 ± 0.071
0.404ProTrp: 0.404 ± 0.025
1.176ProTyr: 1.176 ± 0.043
0.0ProXaa: 0.0 ± 0.0
Gln
4.083GlnAla: 4.083 ± 0.108
0.132GlnCys: 0.132 ± 0.014
2.133GlnAsp: 2.133 ± 0.053
1.97GlnGlu: 1.97 ± 0.058
1.821GlnPhe: 1.821 ± 0.052
2.286GlnGly: 2.286 ± 0.047
1.253GlnHis: 1.253 ± 0.042
2.918GlnIle: 2.918 ± 0.062
2.475GlnLys: 2.475 ± 0.066
5.454GlnLeu: 5.454 ± 0.111
1.305GlnMet: 1.305 ± 0.042
1.942GlnAsn: 1.942 ± 0.055
1.85GlnPro: 1.85 ± 0.044
3.191GlnGln: 3.191 ± 0.089
2.527GlnArg: 2.527 ± 0.071
2.944GlnSer: 2.944 ± 0.089
3.56GlnThr: 3.56 ± 0.082
3.24GlnVal: 3.24 ± 0.07
0.523GlnTrp: 0.523 ± 0.025
1.553GlnTyr: 1.553 ± 0.048
0.0GlnXaa: 0.0 ± 0.0
Arg
3.006ArgAla: 3.006 ± 0.062
0.157ArgCys: 0.157 ± 0.015
2.507ArgAsp: 2.507 ± 0.058
2.512ArgGlu: 2.512 ± 0.062
1.97ArgPhe: 1.97 ± 0.056
2.398ArgGly: 2.398 ± 0.065
1.274ArgHis: 1.274 ± 0.041
2.567ArgIle: 2.567 ± 0.065
2.348ArgLys: 2.348 ± 0.057
4.493ArgLeu: 4.493 ± 0.103
1.179ArgMet: 1.179 ± 0.046
1.99ArgAsn: 1.99 ± 0.049
1.615ArgPro: 1.615 ± 0.051
2.627ArgGln: 2.627 ± 0.07
2.367ArgArg: 2.367 ± 0.071
2.156ArgSer: 2.156 ± 0.056
2.284ArgThr: 2.284 ± 0.054
2.865ArgVal: 2.865 ± 0.079
0.518ArgTrp: 0.518 ± 0.031
1.637ArgTyr: 1.637 ± 0.051
0.0ArgXaa: 0.0 ± 0.0
Ser
5.46SerAla: 5.46 ± 0.211
0.208SerCys: 0.208 ± 0.018
3.972SerAsp: 3.972 ± 0.127
3.231SerGlu: 3.231 ± 0.065
2.857SerPhe: 2.857 ± 0.064
4.616SerGly: 4.616 ± 0.093
1.36SerHis: 1.36 ± 0.038
3.7SerIle: 3.7 ± 0.081
3.253SerLys: 3.253 ± 0.07
6.146SerLeu: 6.146 ± 0.142
1.622SerMet: 1.622 ± 0.046
2.796SerAsn: 2.796 ± 0.085
1.875SerPro: 1.875 ± 0.046
2.996SerGln: 2.996 ± 0.094
2.594SerArg: 2.594 ± 0.056
4.412SerSer: 4.412 ± 0.153
4.23SerThr: 4.23 ± 0.169
4.697SerVal: 4.697 ± 0.093
0.719SerTrp: 0.719 ± 0.029
1.842SerTyr: 1.842 ± 0.05
0.0SerXaa: 0.0 ± 0.0
Thr
6.167ThrAla: 6.167 ± 0.151
0.224ThrCys: 0.224 ± 0.017
4.267ThrAsp: 4.267 ± 0.123
2.987ThrGlu: 2.987 ± 0.068
2.848ThrPhe: 2.848 ± 0.06
5.03ThrGly: 5.03 ± 0.109
1.486ThrHis: 1.486 ± 0.076
4.569ThrIle: 4.569 ± 0.084
3.608ThrLys: 3.608 ± 0.081
6.482ThrLeu: 6.482 ± 0.115
1.522ThrMet: 1.522 ± 0.045
3.265ThrAsn: 3.265 ± 0.084
3.098ThrPro: 3.098 ± 0.096
2.69ThrGln: 2.69 ± 0.062
2.251ThrArg: 2.251 ± 0.055
4.355ThrSer: 4.355 ± 0.149
5.131ThrThr: 5.131 ± 0.264
5.609ThrVal: 5.609 ± 0.139
0.684ThrTrp: 0.684 ± 0.029
2.151ThrTyr: 2.151 ± 0.058
0.0ThrXaa: 0.0 ± 0.0
Val
6.919ValAla: 6.919 ± 0.116
0.378ValCys: 0.378 ± 0.021
4.606ValAsp: 4.606 ± 0.091
3.176ValGlu: 3.176 ± 0.078
2.772ValPhe: 2.772 ± 0.073
5.114ValGly: 5.114 ± 0.089
1.522ValHis: 1.522 ± 0.048
5.096ValIle: 5.096 ± 0.1
4.03ValLys: 4.03 ± 0.078
6.861ValLeu: 6.861 ± 0.113
2.059ValMet: 2.059 ± 0.06
3.681ValAsn: 3.681 ± 0.081
3.062ValPro: 3.062 ± 0.064
2.581ValGln: 2.581 ± 0.053
2.675ValArg: 2.675 ± 0.061
5.122ValSer: 5.122 ± 0.093
5.753ValThr: 5.753 ± 0.132
5.682ValVal: 5.682 ± 0.105
0.756ValTrp: 0.756 ± 0.031
2.193ValTyr: 2.193 ± 0.047
0.0ValXaa: 0.0 ± 0.0
Trp
0.76TrpAla: 0.76 ± 0.03
0.056TrpCys: 0.056 ± 0.008
0.545TrpAsp: 0.545 ± 0.03
0.38TrpGlu: 0.38 ± 0.021
0.562TrpPhe: 0.562 ± 0.027
0.671TrpGly: 0.671 ± 0.031
0.38TrpHis: 0.38 ± 0.023
0.701TrpIle: 0.701 ± 0.032
0.354TrpLys: 0.354 ± 0.025
1.61TrpLeu: 1.61 ± 0.063
0.302TrpMet: 0.302 ± 0.019
0.482TrpAsn: 0.482 ± 0.025
0.351TrpPro: 0.351 ± 0.022
0.794TrpGln: 0.794 ± 0.035
0.651TrpArg: 0.651 ± 0.033
0.71TrpSer: 0.71 ± 0.031
0.686TrpThr: 0.686 ± 0.034
0.71TrpVal: 0.71 ± 0.038
0.224TrpTrp: 0.224 ± 0.016
0.39TrpTyr: 0.39 ± 0.022
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.546TyrAla: 2.546 ± 0.058
0.167TyrCys: 0.167 ± 0.015
2.054TyrAsp: 2.054 ± 0.053
1.578TyrGlu: 1.578 ± 0.046
1.687TyrPhe: 1.687 ± 0.054
2.249TyrGly: 2.249 ± 0.063
0.878TyrHis: 0.878 ± 0.033
1.703TyrIle: 1.703 ± 0.048
1.386TyrLys: 1.386 ± 0.046
3.439TyrLeu: 3.439 ± 0.063
0.726TyrMet: 0.726 ± 0.035
1.377TyrAsn: 1.377 ± 0.041
1.248TyrPro: 1.248 ± 0.044
1.885TyrGln: 1.885 ± 0.052
1.672TyrArg: 1.672 ± 0.047
1.783TyrSer: 1.783 ± 0.051
1.923TyrThr: 1.923 ± 0.073
2.366TyrVal: 2.366 ± 0.049
0.446TyrTrp: 0.446 ± 0.026
1.256TyrTyr: 1.256 ± 0.045
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2635 proteins (804405 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski