Amino acid dipepetide frequency for Dokdonia sp. (strain 4H-3-7-5) (Krokinobacter sp. (strain 4H-3-7-5))

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.331AlaAla: 5.331 ± 0.1
0.567AlaCys: 0.567 ± 0.024
3.859AlaAsp: 3.859 ± 0.104
4.001AlaGlu: 4.001 ± 0.074
3.685AlaPhe: 3.685 ± 0.065
5.107AlaGly: 5.107 ± 0.098
1.326AlaHis: 1.326 ± 0.039
6.074AlaIle: 6.074 ± 0.114
4.546AlaLys: 4.546 ± 0.097
6.943AlaLeu: 6.943 ± 0.101
1.826AlaMet: 1.826 ± 0.052
3.177AlaAsn: 3.177 ± 0.065
2.24AlaPro: 2.24 ± 0.063
3.149AlaGln: 3.149 ± 0.06
2.568AlaArg: 2.568 ± 0.055
4.89AlaSer: 4.89 ± 0.078
4.92AlaThr: 4.92 ± 0.11
4.77AlaVal: 4.77 ± 0.119
0.606AlaTrp: 0.606 ± 0.027
2.709AlaTyr: 2.709 ± 0.063
0.0AlaXaa: 0.0 ± 0.0
Cys
0.501CysAla: 0.501 ± 0.023
0.088CysCys: 0.088 ± 0.009
0.493CysAsp: 0.493 ± 0.033
0.446CysGlu: 0.446 ± 0.024
0.325CysPhe: 0.325 ± 0.019
0.567CysGly: 0.567 ± 0.027
0.163CysHis: 0.163 ± 0.014
0.495CysIle: 0.495 ± 0.022
0.419CysLys: 0.419 ± 0.022
0.582CysLeu: 0.582 ± 0.027
0.139CysMet: 0.139 ± 0.011
0.354CysAsn: 0.354 ± 0.017
0.303CysPro: 0.303 ± 0.022
0.175CysGln: 0.175 ± 0.014
0.189CysArg: 0.189 ± 0.014
0.514CysSer: 0.514 ± 0.029
0.451CysThr: 0.451 ± 0.022
0.456CysVal: 0.456 ± 0.021
0.058CysTrp: 0.058 ± 0.007
0.264CysTyr: 0.264 ± 0.017
0.0CysXaa: 0.0 ± 0.0
Asp
4.713AspAla: 4.713 ± 0.123
0.42AspCys: 0.42 ± 0.027
3.495AspAsp: 3.495 ± 0.105
3.999AspGlu: 3.999 ± 0.08
3.601AspPhe: 3.601 ± 0.071
4.676AspGly: 4.676 ± 0.165
0.982AspHis: 0.982 ± 0.035
4.596AspIle: 4.596 ± 0.073
3.808AspLys: 3.808 ± 0.073
5.166AspLeu: 5.166 ± 0.084
1.259AspMet: 1.259 ± 0.036
3.385AspAsn: 3.385 ± 0.067
2.057AspPro: 2.057 ± 0.098
1.721AspGln: 1.721 ± 0.046
2.186AspArg: 2.186 ± 0.052
3.238AspSer: 3.238 ± 0.073
3.524AspThr: 3.524 ± 0.114
4.21AspVal: 4.21 ± 0.106
0.716AspTrp: 0.716 ± 0.031
2.859AspTyr: 2.859 ± 0.061
0.0AspXaa: 0.0 ± 0.0
Glu
4.65GluAla: 4.65 ± 0.085
0.32GluCys: 0.32 ± 0.018
4.195GluAsp: 4.195 ± 0.092
5.114GluGlu: 5.114 ± 0.091
2.702GluPhe: 2.702 ± 0.057
4.135GluGly: 4.135 ± 0.076
1.231GluHis: 1.231 ± 0.04
5.215GluIle: 5.215 ± 0.076
4.871GluLys: 4.871 ± 0.086
5.925GluLeu: 5.925 ± 0.087
1.648GluMet: 1.648 ± 0.039
4.133GluAsn: 4.133 ± 0.086
1.596GluPro: 1.596 ± 0.044
2.453GluGln: 2.453 ± 0.051
2.902GluArg: 2.902 ± 0.061
3.337GluSer: 3.337 ± 0.068
3.816GluThr: 3.816 ± 0.086
4.707GluVal: 4.707 ± 0.064
0.584GluTrp: 0.584 ± 0.025
2.237GluTyr: 2.237 ± 0.046
0.0GluXaa: 0.0 ± 0.0
Phe
3.355PheAla: 3.355 ± 0.062
0.375PheCys: 0.375 ± 0.021
3.331PheAsp: 3.331 ± 0.057
3.309PheGlu: 3.309 ± 0.06
2.442PhePhe: 2.442 ± 0.063
3.408PheGly: 3.408 ± 0.069
0.752PheHis: 0.752 ± 0.03
3.685PheIle: 3.685 ± 0.071
3.377PheLys: 3.377 ± 0.059
4.28PheLeu: 4.28 ± 0.08
1.11PheMet: 1.11 ± 0.036
2.906PheAsn: 2.906 ± 0.055
1.684PhePro: 1.684 ± 0.046
1.456PheGln: 1.456 ± 0.043
1.581PheArg: 1.581 ± 0.041
3.435PheSer: 3.435 ± 0.064
3.401PheThr: 3.401 ± 0.068
2.89PheVal: 2.89 ± 0.065
0.519PheTrp: 0.519 ± 0.024
2.007PheTyr: 2.007 ± 0.048
0.0PheXaa: 0.0 ± 0.0
Gly
5.028GlyAla: 5.028 ± 0.095
0.557GlyCys: 0.557 ± 0.029
4.174GlyAsp: 4.174 ± 0.111
3.808GlyGlu: 3.808 ± 0.079
3.528GlyPhe: 3.528 ± 0.066
4.852GlyGly: 4.852 ± 0.111
1.177GlyHis: 1.177 ± 0.037
5.195GlyIle: 5.195 ± 0.081
4.505GlyLys: 4.505 ± 0.085
5.883GlyLeu: 5.883 ± 0.095
1.726GlyMet: 1.726 ± 0.048
3.653GlyAsn: 3.653 ± 0.093
1.44GlyPro: 1.44 ± 0.054
1.978GlyGln: 1.978 ± 0.042
2.315GlyArg: 2.315 ± 0.058
4.048GlySer: 4.048 ± 0.068
4.564GlyThr: 4.564 ± 0.129
4.892GlyVal: 4.892 ± 0.08
0.741GlyTrp: 0.741 ± 0.026
2.759GlyTyr: 2.759 ± 0.061
0.0GlyXaa: 0.0 ± 0.0
His
0.998HisAla: 0.998 ± 0.033
0.185HisCys: 0.185 ± 0.015
0.862HisAsp: 0.862 ± 0.031
0.943HisGlu: 0.943 ± 0.035
1.069HisPhe: 1.069 ± 0.031
1.007HisGly: 1.007 ± 0.037
0.441HisHis: 0.441 ± 0.023
1.478HisIle: 1.478 ± 0.039
1.206HisLys: 1.206 ± 0.038
1.899HisLeu: 1.899 ± 0.049
0.33HisMet: 0.33 ± 0.02
0.889HisAsn: 0.889 ± 0.031
0.837HisPro: 0.837 ± 0.027
0.647HisGln: 0.647 ± 0.029
0.678HisArg: 0.678 ± 0.027
0.992HisSer: 0.992 ± 0.035
1.059HisThr: 1.059 ± 0.034
0.931HisVal: 0.931 ± 0.034
0.228HisTrp: 0.228 ± 0.017
0.824HisTyr: 0.824 ± 0.029
0.0HisXaa: 0.0 ± 0.0
Ile
6.331IleAla: 6.331 ± 0.108
0.56IleCys: 0.56 ± 0.026
4.976IleAsp: 4.976 ± 0.077
4.984IleGlu: 4.984 ± 0.078
3.342IlePhe: 3.342 ± 0.06
4.912IleGly: 4.912 ± 0.086
1.264IleHis: 1.264 ± 0.039
5.719IleIle: 5.719 ± 0.095
5.139IleLys: 5.139 ± 0.07
6.643IleLeu: 6.643 ± 0.11
1.278IleMet: 1.278 ± 0.039
4.242IleAsn: 4.242 ± 0.085
3.17IlePro: 3.17 ± 0.078
2.311IleGln: 2.311 ± 0.048
2.405IleArg: 2.405 ± 0.046
5.028IleSer: 5.028 ± 0.074
5.42IleThr: 5.42 ± 0.105
4.777IleVal: 4.777 ± 0.063
0.609IleTrp: 0.609 ± 0.023
2.625IleTyr: 2.625 ± 0.055
0.0IleXaa: 0.0 ± 0.0
Lys
5.065LysAla: 5.065 ± 0.097
0.276LysCys: 0.276 ± 0.018
4.101LysAsp: 4.101 ± 0.067
5.849LysGlu: 5.849 ± 0.095
2.469LysPhe: 2.469 ± 0.055
4.137LysGly: 4.137 ± 0.081
1.19LysHis: 1.19 ± 0.035
4.675LysIle: 4.675 ± 0.074
5.73LysLys: 5.73 ± 0.104
5.696LysLeu: 5.696 ± 0.091
1.822LysMet: 1.822 ± 0.046
4.061LysAsn: 4.061 ± 0.064
2.135LysPro: 2.135 ± 0.047
2.379LysGln: 2.379 ± 0.053
3.025LysArg: 3.025 ± 0.062
3.99LysSer: 3.99 ± 0.07
4.067LysThr: 4.067 ± 0.069
4.185LysVal: 4.185 ± 0.081
0.67LysTrp: 0.67 ± 0.03
2.502LysTyr: 2.502 ± 0.05
0.0LysXaa: 0.0 ± 0.0
Leu
6.33LeuAla: 6.33 ± 0.092
0.67LeuCys: 0.67 ± 0.025
5.646LeuAsp: 5.646 ± 0.081
6.004LeuGlu: 6.004 ± 0.09
4.515LeuPhe: 4.515 ± 0.078
6.033LeuGly: 6.033 ± 0.085
1.545LeuHis: 1.545 ± 0.044
6.416LeuIle: 6.416 ± 0.105
6.549LeuLys: 6.549 ± 0.101
8.764LeuLeu: 8.764 ± 0.133
2.084LeuMet: 2.084 ± 0.056
4.76LeuAsn: 4.76 ± 0.075
3.48LeuPro: 3.48 ± 0.05
3.236LeuGln: 3.236 ± 0.063
3.613LeuArg: 3.613 ± 0.063
6.864LeuSer: 6.864 ± 0.103
5.252LeuThr: 5.252 ± 0.099
5.611LeuVal: 5.611 ± 0.086
0.791LeuTrp: 0.791 ± 0.027
3.1LeuTyr: 3.1 ± 0.06
0.0LeuXaa: 0.0 ± 0.0
Met
1.663MetAla: 1.663 ± 0.042
0.145MetCys: 0.145 ± 0.011
1.271MetAsp: 1.271 ± 0.037
1.41MetGlu: 1.41 ± 0.042
0.781MetPhe: 0.781 ± 0.029
1.477MetGly: 1.477 ± 0.044
0.41MetHis: 0.41 ± 0.022
1.572MetIle: 1.572 ± 0.041
1.98MetLys: 1.98 ± 0.045
2.054MetLeu: 2.054 ± 0.055
0.699MetMet: 0.699 ± 0.032
1.188MetAsn: 1.188 ± 0.039
0.854MetPro: 0.854 ± 0.027
0.819MetGln: 0.819 ± 0.031
1.023MetArg: 1.023 ± 0.028
1.538MetSer: 1.538 ± 0.043
1.307MetThr: 1.307 ± 0.036
1.378MetVal: 1.378 ± 0.043
0.165MetTrp: 0.165 ± 0.013
0.751MetTyr: 0.751 ± 0.027
0.0MetXaa: 0.0 ± 0.0
Asn
3.925AsnAla: 3.925 ± 0.091
0.363AsnCys: 0.363 ± 0.019
3.328AsnAsp: 3.328 ± 0.096
3.555AsnGlu: 3.555 ± 0.112
2.597AsnPhe: 2.597 ± 0.055
3.897AsnGly: 3.897 ± 0.115
0.96AsnHis: 0.96 ± 0.033
4.128AsnIle: 4.128 ± 0.084
3.515AsnLys: 3.515 ± 0.067
4.657AsnLeu: 4.657 ± 0.083
1.156AsnMet: 1.156 ± 0.033
3.312AsnAsn: 3.312 ± 0.098
2.609AsnPro: 2.609 ± 0.064
1.859AsnGln: 1.859 ± 0.055
2.11AsnArg: 2.11 ± 0.045
3.135AsnSer: 3.135 ± 0.062
3.596AsnThr: 3.596 ± 0.077
3.445AsnVal: 3.445 ± 0.078
0.655AsnTrp: 0.655 ± 0.026
2.443AsnTyr: 2.443 ± 0.059
0.0AsnXaa: 0.0 ± 0.0
Pro
2.39ProAla: 2.39 ± 0.064
0.229ProCys: 0.229 ± 0.023
2.245ProAsp: 2.245 ± 0.06
2.905ProGlu: 2.905 ± 0.06
1.855ProPhe: 1.855 ± 0.05
1.954ProGly: 1.954 ± 0.062
0.601ProHis: 0.601 ± 0.025
2.524ProIle: 2.524 ± 0.048
2.15ProLys: 2.15 ± 0.052
3.089ProLeu: 3.089 ± 0.056
0.743ProMet: 0.743 ± 0.027
1.89ProAsn: 1.89 ± 0.05
0.85ProPro: 0.85 ± 0.033
1.315ProGln: 1.315 ± 0.046
1.039ProArg: 1.039 ± 0.033
2.331ProSer: 2.331 ± 0.054
2.2ProThr: 2.2 ± 0.071
2.482ProVal: 2.482 ± 0.065
0.336ProTrp: 0.336 ± 0.019
1.406ProTyr: 1.406 ± 0.037
0.0ProXaa: 0.0 ± 0.0
Gln
2.352GlnAla: 2.352 ± 0.057
0.169GlnCys: 0.169 ± 0.012
1.964GlnAsp: 1.964 ± 0.053
2.769GlnGlu: 2.769 ± 0.058
1.599GlnPhe: 1.599 ± 0.043
2.107GlnGly: 2.107 ± 0.046
0.636GlnHis: 0.636 ± 0.028
2.469GlnIle: 2.469 ± 0.053
2.365GlnLys: 2.365 ± 0.059
3.561GlnLeu: 3.561 ± 0.063
0.864GlnMet: 0.864 ± 0.031
1.78GlnAsn: 1.78 ± 0.069
1.133GlnPro: 1.133 ± 0.04
1.504GlnGln: 1.504 ± 0.048
1.425GlnArg: 1.425 ± 0.039
1.974GlnSer: 1.974 ± 0.048
1.904GlnThr: 1.904 ± 0.052
2.277GlnVal: 2.277 ± 0.049
0.384GlnTrp: 0.384 ± 0.021
1.282GlnTyr: 1.282 ± 0.037
0.0GlnXaa: 0.0 ± 0.0
Arg
2.494ArgAla: 2.494 ± 0.059
0.191ArgCys: 0.191 ± 0.013
2.23ArgAsp: 2.23 ± 0.054
2.516ArgGlu: 2.516 ± 0.056
2.135ArgPhe: 2.135 ± 0.052
2.217ArgGly: 2.217 ± 0.048
0.612ArgHis: 0.612 ± 0.025
2.917ArgIle: 2.917 ± 0.055
2.834ArgLys: 2.834 ± 0.066
3.498ArgLeu: 3.498 ± 0.068
0.892ArgMet: 0.892 ± 0.032
2.203ArgAsn: 2.203 ± 0.051
1.222ArgPro: 1.222 ± 0.038
1.163ArgGln: 1.163 ± 0.038
1.487ArgArg: 1.487 ± 0.048
2.209ArgSer: 2.209 ± 0.051
2.081ArgThr: 2.081 ± 0.045
2.414ArgVal: 2.414 ± 0.049
0.386ArgTrp: 0.386 ± 0.023
1.614ArgTyr: 1.614 ± 0.04
0.0ArgXaa: 0.0 ± 0.0
Ser
4.115SerAla: 4.115 ± 0.072
0.658SerCys: 0.658 ± 0.024
3.639SerAsp: 3.639 ± 0.086
3.615SerGlu: 3.615 ± 0.062
3.605SerPhe: 3.605 ± 0.066
4.544SerGly: 4.544 ± 0.081
1.082SerHis: 1.082 ± 0.033
4.908SerIle: 4.908 ± 0.071
4.224SerLys: 4.224 ± 0.077
6.169SerLeu: 6.169 ± 0.09
1.309SerMet: 1.309 ± 0.034
3.453SerAsn: 3.453 ± 0.072
2.14SerPro: 2.14 ± 0.05
2.311SerGln: 2.311 ± 0.053
2.439SerArg: 2.439 ± 0.054
4.228SerSer: 4.228 ± 0.084
3.762SerThr: 3.762 ± 0.074
3.843SerVal: 3.843 ± 0.064
0.698SerTrp: 0.698 ± 0.029
2.777SerTyr: 2.777 ± 0.058
0.0SerXaa: 0.0 ± 0.0
Thr
4.778ThrAla: 4.778 ± 0.11
0.344ThrCys: 0.344 ± 0.019
3.795ThrAsp: 3.795 ± 0.097
3.603ThrGlu: 3.603 ± 0.058
3.194ThrPhe: 3.194 ± 0.063
4.318ThrGly: 4.318 ± 0.106
1.051ThrHis: 1.051 ± 0.034
5.239ThrIle: 5.239 ± 0.138
3.401ThrLys: 3.401 ± 0.056
5.898ThrLeu: 5.898 ± 0.087
1.149ThrMet: 1.149 ± 0.035
3.143ThrAsn: 3.143 ± 0.095
2.831ThrPro: 2.831 ± 0.084
2.15ThrGln: 2.15 ± 0.079
1.936ThrArg: 1.936 ± 0.047
4.191ThrSer: 4.191 ± 0.08
4.561ThrThr: 4.561 ± 0.168
4.728ThrVal: 4.728 ± 0.216
0.564ThrTrp: 0.564 ± 0.027
2.582ThrTyr: 2.582 ± 0.07
0.0ThrXaa: 0.0 ± 0.0
Val
5.103ValAla: 5.103 ± 0.081
0.479ValCys: 0.479 ± 0.022
3.991ValAsp: 3.991 ± 0.078
3.894ValGlu: 3.894 ± 0.075
3.161ValPhe: 3.161 ± 0.068
4.224ValGly: 4.224 ± 0.078
1.076ValHis: 1.076 ± 0.035
5.153ValIle: 5.153 ± 0.075
3.916ValLys: 3.916 ± 0.071
6.072ValLeu: 6.072 ± 0.088
1.442ValMet: 1.442 ± 0.04
3.592ValAsn: 3.592 ± 0.083
2.34ValPro: 2.34 ± 0.051
2.159ValGln: 2.159 ± 0.052
2.314ValArg: 2.314 ± 0.053
4.543ValSer: 4.543 ± 0.08
4.629ValThr: 4.629 ± 0.219
4.733ValVal: 4.733 ± 0.097
0.613ValTrp: 0.613 ± 0.024
2.227ValTyr: 2.227 ± 0.059
0.0ValXaa: 0.0 ± 0.0
Trp
0.573TrpAla: 0.573 ± 0.022
0.085TrpCys: 0.085 ± 0.009
0.58TrpAsp: 0.58 ± 0.027
0.645TrpGlu: 0.645 ± 0.026
0.501TrpPhe: 0.501 ± 0.023
0.623TrpGly: 0.623 ± 0.028
0.204TrpHis: 0.204 ± 0.014
0.737TrpIle: 0.737 ± 0.028
0.723TrpLys: 0.723 ± 0.03
0.917TrpLeu: 0.917 ± 0.031
0.312TrpMet: 0.312 ± 0.017
0.668TrpAsn: 0.668 ± 0.031
0.232TrpPro: 0.232 ± 0.017
0.368TrpGln: 0.368 ± 0.019
0.412TrpArg: 0.412 ± 0.025
0.645TrpSer: 0.645 ± 0.03
0.504TrpThr: 0.504 ± 0.024
0.595TrpVal: 0.595 ± 0.024
0.148TrpTrp: 0.148 ± 0.013
0.421TrpTyr: 0.421 ± 0.02
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.599TyrAla: 2.599 ± 0.052
0.304TyrCys: 0.304 ± 0.017
2.434TyrAsp: 2.434 ± 0.05
2.403TyrGlu: 2.403 ± 0.052
2.197TyrPhe: 2.197 ± 0.045
2.629TyrGly: 2.629 ± 0.058
0.801TyrHis: 0.801 ± 0.028
2.539TyrIle: 2.539 ± 0.046
2.765TyrLys: 2.765 ± 0.063
3.565TyrLeu: 3.565 ± 0.07
0.695TyrMet: 0.695 ± 0.025
2.458TyrAsn: 2.458 ± 0.071
1.384TyrPro: 1.384 ± 0.038
1.365TyrGln: 1.365 ± 0.042
1.669TyrArg: 1.669 ± 0.042
2.385TyrSer: 2.385 ± 0.058
2.437TyrThr: 2.437 ± 0.062
2.338TyrVal: 2.338 ± 0.049
0.422TyrTrp: 0.422 ± 0.021
1.678TyrTyr: 1.678 ± 0.046
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2978 proteins (1016983 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski