Amino acid dipepetide frequency for Octadecabacter arcticus 238

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.272AlaAla: 14.272 ± 0.135
1.209AlaCys: 1.209 ± 0.033
7.036AlaAsp: 7.036 ± 0.082
6.598AlaGlu: 6.598 ± 0.091
4.255AlaPhe: 4.255 ± 0.061
9.186AlaGly: 9.186 ± 0.093
2.338AlaHis: 2.338 ± 0.046
6.318AlaIle: 6.318 ± 0.063
4.602AlaLys: 4.602 ± 0.068
11.947AlaLeu: 11.947 ± 0.103
3.803AlaMet: 3.803 ± 0.055
3.129AlaAsn: 3.129 ± 0.047
4.9AlaPro: 4.9 ± 0.071
4.664AlaGln: 4.664 ± 0.059
7.306AlaArg: 7.306 ± 0.08
5.58AlaSer: 5.58 ± 0.062
6.146AlaThr: 6.146 ± 0.076
7.841AlaVal: 7.841 ± 0.082
1.352AlaTrp: 1.352 ± 0.032
2.5AlaTyr: 2.5 ± 0.045
0.0AlaXaa: 0.0 ± 0.0
Cys
1.288CysAla: 1.288 ± 0.031
0.182CysCys: 0.182 ± 0.011
0.715CysAsp: 0.715 ± 0.024
0.491CysGlu: 0.491 ± 0.018
0.364CysPhe: 0.364 ± 0.018
0.972CysGly: 0.972 ± 0.033
0.281CysHis: 0.281 ± 0.015
0.49CysIle: 0.49 ± 0.02
0.323CysLys: 0.323 ± 0.016
0.931CysLeu: 0.931 ± 0.028
0.214CysMet: 0.214 ± 0.011
0.317CysAsn: 0.317 ± 0.016
0.488CysPro: 0.488 ± 0.022
0.31CysGln: 0.31 ± 0.016
0.734CysArg: 0.734 ± 0.024
0.501CysSer: 0.501 ± 0.02
0.5CysThr: 0.5 ± 0.021
0.661CysVal: 0.661 ± 0.021
0.159CysTrp: 0.159 ± 0.011
0.22CysTyr: 0.22 ± 0.013
0.0CysXaa: 0.0 ± 0.0
Asp
7.692AspAla: 7.692 ± 0.081
0.545AspCys: 0.545 ± 0.018
3.761AspAsp: 3.761 ± 0.067
3.57AspGlu: 3.57 ± 0.055
2.356AspPhe: 2.356 ± 0.041
5.666AspGly: 5.666 ± 0.079
1.48AspHis: 1.48 ± 0.034
3.714AspIle: 3.714 ± 0.063
1.985AspLys: 1.985 ± 0.042
6.323AspLeu: 6.323 ± 0.072
1.806AspMet: 1.806 ± 0.037
1.569AspAsn: 1.569 ± 0.034
3.218AspPro: 3.218 ± 0.053
2.144AspGln: 2.144 ± 0.041
3.858AspArg: 3.858 ± 0.059
2.207AspSer: 2.207 ± 0.05
3.294AspThr: 3.294 ± 0.046
4.851AspVal: 4.851 ± 0.069
1.199AspTrp: 1.199 ± 0.029
1.559AspTyr: 1.559 ± 0.038
0.0AspXaa: 0.0 ± 0.0
Glu
6.671GluAla: 6.671 ± 0.081
0.433GluCys: 0.433 ± 0.015
3.11GluAsp: 3.11 ± 0.056
2.614GluGlu: 2.614 ± 0.057
1.876GluPhe: 1.876 ± 0.038
3.933GluGly: 3.933 ± 0.065
1.165GluHis: 1.165 ± 0.031
3.626GluIle: 3.626 ± 0.052
2.169GluLys: 2.169 ± 0.045
4.921GluLeu: 4.921 ± 0.071
1.82GluMet: 1.82 ± 0.042
1.893GluAsn: 1.893 ± 0.036
2.111GluPro: 2.111 ± 0.037
1.97GluGln: 1.97 ± 0.038
3.838GluArg: 3.838 ± 0.057
2.103GluSer: 2.103 ± 0.042
3.683GluThr: 3.683 ± 0.049
4.006GluVal: 4.006 ± 0.06
0.724GluTrp: 0.724 ± 0.024
1.05GluTyr: 1.05 ± 0.027
0.0GluXaa: 0.0 ± 0.0
Phe
4.271PheAla: 4.271 ± 0.067
0.485PheCys: 0.485 ± 0.019
2.874PheAsp: 2.874 ± 0.048
2.198PheGlu: 2.198 ± 0.045
1.532PhePhe: 1.532 ± 0.038
3.713PheGly: 3.713 ± 0.068
0.791PheHis: 0.791 ± 0.024
1.9PheIle: 1.9 ± 0.043
1.297PheLys: 1.297 ± 0.028
3.344PheLeu: 3.344 ± 0.059
0.985PheMet: 0.985 ± 0.03
1.174PheAsn: 1.174 ± 0.037
1.56PhePro: 1.56 ± 0.036
1.069PheGln: 1.069 ± 0.028
2.101PheArg: 2.101 ± 0.043
2.229PheSer: 2.229 ± 0.053
2.232PheThr: 2.232 ± 0.048
2.805PheVal: 2.805 ± 0.048
0.578PheTrp: 0.578 ± 0.021
0.954PheTyr: 0.954 ± 0.028
0.0PheXaa: 0.0 ± 0.0
Gly
8.77GlyAla: 8.77 ± 0.088
0.853GlyCys: 0.853 ± 0.025
4.626GlyAsp: 4.626 ± 0.066
4.18GlyGlu: 4.18 ± 0.055
3.561GlyPhe: 3.561 ± 0.052
6.784GlyGly: 6.784 ± 0.096
2.025GlyHis: 2.025 ± 0.041
4.561GlyIle: 4.561 ± 0.069
3.443GlyLys: 3.443 ± 0.055
8.64GlyLeu: 8.64 ± 0.084
2.458GlyMet: 2.458 ± 0.043
2.208GlyAsn: 2.208 ± 0.048
3.196GlyPro: 3.196 ± 0.044
3.053GlyGln: 3.053 ± 0.049
4.889GlyArg: 4.889 ± 0.062
4.171GlySer: 4.171 ± 0.057
4.67GlyThr: 4.67 ± 0.066
6.363GlyVal: 6.363 ± 0.072
1.427GlyTrp: 1.427 ± 0.039
2.159GlyTyr: 2.159 ± 0.041
0.0GlyXaa: 0.0 ± 0.0
His
2.358HisAla: 2.358 ± 0.044
0.285HisCys: 0.285 ± 0.016
1.361HisAsp: 1.361 ± 0.037
1.032HisGlu: 1.032 ± 0.03
0.919HisPhe: 0.919 ± 0.028
1.954HisGly: 1.954 ± 0.044
0.747HisHis: 0.747 ± 0.028
1.24HisIle: 1.24 ± 0.032
0.851HisLys: 0.851 ± 0.029
2.219HisLeu: 2.219 ± 0.047
0.687HisMet: 0.687 ± 0.022
0.594HisAsn: 0.594 ± 0.019
1.325HisPro: 1.325 ± 0.032
0.682HisGln: 0.682 ± 0.023
1.384HisArg: 1.384 ± 0.035
1.09HisSer: 1.09 ± 0.036
0.928HisThr: 0.928 ± 0.025
1.632HisVal: 1.632 ± 0.035
0.447HisTrp: 0.447 ± 0.019
0.6HisTyr: 0.6 ± 0.02
0.0HisXaa: 0.0 ± 0.0
Ile
7.231IleAla: 7.231 ± 0.081
0.692IleCys: 0.692 ± 0.023
3.787IleAsp: 3.787 ± 0.052
3.839IleGlu: 3.839 ± 0.05
1.916IlePhe: 1.916 ± 0.049
5.153IleGly: 5.153 ± 0.067
1.047IleHis: 1.047 ± 0.03
2.948IleIle: 2.948 ± 0.046
2.007IleLys: 2.007 ± 0.032
5.06IleLeu: 5.06 ± 0.068
1.276IleMet: 1.276 ± 0.035
1.755IleAsn: 1.755 ± 0.04
2.527IlePro: 2.527 ± 0.048
1.453IleGln: 1.453 ± 0.033
3.157IleArg: 3.157 ± 0.055
3.417IleSer: 3.417 ± 0.057
3.311IleThr: 3.311 ± 0.058
4.164IleVal: 4.164 ± 0.062
0.79IleTrp: 0.79 ± 0.026
1.31IleTyr: 1.31 ± 0.03
0.0IleXaa: 0.0 ± 0.0
Lys
4.417LysAla: 4.417 ± 0.063
0.291LysCys: 0.291 ± 0.017
2.238LysAsp: 2.238 ± 0.042
1.611LysGlu: 1.611 ± 0.041
1.114LysPhe: 1.114 ± 0.032
3.056LysGly: 3.056 ± 0.048
0.934LysHis: 0.934 ± 0.026
2.234LysIle: 2.234 ± 0.048
1.609LysLys: 1.609 ± 0.045
3.61LysLeu: 3.61 ± 0.052
1.097LysMet: 1.097 ± 0.03
1.092LysAsn: 1.092 ± 0.033
1.985LysPro: 1.985 ± 0.044
1.198LysGln: 1.198 ± 0.035
2.801LysArg: 2.801 ± 0.051
2.336LysSer: 2.336 ± 0.044
2.781LysThr: 2.781 ± 0.045
2.613LysVal: 2.613 ± 0.049
0.568LysTrp: 0.568 ± 0.024
0.955LysTyr: 0.955 ± 0.026
0.0LysXaa: 0.0 ± 0.0
Leu
11.346LeuAla: 11.346 ± 0.115
1.094LeuCys: 1.094 ± 0.028
6.054LeuAsp: 6.054 ± 0.064
5.061LeuGlu: 5.061 ± 0.073
3.25LeuPhe: 3.25 ± 0.057
7.672LeuGly: 7.672 ± 0.079
2.048LeuHis: 2.048 ± 0.04
5.322LeuIle: 5.322 ± 0.075
3.856LeuLys: 3.856 ± 0.061
8.196LeuLeu: 8.196 ± 0.11
2.855LeuMet: 2.855 ± 0.047
3.018LeuAsn: 3.018 ± 0.049
4.818LeuPro: 4.818 ± 0.058
2.786LeuGln: 2.786 ± 0.051
6.586LeuArg: 6.586 ± 0.077
6.411LeuSer: 6.411 ± 0.076
5.937LeuThr: 5.937 ± 0.082
6.334LeuVal: 6.334 ± 0.076
1.157LeuTrp: 1.157 ± 0.033
1.941LeuTyr: 1.941 ± 0.04
0.0LeuXaa: 0.0 ± 0.0
Met
3.392MetAla: 3.392 ± 0.047
0.211MetCys: 0.211 ± 0.012
1.62MetAsp: 1.62 ± 0.034
1.392MetGlu: 1.392 ± 0.03
0.908MetPhe: 0.908 ± 0.027
2.354MetGly: 2.354 ± 0.048
0.528MetHis: 0.528 ± 0.019
1.824MetIle: 1.824 ± 0.041
1.245MetLys: 1.245 ± 0.032
2.564MetLeu: 2.564 ± 0.051
0.917MetMet: 0.917 ± 0.025
1.037MetAsn: 1.037 ± 0.029
1.499MetPro: 1.499 ± 0.036
0.945MetGln: 0.945 ± 0.028
1.882MetArg: 1.882 ± 0.042
1.936MetSer: 1.936 ± 0.038
2.274MetThr: 2.274 ± 0.039
1.946MetVal: 1.946 ± 0.046
0.258MetTrp: 0.258 ± 0.016
0.402MetTyr: 0.402 ± 0.019
0.0MetXaa: 0.0 ± 0.0
Asn
3.495AsnAla: 3.495 ± 0.055
0.343AsnCys: 0.343 ± 0.017
1.752AsnAsp: 1.752 ± 0.042
1.37AsnGlu: 1.37 ± 0.033
1.109AsnPhe: 1.109 ± 0.028
2.746AsnGly: 2.746 ± 0.054
0.594AsnHis: 0.594 ± 0.017
1.697AsnIle: 1.697 ± 0.038
0.938AsnLys: 0.938 ± 0.025
2.786AsnLeu: 2.786 ± 0.048
0.783AsnMet: 0.783 ± 0.028
0.932AsnAsn: 0.932 ± 0.03
2.006AsnPro: 2.006 ± 0.037
1.052AsnGln: 1.052 ± 0.035
1.918AsnArg: 1.918 ± 0.042
1.408AsnSer: 1.408 ± 0.035
1.614AsnThr: 1.614 ± 0.041
2.139AsnVal: 2.139 ± 0.039
0.537AsnTrp: 0.537 ± 0.023
0.729AsnTyr: 0.729 ± 0.025
0.0AsnXaa: 0.0 ± 0.0
Pro
4.827ProAla: 4.827 ± 0.068
0.361ProCys: 0.361 ± 0.017
3.743ProAsp: 3.743 ± 0.054
3.318ProGlu: 3.318 ± 0.054
1.939ProPhe: 1.939 ± 0.04
2.877ProGly: 2.877 ± 0.049
1.089ProHis: 1.089 ± 0.029
2.543ProIle: 2.543 ± 0.056
2.211ProLys: 2.211 ± 0.052
4.072ProLeu: 4.072 ± 0.056
1.383ProMet: 1.383 ± 0.032
1.556ProAsn: 1.556 ± 0.034
1.94ProPro: 1.94 ± 0.042
1.593ProGln: 1.593 ± 0.038
2.425ProArg: 2.425 ± 0.051
2.668ProSer: 2.668 ± 0.045
2.804ProThr: 2.804 ± 0.052
3.635ProVal: 3.635 ± 0.055
0.699ProTrp: 0.699 ± 0.025
1.039ProTyr: 1.039 ± 0.029
0.0ProXaa: 0.0 ± 0.0
Gln
4.079GlnAla: 4.079 ± 0.061
0.237GlnCys: 0.237 ± 0.012
1.875GlnAsp: 1.875 ± 0.034
1.503GlnGlu: 1.503 ± 0.038
1.239GlnPhe: 1.239 ± 0.03
2.393GlnGly: 2.393 ± 0.049
0.753GlnHis: 0.753 ± 0.024
2.314GlnIle: 2.314 ± 0.039
1.267GlnLys: 1.267 ± 0.028
3.032GlnLeu: 3.032 ± 0.052
1.143GlnMet: 1.143 ± 0.025
1.072GlnAsn: 1.072 ± 0.024
1.55GlnPro: 1.55 ± 0.031
1.104GlnGln: 1.104 ± 0.031
2.287GlnArg: 2.287 ± 0.047
2.134GlnSer: 2.134 ± 0.042
2.35GlnThr: 2.35 ± 0.051
2.389GlnVal: 2.389 ± 0.043
0.464GlnTrp: 0.464 ± 0.017
0.651GlnTyr: 0.651 ± 0.021
0.0GlnXaa: 0.0 ± 0.0
Arg
6.777ArgAla: 6.777 ± 0.079
0.588ArgCys: 0.588 ± 0.025
4.122ArgAsp: 4.122 ± 0.063
3.364ArgGlu: 3.364 ± 0.054
2.653ArgPhe: 2.653 ± 0.042
4.477ArgGly: 4.477 ± 0.059
1.56ArgHis: 1.56 ± 0.041
3.811ArgIle: 3.811 ± 0.054
2.806ArgLys: 2.806 ± 0.053
6.185ArgLeu: 6.185 ± 0.081
1.844ArgMet: 1.844 ± 0.036
1.94ArgAsn: 1.94 ± 0.041
2.768ArgPro: 2.768 ± 0.054
2.393ArgGln: 2.393 ± 0.051
4.553ArgArg: 4.553 ± 0.067
3.299ArgSer: 3.299 ± 0.055
3.343ArgThr: 3.343 ± 0.049
4.261ArgVal: 4.261 ± 0.06
0.85ArgTrp: 0.85 ± 0.03
1.511ArgTyr: 1.511 ± 0.032
0.0ArgXaa: 0.0 ± 0.0
Ser
5.729SerAla: 5.729 ± 0.066
0.485SerCys: 0.485 ± 0.022
3.701SerAsp: 3.701 ± 0.054
2.779SerGlu: 2.779 ± 0.047
2.239SerPhe: 2.239 ± 0.037
5.512SerGly: 5.512 ± 0.059
1.194SerHis: 1.194 ± 0.031
2.911SerIle: 2.911 ± 0.047
2.071SerLys: 2.071 ± 0.037
4.793SerLeu: 4.793 ± 0.058
1.533SerMet: 1.533 ± 0.032
1.771SerAsn: 1.771 ± 0.036
2.488SerPro: 2.488 ± 0.047
1.805SerGln: 1.805 ± 0.04
3.325SerArg: 3.325 ± 0.05
2.845SerSer: 2.845 ± 0.051
2.837SerThr: 2.837 ± 0.045
4.074SerVal: 4.074 ± 0.057
0.708SerTrp: 0.708 ± 0.025
1.34SerTyr: 1.34 ± 0.034
0.0SerXaa: 0.0 ± 0.0
Thr
6.406ThrAla: 6.406 ± 0.076
0.552ThrCys: 0.552 ± 0.02
3.574ThrAsp: 3.574 ± 0.053
2.812ThrGlu: 2.812 ± 0.043
2.436ThrPhe: 2.436 ± 0.043
5.211ThrGly: 5.211 ± 0.067
1.354ThrHis: 1.354 ± 0.034
3.205ThrIle: 3.205 ± 0.047
2.118ThrLys: 2.118 ± 0.04
6.213ThrLeu: 6.213 ± 0.078
1.461ThrMet: 1.461 ± 0.026
1.688ThrAsn: 1.688 ± 0.039
3.38ThrPro: 3.38 ± 0.058
1.938ThrGln: 1.938 ± 0.038
3.294ThrArg: 3.294 ± 0.054
3.194ThrSer: 3.194 ± 0.05
3.309ThrThr: 3.309 ± 0.053
4.417ThrVal: 4.417 ± 0.072
0.756ThrTrp: 0.756 ± 0.029
1.438ThrTyr: 1.438 ± 0.033
0.0ThrXaa: 0.0 ± 0.0
Val
8.087ValAla: 8.087 ± 0.086
0.837ValCys: 0.837 ± 0.024
4.348ValAsp: 4.348 ± 0.067
4.062ValGlu: 4.062 ± 0.059
2.934ValPhe: 2.934 ± 0.049
5.428ValGly: 5.428 ± 0.069
1.492ValHis: 1.492 ± 0.033
4.408ValIle: 4.408 ± 0.062
2.42ValLys: 2.42 ± 0.047
7.009ValLeu: 7.009 ± 0.082
2.108ValMet: 2.108 ± 0.036
2.076ValAsn: 2.076 ± 0.044
3.274ValPro: 3.274 ± 0.052
2.286ValGln: 2.286 ± 0.043
4.159ValArg: 4.159 ± 0.061
4.428ValSer: 4.428 ± 0.056
4.808ValThr: 4.808 ± 0.077
5.441ValVal: 5.441 ± 0.063
1.002ValTrp: 1.002 ± 0.031
1.509ValTyr: 1.509 ± 0.034
0.0ValXaa: 0.0 ± 0.0
Trp
1.407TrpAla: 1.407 ± 0.038
0.163TrpCys: 0.163 ± 0.011
0.893TrpAsp: 0.893 ± 0.029
0.644TrpGlu: 0.644 ± 0.027
0.585TrpPhe: 0.585 ± 0.022
0.987TrpGly: 0.987 ± 0.03
0.344TrpHis: 0.344 ± 0.017
0.732TrpIle: 0.732 ± 0.025
0.526TrpLys: 0.526 ± 0.019
1.644TrpLeu: 1.644 ± 0.033
0.421TrpMet: 0.421 ± 0.018
0.422TrpAsn: 0.422 ± 0.019
0.653TrpPro: 0.653 ± 0.02
0.61TrpGln: 0.61 ± 0.024
1.106TrpArg: 1.106 ± 0.03
0.809TrpSer: 0.809 ± 0.026
0.802TrpThr: 0.802 ± 0.026
0.985TrpVal: 0.985 ± 0.028
0.211TrpTrp: 0.211 ± 0.013
0.303TrpTyr: 0.303 ± 0.016
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.468TyrAla: 2.468 ± 0.048
0.292TyrCys: 0.292 ± 0.016
1.604TyrAsp: 1.604 ± 0.036
1.28TyrGlu: 1.28 ± 0.03
0.906TyrPhe: 0.906 ± 0.026
1.964TyrGly: 1.964 ± 0.043
0.571TyrHis: 0.571 ± 0.02
1.04TyrIle: 1.04 ± 0.027
0.796TyrLys: 0.796 ± 0.024
2.253TyrLeu: 2.253 ± 0.041
0.54TyrMet: 0.54 ± 0.018
0.706TyrAsn: 0.706 ± 0.021
1.035TyrPro: 1.035 ± 0.029
0.781TyrGln: 0.781 ± 0.021
1.502TyrArg: 1.502 ± 0.036
1.378TyrSer: 1.378 ± 0.033
1.134TyrThr: 1.134 ± 0.032
1.557TyrVal: 1.557 ± 0.033
0.361TyrTrp: 0.361 ± 0.018
0.599TyrTyr: 0.599 ± 0.022
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4331 proteins (1328656 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski