Amino acid dipeptide frequency for Corynebacterium aquilae DSM 44791

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If all dipeptides occurred with equal probability in proteins, each of the 400 would therefore be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
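The calculation described above can be sketched in Python. This is a minimal illustration, not the code used to build this page: the function names, the one-letter alphabet, the mapping of every non-standard letter to X (Xaa), and the choice to count overlapping pairs within each protein are all assumptions, and the toy sequences are made up.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids; anything else is counted as X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / (len(STANDARD) ** 2)


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: non-standard or unknown residues are mapped to X.
        clean = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(clean[i:i + 2] for i in range(len(clean) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation (red/blue in the table)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


toy_proteins = ["MAAAG", "MKAL"]  # hypothetical sequences, for illustration only
freqs = dipeptide_per_mille(toy_proteins)
for dipeptide, value in sorted(freqs.items()):
    print(dipeptide, round(value, 3), classify(value))
```

With the values from this page, `classify(16.023)` (AlaAla) gives "over" and `classify(0.11)` (CysCys) gives "under".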

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
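As a small worked example of the unit conversion, the sketch below turns a per mille value from the table into a fraction and a percentage. The helper names are hypothetical; the AlaAla value is taken from the table on this page.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_per_mille / 10.0


ala_ala = 16.023  # AlaAla frequency from the table, in per mille
uniform = 2.5     # uniform expectation, 1/400 expressed in per mille

print("fraction:", per_mille_to_fraction(ala_ala))
print("percent:", per_mille_to_percent(ala_ala))
print("ratio to uniform expectation:", ala_ala / uniform)
```

On this scale AlaAla is roughly 1.6% of all dipeptides, about 6.4 times the uniform expectation of 0.25%.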

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 16.023 ± 0.28
AlaCys: 1.179 ± 0.045
AlaAsp: 7.403 ± 0.14
AlaGlu: 7.499 ± 0.115
AlaPhe: 3.558 ± 0.078
AlaGly: 10.243 ± 0.142
AlaHis: 2.969 ± 0.074
AlaIle: 5.94 ± 0.12
AlaLys: 4.074 ± 0.11
AlaLeu: 11.495 ± 0.156
AlaMet: 2.776 ± 0.059
AlaAsn: 2.985 ± 0.067
AlaPro: 5.506 ± 0.15
AlaGln: 4.874 ± 0.107
AlaArg: 7.162 ± 0.127
AlaSer: 5.866 ± 0.115
AlaThr: 7.884 ± 0.13
AlaVal: 9.407 ± 0.159
AlaTrp: 1.374 ± 0.05
AlaTyr: 2.338 ± 0.057
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.156 ± 0.053
CysCys: 0.11 ± 0.013
CysAsp: 0.569 ± 0.03
CysGlu: 0.534 ± 0.033
CysPhe: 0.284 ± 0.021
CysGly: 0.946 ± 0.042
CysHis: 0.226 ± 0.018
CysIle: 0.354 ± 0.023
CysLys: 0.201 ± 0.019
CysLeu: 0.694 ± 0.037
CysMet: 0.153 ± 0.016
CysAsn: 0.218 ± 0.019
CysPro: 0.505 ± 0.028
CysGln: 0.284 ± 0.019
CysArg: 0.424 ± 0.028
CysSer: 0.468 ± 0.029
CysThr: 0.538 ± 0.032
CysVal: 0.723 ± 0.035
CysTrp: 0.11 ± 0.012
CysTyr: 0.181 ± 0.017
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.301 ± 0.111
AspCys: 0.58 ± 0.033
AspAsp: 3.776 ± 0.101
AspGlu: 4.354 ± 0.096
AspPhe: 2.161 ± 0.062
AspGly: 5.128 ± 0.106
AspHis: 1.42 ± 0.054
AspIle: 3.424 ± 0.074
AspLys: 2.239 ± 0.078
AspLeu: 5.222 ± 0.093
AspMet: 1.312 ± 0.041
AspAsn: 2.073 ± 0.068
AspPro: 3.727 ± 0.072
AspGln: 1.696 ± 0.052
AspArg: 3.12 ± 0.084
AspSer: 3.217 ± 0.08
AspThr: 3.714 ± 0.088
AspVal: 5.099 ± 0.099
AspTrp: 0.75 ± 0.035
AspTyr: 1.543 ± 0.051
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.502 ± 0.128
GluCys: 0.442 ± 0.029
GluAsp: 3.729 ± 0.085
GluGlu: 3.718 ± 0.096
GluPhe: 2.018 ± 0.054
GluGly: 4.042 ± 0.083
GluHis: 1.588 ± 0.047
GluIle: 3.012 ± 0.077
GluLys: 2.675 ± 0.071
GluLeu: 6.342 ± 0.1
GluMet: 1.309 ± 0.05
GluAsn: 1.877 ± 0.066
GluPro: 2.544 ± 0.072
GluGln: 2.846 ± 0.078
GluArg: 3.537 ± 0.089
GluSer: 2.539 ± 0.07
GluThr: 2.931 ± 0.07
GluVal: 4.606 ± 0.093
GluTrp: 0.662 ± 0.038
GluTyr: 1.469 ± 0.048
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.926 ± 0.087
PheCys: 0.265 ± 0.022
PheAsp: 2.507 ± 0.067
PheGlu: 1.833 ± 0.052
PhePhe: 1.321 ± 0.048
PheGly: 3.383 ± 0.078
PheHis: 0.699 ± 0.037
PheIle: 1.652 ± 0.062
PheLys: 0.813 ± 0.044
PheLeu: 2.773 ± 0.084
PheMet: 0.628 ± 0.037
PheAsn: 1.036 ± 0.044
PhePro: 1.478 ± 0.052
PheGln: 0.808 ± 0.034
PheArg: 1.489 ± 0.051
PheSer: 2.128 ± 0.061
PheThr: 2.201 ± 0.057
PheVal: 2.818 ± 0.068
PheTrp: 0.355 ± 0.026
PheTyr: 0.856 ± 0.038
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.757 ± 0.138
GlyCys: 0.843 ± 0.042
GlyAsp: 4.53 ± 0.102
GlyGlu: 4.945 ± 0.098
GlyPhe: 3.207 ± 0.078
GlyGly: 6.769 ± 0.143
GlyHis: 2.021 ± 0.059
GlyIle: 4.504 ± 0.094
GlyLys: 3.163 ± 0.082
GlyLeu: 7.778 ± 0.137
GlyMet: 2.216 ± 0.063
GlyAsn: 2.228 ± 0.062
GlyPro: 3.22 ± 0.071
GlyGln: 3.053 ± 0.058
GlyArg: 4.757 ± 0.091
GlySer: 4.659 ± 0.086
GlyThr: 5.149 ± 0.1
GlyVal: 7.351 ± 0.137
GlyTrp: 1.304 ± 0.041
GlyTyr: 2.196 ± 0.06
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.506 ± 0.071
HisCys: 0.233 ± 0.02
HisAsp: 1.396 ± 0.049
HisGlu: 1.18 ± 0.048
HisPhe: 0.694 ± 0.034
HisGly: 1.897 ± 0.061
HisHis: 0.819 ± 0.042
HisIle: 1.153 ± 0.042
HisLys: 0.583 ± 0.03
HisLeu: 2.146 ± 0.062
HisMet: 0.519 ± 0.032
HisAsn: 0.767 ± 0.033
HisPro: 1.719 ± 0.063
HisGln: 0.805 ± 0.038
HisArg: 1.464 ± 0.051
HisSer: 1.228 ± 0.048
HisThr: 1.737 ± 0.055
HisVal: 1.644 ± 0.05
HisTrp: 0.279 ± 0.022
HisTyr: 0.586 ± 0.028
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.906 ± 0.132
IleCys: 0.406 ± 0.026
IleAsp: 3.938 ± 0.082
IleGlu: 2.898 ± 0.075
IlePhe: 1.559 ± 0.058
IleGly: 4.507 ± 0.103
IleHis: 1.023 ± 0.035
IleIle: 2.835 ± 0.084
IleLys: 1.575 ± 0.056
IleLeu: 3.949 ± 0.091
IleMet: 0.949 ± 0.043
IleAsn: 1.711 ± 0.051
IlePro: 2.695 ± 0.06
IleGln: 1.199 ± 0.044
IleArg: 2.504 ± 0.067
IleSer: 2.818 ± 0.071
IleThr: 3.66 ± 0.092
IleVal: 4.522 ± 0.104
IleTrp: 0.386 ± 0.026
IleTyr: 1.023 ± 0.04
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.179 ± 0.119
LysCys: 0.16 ± 0.017
LysAsp: 2.402 ± 0.075
LysGlu: 1.993 ± 0.076
LysPhe: 0.878 ± 0.039
LysGly: 2.413 ± 0.07
LysHis: 0.72 ± 0.035
LysIle: 1.774 ± 0.063
LysLys: 1.911 ± 0.067
LysLeu: 3.163 ± 0.082
LysMet: 0.862 ± 0.033
LysAsn: 1.283 ± 0.047
LysPro: 1.874 ± 0.07
LysGln: 1.382 ± 0.047
LysArg: 1.984 ± 0.066
LysSer: 1.559 ± 0.057
LysThr: 2.147 ± 0.071
LysVal: 2.82 ± 0.084
LysTrp: 0.372 ± 0.026
LysTyr: 0.694 ± 0.034
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 12.105 ± 0.166
LeuCys: 0.793 ± 0.037
LeuAsp: 5.858 ± 0.118
LeuGlu: 4.765 ± 0.107
LeuPhe: 2.686 ± 0.071
LeuGly: 8.125 ± 0.127
LeuHis: 1.879 ± 0.053
LeuIle: 4.887 ± 0.109
LeuLys: 3.2 ± 0.087
LeuLeu: 8.509 ± 0.161
LeuMet: 1.983 ± 0.058
LeuAsn: 2.761 ± 0.069
LeuPro: 5.225 ± 0.097
LeuGln: 2.446 ± 0.067
LeuArg: 5.401 ± 0.103
LeuSer: 5.887 ± 0.119
LeuThr: 5.901 ± 0.095
LeuVal: 7.516 ± 0.126
LeuTrp: 1.156 ± 0.048
LeuTyr: 1.713 ± 0.05
LeuXaa: 0.002 ± 0.001
Met
MetAla: 2.722 ± 0.059
MetCys: 0.195 ± 0.016
MetAsp: 1.124 ± 0.042
MetGlu: 1.081 ± 0.048
MetPhe: 0.752 ± 0.035
MetGly: 1.835 ± 0.059
MetHis: 0.433 ± 0.027
MetIle: 1.168 ± 0.048
MetLys: 0.816 ± 0.031
MetLeu: 2.163 ± 0.062
MetMet: 0.525 ± 0.029
MetAsn: 0.7 ± 0.035
MetPro: 1.179 ± 0.043
MetGln: 0.605 ± 0.03
MetArg: 1.495 ± 0.052
MetSer: 1.609 ± 0.049
MetThr: 1.698 ± 0.046
MetVal: 1.789 ± 0.054
MetTrp: 0.265 ± 0.022
MetTyr: 0.4 ± 0.027
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.171 ± 0.073
AsnCys: 0.241 ± 0.022
AsnAsp: 1.63 ± 0.057
AsnGlu: 1.467 ± 0.051
AsnPhe: 0.901 ± 0.045
AsnGly: 2.253 ± 0.08
AsnHis: 0.735 ± 0.035
AsnIle: 1.53 ± 0.05
AsnLys: 1.159 ± 0.051
AsnLeu: 2.736 ± 0.07
AsnMet: 0.659 ± 0.028
AsnAsn: 1.126 ± 0.058
AsnPro: 2.305 ± 0.054
AsnGln: 1.016 ± 0.04
AsnArg: 1.6 ± 0.051
AsnSer: 1.665 ± 0.054
AsnThr: 2.035 ± 0.066
AsnVal: 2.225 ± 0.07
AsnTrp: 0.393 ± 0.023
AsnTyr: 0.796 ± 0.039
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.703 ± 0.152
ProCys: 0.302 ± 0.023
ProAsp: 3.366 ± 0.078
ProGlu: 3.86 ± 0.071
ProPhe: 1.635 ± 0.047
ProGly: 4.571 ± 0.095
ProHis: 1.33 ± 0.052
ProIle: 2.23 ± 0.06
ProLys: 1.751 ± 0.059
ProLeu: 4.405 ± 0.09
ProMet: 1.051 ± 0.039
ProAsn: 1.626 ± 0.051
ProPro: 2.123 ± 0.084
ProGln: 2.112 ± 0.065
ProArg: 2.631 ± 0.06
ProSer: 2.664 ± 0.06
ProThr: 3.778 ± 0.093
ProVal: 4.4 ± 0.106
ProTrp: 0.683 ± 0.034
ProTyr: 1.068 ± 0.04
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.618 ± 0.099
GlnCys: 0.232 ± 0.022
GlnAsp: 1.676 ± 0.057
GlnGlu: 1.943 ± 0.061
GlnPhe: 1.005 ± 0.043
GlnGly: 2.384 ± 0.059
GlnHis: 0.921 ± 0.04
GlnIle: 1.498 ± 0.054
GlnLys: 1.055 ± 0.047
GlnLeu: 4.081 ± 0.084
GlnMet: 0.779 ± 0.037
GlnAsn: 0.76 ± 0.039
GlnPro: 2.227 ± 0.075
GlnGln: 1.913 ± 0.085
GlnArg: 2.599 ± 0.067
GlnSer: 1.461 ± 0.048
GlnThr: 1.508 ± 0.051
GlnVal: 2.603 ± 0.073
GlnTrp: 0.607 ± 0.032
GlnTyr: 0.65 ± 0.033
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.371 ± 0.102
ArgCys: 0.444 ± 0.025
ArgAsp: 3.511 ± 0.09
ArgGlu: 3.607 ± 0.081
ArgPhe: 2.018 ± 0.065
ArgGly: 4.316 ± 0.078
ArgHis: 1.367 ± 0.047
ArgIle: 3.13 ± 0.07
ArgLys: 1.971 ± 0.062
ArgLeu: 5.413 ± 0.111
ArgMet: 1.531 ± 0.052
ArgAsn: 1.707 ± 0.049
ArgPro: 2.838 ± 0.075
ArgGln: 2.161 ± 0.061
ArgArg: 4.284 ± 0.09
ArgSer: 2.918 ± 0.083
ArgThr: 3.583 ± 0.093
ArgVal: 4.312 ± 0.091
ArgTrp: 0.793 ± 0.038
ArgTyr: 1.492 ± 0.05
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.072 ± 0.107
SerCys: 0.474 ± 0.023
SerAsp: 3.122 ± 0.077
SerGlu: 2.733 ± 0.069
SerPhe: 2.08 ± 0.06
SerGly: 5.051 ± 0.103
SerHis: 1.165 ± 0.046
SerIle: 2.512 ± 0.067
SerLys: 1.746 ± 0.06
SerLeu: 4.992 ± 0.094
SerMet: 1.431 ± 0.046
SerAsn: 1.569 ± 0.048
SerPro: 2.913 ± 0.06
SerGln: 1.858 ± 0.058
SerArg: 3.163 ± 0.068
SerSer: 3.392 ± 0.084
SerThr: 3.79 ± 0.085
SerVal: 4.199 ± 0.094
SerTrp: 0.828 ± 0.037
SerTyr: 1.272 ± 0.045
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.365 ± 0.116
ThrCys: 0.595 ± 0.04
ThrAsp: 3.442 ± 0.066
ThrGlu: 2.936 ± 0.07
ThrPhe: 2.152 ± 0.055
ThrGly: 5.431 ± 0.111
ThrHis: 1.679 ± 0.059
ThrIle: 3.663 ± 0.087
ThrLys: 2.099 ± 0.061
ThrLeu: 5.719 ± 0.089
ThrMet: 1.423 ± 0.045
ThrAsn: 1.893 ± 0.08
ThrPro: 4.406 ± 0.115
ThrGln: 2.025 ± 0.064
ThrArg: 3.364 ± 0.075
ThrSer: 3.621 ± 0.084
ThrThr: 4.999 ± 0.13
ThrVal: 5.57 ± 0.108
ThrTrp: 0.877 ± 0.034
ThrTyr: 1.466 ± 0.049
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.309 ± 0.167
ValCys: 0.842 ± 0.039
ValAsp: 5.802 ± 0.096
ValGlu: 5.085 ± 0.102
ValPhe: 2.785 ± 0.076
ValGly: 6.506 ± 0.107
ValHis: 1.673 ± 0.05
ValIle: 4.229 ± 0.09
ValLys: 2.527 ± 0.069
ValLeu: 7.62 ± 0.118
ValMet: 1.748 ± 0.052
ValAsn: 2.253 ± 0.063
ValPro: 3.962 ± 0.079
ValGln: 2.004 ± 0.056
ValArg: 4.487 ± 0.084
ValSer: 4.701 ± 0.087
ValThr: 5.303 ± 0.133
ValVal: 7.885 ± 0.124
ValTrp: 0.973 ± 0.042
ValTyr: 1.595 ± 0.05
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.348 ± 0.05
TrpCys: 0.142 ± 0.015
TrpAsp: 0.705 ± 0.034
TrpGlu: 0.721 ± 0.041
TrpPhe: 0.497 ± 0.029
TrpGly: 0.872 ± 0.038
TrpHis: 0.268 ± 0.019
TrpIle: 0.627 ± 0.033
TrpLys: 0.4 ± 0.026
TrpLeu: 1.458 ± 0.058
TrpMet: 0.348 ± 0.024
TrpAsn: 0.381 ± 0.026
TrpPro: 0.604 ± 0.031
TrpGln: 0.549 ± 0.027
TrpArg: 0.863 ± 0.037
TrpSer: 0.674 ± 0.032
TrpThr: 0.647 ± 0.036
TrpVal: 1.095 ± 0.043
TrpTrp: 0.314 ± 0.024
TrpTyr: 0.271 ± 0.02
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.517 ± 0.057
TyrCys: 0.198 ± 0.017
TyrAsp: 1.37 ± 0.047
TyrGlu: 1.234 ± 0.046
TyrPhe: 0.776 ± 0.031
TyrGly: 1.981 ± 0.058
TyrHis: 0.419 ± 0.026
TyrIle: 0.943 ± 0.038
TyrLys: 0.613 ± 0.036
TyrLeu: 2.138 ± 0.058
TyrMet: 0.354 ± 0.025
TyrAsn: 0.691 ± 0.038
TyrPro: 1.24 ± 0.048
TyrGln: 0.883 ± 0.035
TyrArg: 1.435 ± 0.052
TyrSer: 1.289 ± 0.04
TyrThr: 1.524 ± 0.051
TyrVal: 1.704 ± 0.049
TyrTrp: 0.3 ± 0.022
TyrTyr: 0.578 ± 0.036
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.002 ± 0.001
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2002 proteins (655671 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
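Protein-level bootstrapping can be sketched as follows. This is an illustration under stated assumptions, not the exact procedure used for this page: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the standard deviation of the resampled frequencies is reported as the error. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Proteins, not individual residues, are the resampling unit.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


toy_proteins = ["MAAAG", "MKAL", "AAKL"]  # made-up sequences, for illustration only
mean_aa, err_aa = bootstrap_error(toy_proteins, "AA")
print(f"AA: {mean_aa:.3f} ± {err_aa:.3f} per mille")
```

Resampling whole proteins keeps the dipeptides of one protein together, so the error reflects variation between proteins rather than between individual positions.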

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski