Amino acid dipepetide frequency for Hoeflea marina

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.798AlaAla: 17.798 ± 0.165
1.047AlaCys: 1.047 ± 0.028
7.545AlaAsp: 7.545 ± 0.078
7.585AlaGlu: 7.585 ± 0.091
4.544AlaPhe: 4.544 ± 0.064
12.054AlaGly: 12.054 ± 0.112
2.26AlaHis: 2.26 ± 0.045
6.855AlaIle: 6.855 ± 0.077
3.222AlaLys: 3.222 ± 0.05
12.712AlaLeu: 12.712 ± 0.108
3.862AlaMet: 3.862 ± 0.044
2.935AlaAsn: 2.935 ± 0.051
5.286AlaPro: 5.286 ± 0.074
3.316AlaGln: 3.316 ± 0.054
8.962AlaArg: 8.962 ± 0.093
6.977AlaSer: 6.977 ± 0.092
6.022AlaThr: 6.022 ± 0.068
8.942AlaVal: 8.942 ± 0.094
1.482AlaTrp: 1.482 ± 0.031
2.452AlaTyr: 2.452 ± 0.039
0.0AlaXaa: 0.0 ± 0.0
Cys
0.961CysAla: 0.961 ± 0.026
0.111CysCys: 0.111 ± 0.008
0.514CysAsp: 0.514 ± 0.019
0.411CysGlu: 0.411 ± 0.015
0.348CysPhe: 0.348 ± 0.014
0.938CysGly: 0.938 ± 0.027
0.25CysHis: 0.25 ± 0.013
0.388CysIle: 0.388 ± 0.016
0.16CysLys: 0.16 ± 0.009
0.828CysLeu: 0.828 ± 0.025
0.199CysMet: 0.199 ± 0.011
0.203CysAsn: 0.203 ± 0.011
0.401CysPro: 0.401 ± 0.019
0.212CysGln: 0.212 ± 0.013
0.624CysArg: 0.624 ± 0.022
0.441CysSer: 0.441 ± 0.017
0.381CysThr: 0.381 ± 0.018
0.595CysVal: 0.595 ± 0.02
0.111CysTrp: 0.111 ± 0.007
0.209CysTyr: 0.209 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
6.973AspAla: 6.973 ± 0.076
0.544AspCys: 0.544 ± 0.019
3.194AspAsp: 3.194 ± 0.055
3.339AspGlu: 3.339 ± 0.063
2.348AspPhe: 2.348 ± 0.041
5.54AspGly: 5.54 ± 0.075
1.371AspHis: 1.371 ± 0.033
3.497AspIle: 3.497 ± 0.053
1.617AspLys: 1.617 ± 0.032
5.894AspLeu: 5.894 ± 0.066
1.567AspMet: 1.567 ± 0.029
1.399AspAsn: 1.399 ± 0.03
3.513AspPro: 3.513 ± 0.052
1.632AspGln: 1.632 ± 0.035
4.546AspArg: 4.546 ± 0.061
2.542AspSer: 2.542 ± 0.044
2.838AspThr: 2.838 ± 0.055
4.021AspVal: 4.021 ± 0.052
0.969AspTrp: 0.969 ± 0.024
1.478AspTyr: 1.478 ± 0.029
0.0AspXaa: 0.0 ± 0.0
Glu
7.44GluAla: 7.44 ± 0.083
0.361GluCys: 0.361 ± 0.015
2.714GluAsp: 2.714 ± 0.052
2.766GluGlu: 2.766 ± 0.058
1.9GluPhe: 1.9 ± 0.04
4.071GluGly: 4.071 ± 0.055
1.087GluHis: 1.087 ± 0.028
3.706GluIle: 3.706 ± 0.05
1.958GluLys: 1.958 ± 0.045
5.252GluLeu: 5.252 ± 0.072
1.568GluMet: 1.568 ± 0.03
1.52GluAsn: 1.52 ± 0.039
2.64GluPro: 2.64 ± 0.055
1.823GluGln: 1.823 ± 0.033
4.313GluArg: 4.313 ± 0.069
2.588GluSer: 2.588 ± 0.044
3.572GluThr: 3.572 ± 0.056
3.64GluVal: 3.64 ± 0.051
0.632GluTrp: 0.632 ± 0.019
0.916GluTyr: 0.916 ± 0.025
0.0GluXaa: 0.0 ± 0.0
Phe
4.715PheAla: 4.715 ± 0.062
0.4PheCys: 0.4 ± 0.017
2.833PheAsp: 2.833 ± 0.048
2.134PheGlu: 2.134 ± 0.035
1.459PhePhe: 1.459 ± 0.038
3.81PheGly: 3.81 ± 0.065
0.775PheHis: 0.775 ± 0.025
1.861PheIle: 1.861 ± 0.035
0.947PheLys: 0.947 ± 0.029
3.443PheLeu: 3.443 ± 0.056
0.876PheMet: 0.876 ± 0.023
1.065PheAsn: 1.065 ± 0.028
1.626PhePro: 1.626 ± 0.034
0.973PheGln: 0.973 ± 0.024
2.321PheArg: 2.321 ± 0.044
2.567PheSer: 2.567 ± 0.041
2.009PheThr: 2.009 ± 0.042
2.874PheVal: 2.874 ± 0.053
0.551PheTrp: 0.551 ± 0.02
0.896PheTyr: 0.896 ± 0.027
0.0PheXaa: 0.0 ± 0.0
Gly
9.667GlyAla: 9.667 ± 0.099
0.841GlyCys: 0.841 ± 0.029
4.89GlyAsp: 4.89 ± 0.083
4.889GlyGlu: 4.889 ± 0.057
3.975GlyPhe: 3.975 ± 0.052
7.977GlyGly: 7.977 ± 0.157
2.059GlyHis: 2.059 ± 0.038
5.066GlyIle: 5.066 ± 0.058
3.086GlyLys: 3.086 ± 0.052
9.285GlyLeu: 9.285 ± 0.102
2.572GlyMet: 2.572 ± 0.043
2.366GlyAsn: 2.366 ± 0.065
3.48GlyPro: 3.48 ± 0.048
2.808GlyGln: 2.808 ± 0.043
6.363GlyArg: 6.363 ± 0.069
5.242GlySer: 5.242 ± 0.077
4.747GlyThr: 4.747 ± 0.112
6.209GlyVal: 6.209 ± 0.076
1.353GlyTrp: 1.353 ± 0.031
2.361GlyTyr: 2.361 ± 0.039
0.0GlyXaa: 0.0 ± 0.0
His
2.426HisAla: 2.426 ± 0.044
0.239HisCys: 0.239 ± 0.014
1.255HisAsp: 1.255 ± 0.031
0.969HisGlu: 0.969 ± 0.028
0.859HisPhe: 0.859 ± 0.023
2.007HisGly: 2.007 ± 0.039
0.549HisHis: 0.549 ± 0.02
0.954HisIle: 0.954 ± 0.026
0.436HisLys: 0.436 ± 0.017
2.054HisLeu: 2.054 ± 0.042
0.543HisMet: 0.543 ± 0.019
0.465HisAsn: 0.465 ± 0.021
1.299HisPro: 1.299 ± 0.036
0.593HisGln: 0.593 ± 0.022
1.422HisArg: 1.422 ± 0.032
1.027HisSer: 1.027 ± 0.024
0.788HisThr: 0.788 ± 0.024
1.571HisVal: 1.571 ± 0.033
0.326HisTrp: 0.326 ± 0.016
0.532HisTyr: 0.532 ± 0.02
0.0HisXaa: 0.0 ± 0.0
Ile
7.762IleAla: 7.762 ± 0.081
0.55IleCys: 0.55 ± 0.016
3.912IleAsp: 3.912 ± 0.052
3.588IleGlu: 3.588 ± 0.049
1.841IlePhe: 1.841 ± 0.042
5.472IleGly: 5.472 ± 0.07
1.009IleHis: 1.009 ± 0.025
2.678IleIle: 2.678 ± 0.054
1.407IleLys: 1.407 ± 0.032
4.773IleLeu: 4.773 ± 0.067
1.075IleMet: 1.075 ± 0.027
1.608IleAsn: 1.608 ± 0.033
2.417IlePro: 2.417 ± 0.04
1.134IleGln: 1.134 ± 0.029
3.646IleArg: 3.646 ± 0.053
3.399IleSer: 3.399 ± 0.056
2.789IleThr: 2.789 ± 0.046
4.248IleVal: 4.248 ± 0.052
0.64IleTrp: 0.64 ± 0.023
1.188IleTyr: 1.188 ± 0.029
0.0IleXaa: 0.0 ± 0.0
Lys
3.855LysAla: 3.855 ± 0.057
0.15LysCys: 0.15 ± 0.01
1.512LysAsp: 1.512 ± 0.034
1.207LysGlu: 1.207 ± 0.028
0.843LysPhe: 0.843 ± 0.026
2.37LysGly: 2.37 ± 0.045
0.502LysHis: 0.502 ± 0.018
1.611LysIle: 1.611 ± 0.033
1.044LysLys: 1.044 ± 0.032
2.965LysLeu: 2.965 ± 0.051
0.73LysMet: 0.73 ± 0.025
0.735LysAsn: 0.735 ± 0.025
1.906LysPro: 1.906 ± 0.041
0.87LysGln: 0.87 ± 0.024
2.018LysArg: 2.018 ± 0.04
1.879LysSer: 1.879 ± 0.039
1.822LysThr: 1.822 ± 0.039
2.184LysVal: 2.184 ± 0.042
0.352LysTrp: 0.352 ± 0.015
0.583LysTyr: 0.583 ± 0.02
0.0LysXaa: 0.0 ± 0.0
Leu
13.706LeuAla: 13.706 ± 0.123
0.892LeuCys: 0.892 ± 0.029
6.076LeuAsp: 6.076 ± 0.072
5.129LeuGlu: 5.129 ± 0.059
3.638LeuPhe: 3.638 ± 0.067
8.45LeuGly: 8.45 ± 0.083
1.681LeuHis: 1.681 ± 0.032
5.196LeuIle: 5.196 ± 0.067
3.326LeuLys: 3.326 ± 0.057
9.032LeuLeu: 9.032 ± 0.11
2.635LeuMet: 2.635 ± 0.046
2.446LeuAsn: 2.446 ± 0.037
5.189LeuPro: 5.189 ± 0.063
2.567LeuGln: 2.567 ± 0.044
6.287LeuArg: 6.287 ± 0.09
6.895LeuSer: 6.895 ± 0.091
5.475LeuThr: 5.475 ± 0.073
7.808LeuVal: 7.808 ± 0.081
1.053LeuTrp: 1.053 ± 0.028
1.966LeuTyr: 1.966 ± 0.038
0.0LeuXaa: 0.0 ± 0.0
Met
3.628MetAla: 3.628 ± 0.044
0.164MetCys: 0.164 ± 0.011
1.28MetAsp: 1.28 ± 0.029
1.178MetGlu: 1.178 ± 0.027
0.754MetPhe: 0.754 ± 0.023
1.912MetGly: 1.912 ± 0.041
0.467MetHis: 0.467 ± 0.018
1.619MetIle: 1.619 ± 0.036
0.989MetLys: 0.989 ± 0.028
2.71MetLeu: 2.71 ± 0.047
0.769MetMet: 0.769 ± 0.02
0.854MetAsn: 0.854 ± 0.021
1.563MetPro: 1.563 ± 0.034
0.923MetGln: 0.923 ± 0.025
1.897MetArg: 1.897 ± 0.039
1.813MetSer: 1.813 ± 0.037
2.072MetThr: 2.072 ± 0.031
1.961MetVal: 1.961 ± 0.039
0.211MetTrp: 0.211 ± 0.013
0.307MetTyr: 0.307 ± 0.016
0.0MetXaa: 0.0 ± 0.0
Asn
3.105AsnAla: 3.105 ± 0.053
0.233AsnCys: 0.233 ± 0.012
1.446AsnAsp: 1.446 ± 0.045
1.215AsnGlu: 1.215 ± 0.029
0.981AsnPhe: 0.981 ± 0.026
2.439AsnGly: 2.439 ± 0.051
0.523AsnHis: 0.523 ± 0.017
1.357AsnIle: 1.357 ± 0.034
0.641AsnLys: 0.641 ± 0.025
2.555AsnLeu: 2.555 ± 0.047
0.643AsnMet: 0.643 ± 0.018
0.659AsnAsn: 0.659 ± 0.023
1.827AsnPro: 1.827 ± 0.037
0.742AsnGln: 0.742 ± 0.022
1.929AsnArg: 1.929 ± 0.036
1.429AsnSer: 1.429 ± 0.037
1.204AsnThr: 1.204 ± 0.035
1.872AsnVal: 1.872 ± 0.044
0.434AsnTrp: 0.434 ± 0.016
0.632AsnTyr: 0.632 ± 0.022
0.0AsnXaa: 0.0 ± 0.0
Pro
6.468ProAla: 6.468 ± 0.08
0.299ProCys: 0.299 ± 0.016
3.769ProAsp: 3.769 ± 0.057
3.535ProGlu: 3.535 ± 0.057
1.93ProPhe: 1.93 ± 0.038
4.72ProGly: 4.72 ± 0.059
1.033ProHis: 1.033 ± 0.026
2.132ProIle: 2.132 ± 0.036
1.409ProLys: 1.409 ± 0.032
4.646ProLeu: 4.646 ± 0.064
1.257ProMet: 1.257 ± 0.032
1.147ProAsn: 1.147 ± 0.03
2.167ProPro: 2.167 ± 0.046
1.492ProGln: 1.492 ± 0.032
2.732ProArg: 2.732 ± 0.044
2.761ProSer: 2.761 ± 0.045
2.238ProThr: 2.238 ± 0.043
4.328ProVal: 4.328 ± 0.061
0.652ProTrp: 0.652 ± 0.024
1.111ProTyr: 1.111 ± 0.028
0.0ProXaa: 0.0 ± 0.0
Gln
3.883GlnAla: 3.883 ± 0.057
0.178GlnCys: 0.178 ± 0.011
1.336GlnAsp: 1.336 ± 0.031
1.184GlnGlu: 1.184 ± 0.027
1.044GlnPhe: 1.044 ± 0.026
2.128GlnGly: 2.128 ± 0.043
0.544GlnHis: 0.544 ± 0.023
1.767GlnIle: 1.767 ± 0.037
0.916GlnLys: 0.916 ± 0.025
2.803GlnLeu: 2.803 ± 0.044
0.907GlnMet: 0.907 ± 0.022
0.774GlnAsn: 0.774 ± 0.023
1.58GlnPro: 1.58 ± 0.035
1.03GlnGln: 1.03 ± 0.034
2.044GlnArg: 2.044 ± 0.041
1.85GlnSer: 1.85 ± 0.04
1.682GlnThr: 1.682 ± 0.035
2.094GlnVal: 2.094 ± 0.035
0.375GlnTrp: 0.375 ± 0.015
0.514GlnTyr: 0.514 ± 0.017
0.0GlnXaa: 0.0 ± 0.0
Arg
7.6ArgAla: 7.6 ± 0.087
0.442ArgCys: 0.442 ± 0.016
4.107ArgAsp: 4.107 ± 0.068
3.658ArgGlu: 3.658 ± 0.064
2.942ArgPhe: 2.942 ± 0.047
4.747ArgGly: 4.747 ± 0.06
1.822ArgHis: 1.822 ± 0.039
4.368ArgIle: 4.368 ± 0.061
2.229ArgLys: 2.229 ± 0.039
7.784ArgLeu: 7.784 ± 0.091
2.061ArgMet: 2.061 ± 0.038
1.892ArgAsn: 1.892 ± 0.036
3.593ArgPro: 3.593 ± 0.061
2.59ArgGln: 2.59 ± 0.047
5.472ArgArg: 5.472 ± 0.076
4.005ArgSer: 4.005 ± 0.055
3.383ArgThr: 3.383 ± 0.054
4.403ArgVal: 4.403 ± 0.056
0.836ArgTrp: 0.836 ± 0.025
1.601ArgTyr: 1.601 ± 0.034
0.0ArgXaa: 0.0 ± 0.0
Ser
6.716SerAla: 6.716 ± 0.074
0.421SerCys: 0.421 ± 0.017
3.315SerAsp: 3.315 ± 0.051
2.934SerGlu: 2.934 ± 0.048
2.425SerPhe: 2.425 ± 0.048
6.609SerGly: 6.609 ± 0.124
1.243SerHis: 1.243 ± 0.029
3.209SerIle: 3.209 ± 0.052
1.538SerLys: 1.538 ± 0.034
5.782SerLeu: 5.782 ± 0.073
1.569SerMet: 1.569 ± 0.033
1.476SerAsn: 1.476 ± 0.037
2.927SerPro: 2.927 ± 0.053
1.6SerGln: 1.6 ± 0.03
4.001SerArg: 4.001 ± 0.056
3.203SerSer: 3.203 ± 0.059
2.956SerThr: 2.956 ± 0.058
4.566SerVal: 4.566 ± 0.066
0.77SerTrp: 0.77 ± 0.025
1.395SerTyr: 1.395 ± 0.032
0.0SerXaa: 0.0 ± 0.0
Thr
6.467ThrAla: 6.467 ± 0.081
0.421ThrCys: 0.421 ± 0.016
2.959ThrAsp: 2.959 ± 0.05
2.748ThrGlu: 2.748 ± 0.04
1.851ThrPhe: 1.851 ± 0.034
5.7ThrGly: 5.7 ± 0.096
1.011ThrHis: 1.011 ± 0.025
2.998ThrIle: 2.998 ± 0.053
1.215ThrLys: 1.215 ± 0.033
5.694ThrLeu: 5.694 ± 0.078
1.364ThrMet: 1.364 ± 0.039
1.3ThrAsn: 1.3 ± 0.036
3.045ThrPro: 3.045 ± 0.047
1.317ThrGln: 1.317 ± 0.035
3.337ThrArg: 3.337 ± 0.055
2.951ThrSer: 2.951 ± 0.053
2.815ThrThr: 2.815 ± 0.061
4.489ThrVal: 4.489 ± 0.076
0.608ThrTrp: 0.608 ± 0.023
1.195ThrTyr: 1.195 ± 0.032
0.0ThrXaa: 0.0 ± 0.0
Val
9.191ValAla: 9.191 ± 0.087
0.628ValCys: 0.628 ± 0.021
4.115ValAsp: 4.115 ± 0.051
4.331ValGlu: 4.331 ± 0.052
2.972ValPhe: 2.972 ± 0.046
5.416ValGly: 5.416 ± 0.07
1.351ValHis: 1.351 ± 0.034
4.338ValIle: 4.338 ± 0.057
1.98ValLys: 1.98 ± 0.04
7.409ValLeu: 7.409 ± 0.085
2.028ValMet: 2.028 ± 0.039
2.004ValAsn: 2.004 ± 0.038
3.689ValPro: 3.689 ± 0.054
1.781ValGln: 1.781 ± 0.035
4.861ValArg: 4.861 ± 0.061
4.949ValSer: 4.949 ± 0.063
4.74ValThr: 4.74 ± 0.087
5.534ValVal: 5.534 ± 0.067
0.876ValTrp: 0.876 ± 0.029
1.494ValTyr: 1.494 ± 0.032
0.0ValXaa: 0.0 ± 0.0
Trp
1.154TrpAla: 1.154 ± 0.025
0.126TrpCys: 0.126 ± 0.009
0.644TrpAsp: 0.644 ± 0.022
0.528TrpGlu: 0.528 ± 0.02
0.55TrpPhe: 0.55 ± 0.021
0.82TrpGly: 0.82 ± 0.02
0.3TrpHis: 0.3 ± 0.014
0.657TrpIle: 0.657 ± 0.021
0.448TrpLys: 0.448 ± 0.017
1.582TrpLeu: 1.582 ± 0.038
0.378TrpMet: 0.378 ± 0.015
0.443TrpAsn: 0.443 ± 0.018
0.634TrpPro: 0.634 ± 0.022
0.522TrpGln: 0.522 ± 0.019
1.032TrpArg: 1.032 ± 0.028
0.881TrpSer: 0.881 ± 0.025
0.793TrpThr: 0.793 ± 0.023
0.768TrpVal: 0.768 ± 0.022
0.205TrpTrp: 0.205 ± 0.014
0.293TrpTyr: 0.293 ± 0.013
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.343TyrAla: 2.343 ± 0.041
0.238TyrCys: 0.238 ± 0.013
1.401TyrAsp: 1.401 ± 0.033
1.139TyrGlu: 1.139 ± 0.027
0.929TyrPhe: 0.929 ± 0.027
2.055TyrGly: 2.055 ± 0.036
0.479TyrHis: 0.479 ± 0.019
0.881TyrIle: 0.881 ± 0.026
0.569TyrLys: 0.569 ± 0.021
2.246TyrLeu: 2.246 ± 0.037
0.47TyrMet: 0.47 ± 0.017
0.575TyrAsn: 0.575 ± 0.021
1.046TyrPro: 1.046 ± 0.025
0.706TyrGln: 0.706 ± 0.021
1.727TyrArg: 1.727 ± 0.031
1.259TyrSer: 1.259 ± 0.037
1.16TyrThr: 1.16 ± 0.039
1.579TyrVal: 1.579 ± 0.032
0.322TyrTrp: 0.322 ± 0.015
0.525TyrTyr: 0.525 ± 0.021
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4798 proteins (1545255 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski