Amino acid dipepetide frequency for Hydrocarboniphaga effusa AP103

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.068AlaAla: 16.068 ± 0.15
1.194AlaCys: 1.194 ± 0.025
6.897AlaAsp: 6.897 ± 0.072
7.224AlaGlu: 7.224 ± 0.078
3.819AlaPhe: 3.819 ± 0.049
9.964AlaGly: 9.964 ± 0.102
2.276AlaHis: 2.276 ± 0.039
5.605AlaIle: 5.605 ± 0.063
3.732AlaLys: 3.732 ± 0.065
13.618AlaLeu: 13.618 ± 0.141
2.914AlaMet: 2.914 ± 0.048
2.99AlaAsn: 2.99 ± 0.055
5.807AlaPro: 5.807 ± 0.075
5.221AlaGln: 5.221 ± 0.076
8.619AlaArg: 8.619 ± 0.093
7.43AlaSer: 7.43 ± 0.085
5.645AlaThr: 5.645 ± 0.07
8.124AlaVal: 8.124 ± 0.074
1.583AlaTrp: 1.583 ± 0.035
2.714AlaTyr: 2.714 ± 0.048
0.0AlaXaa: 0.0 ± 0.0
Cys
1.114CysAla: 1.114 ± 0.032
0.149CysCys: 0.149 ± 0.011
0.479CysAsp: 0.479 ± 0.017
0.517CysGlu: 0.517 ± 0.021
0.307CysPhe: 0.307 ± 0.014
0.94CysGly: 0.94 ± 0.027
0.249CysHis: 0.249 ± 0.014
0.365CysIle: 0.365 ± 0.016
0.242CysLys: 0.242 ± 0.014
0.803CysLeu: 0.803 ± 0.025
0.164CysMet: 0.164 ± 0.011
0.235CysAsn: 0.235 ± 0.012
0.445CysPro: 0.445 ± 0.021
0.214CysGln: 0.214 ± 0.014
0.584CysArg: 0.584 ± 0.022
0.559CysSer: 0.559 ± 0.019
0.421CysThr: 0.421 ± 0.017
0.698CysVal: 0.698 ± 0.02
0.129CysTrp: 0.129 ± 0.008
0.201CysTyr: 0.201 ± 0.013
0.0CysXaa: 0.0 ± 0.0
Asp
7.183AspAla: 7.183 ± 0.077
0.491AspCys: 0.491 ± 0.019
3.166AspAsp: 3.166 ± 0.053
3.622AspGlu: 3.622 ± 0.05
2.237AspPhe: 2.237 ± 0.035
5.172AspGly: 5.172 ± 0.071
1.164AspHis: 1.164 ± 0.031
2.505AspIle: 2.505 ± 0.048
1.692AspLys: 1.692 ± 0.042
5.709AspLeu: 5.709 ± 0.064
0.943AspMet: 0.943 ± 0.023
1.311AspAsn: 1.311 ± 0.035
3.002AspPro: 3.002 ± 0.041
1.763AspGln: 1.763 ± 0.032
4.192AspArg: 4.192 ± 0.052
2.798AspSer: 2.798 ± 0.046
2.525AspThr: 2.525 ± 0.044
3.836AspVal: 3.836 ± 0.057
1.052AspTrp: 1.052 ± 0.026
1.716AspTyr: 1.716 ± 0.038
0.0AspXaa: 0.0 ± 0.0
Glu
7.405GluAla: 7.405 ± 0.088
0.395GluCys: 0.395 ± 0.016
2.462GluAsp: 2.462 ± 0.042
2.468GluGlu: 2.468 ± 0.048
1.887GluPhe: 1.887 ± 0.033
3.752GluGly: 3.752 ± 0.05
1.409GluHis: 1.409 ± 0.032
2.996GluIle: 2.996 ± 0.043
1.921GluLys: 1.921 ± 0.038
6.648GluLeu: 6.648 ± 0.073
1.066GluMet: 1.066 ± 0.028
1.415GluAsn: 1.415 ± 0.039
2.783GluPro: 2.783 ± 0.049
2.688GluGln: 2.688 ± 0.048
5.217GluArg: 5.217 ± 0.067
3.265GluSer: 3.265 ± 0.043
2.859GluThr: 2.859 ± 0.048
3.79GluVal: 3.79 ± 0.053
0.701GluTrp: 0.701 ± 0.022
1.177GluTyr: 1.177 ± 0.031
0.0GluXaa: 0.0 ± 0.0
Phe
4.069PheAla: 4.069 ± 0.056
0.385PheCys: 0.385 ± 0.016
2.724PheAsp: 2.724 ± 0.047
2.21PheGlu: 2.21 ± 0.041
1.307PhePhe: 1.307 ± 0.041
3.481PheGly: 3.481 ± 0.053
0.723PheHis: 0.723 ± 0.024
1.424PheIle: 1.424 ± 0.033
1.14PheLys: 1.14 ± 0.029
2.794PheLeu: 2.794 ± 0.05
0.766PheMet: 0.766 ± 0.021
1.125PheAsn: 1.125 ± 0.032
1.434PhePro: 1.434 ± 0.029
0.952PheGln: 0.952 ± 0.026
2.098PheArg: 2.098 ± 0.037
2.237PheSer: 2.237 ± 0.041
1.937PheThr: 1.937 ± 0.039
2.83PheVal: 2.83 ± 0.039
0.489PheTrp: 0.489 ± 0.02
0.93PheTyr: 0.93 ± 0.026
0.0PheXaa: 0.0 ± 0.0
Gly
8.693GlyAla: 8.693 ± 0.085
0.814GlyCys: 0.814 ± 0.023
4.575GlyAsp: 4.575 ± 0.062
4.73GlyGlu: 4.73 ± 0.062
3.372GlyPhe: 3.372 ± 0.056
7.275GlyGly: 7.275 ± 0.105
1.763GlyHis: 1.763 ± 0.036
4.166GlyIle: 4.166 ± 0.056
3.144GlyLys: 3.144 ± 0.065
9.024GlyLeu: 9.024 ± 0.086
1.901GlyMet: 1.901 ± 0.041
2.266GlyAsn: 2.266 ± 0.052
2.998GlyPro: 2.998 ± 0.053
2.984GlyGln: 2.984 ± 0.048
5.892GlyArg: 5.892 ± 0.065
5.324GlySer: 5.324 ± 0.084
4.085GlyThr: 4.085 ± 0.063
6.251GlyVal: 6.251 ± 0.07
1.428GlyTrp: 1.428 ± 0.032
2.479GlyTyr: 2.479 ± 0.042
0.0GlyXaa: 0.0 ± 0.0
His
2.412HisAla: 2.412 ± 0.044
0.262HisCys: 0.262 ± 0.013
1.15HisAsp: 1.15 ± 0.029
1.281HisGlu: 1.281 ± 0.032
0.858HisPhe: 0.858 ± 0.028
1.893HisGly: 1.893 ± 0.034
0.611HisHis: 0.611 ± 0.026
0.741HisIle: 0.741 ± 0.021
0.497HisLys: 0.497 ± 0.02
2.179HisLeu: 2.179 ± 0.039
0.394HisMet: 0.394 ± 0.015
0.433HisAsn: 0.433 ± 0.018
1.274HisPro: 1.274 ± 0.03
0.701HisGln: 0.701 ± 0.02
1.675HisArg: 1.675 ± 0.04
1.034HisSer: 1.034 ± 0.028
0.812HisThr: 0.812 ± 0.022
1.356HisVal: 1.356 ± 0.029
0.385HisTrp: 0.385 ± 0.014
0.686HisTyr: 0.686 ± 0.018
0.0HisXaa: 0.0 ± 0.0
Ile
6.684IleAla: 6.684 ± 0.065
0.387IleCys: 0.387 ± 0.017
3.503IleAsp: 3.503 ± 0.054
3.761IleGlu: 3.761 ± 0.054
1.312IlePhe: 1.312 ± 0.036
4.819IleGly: 4.819 ± 0.061
0.803IleHis: 0.803 ± 0.023
1.573IleIle: 1.573 ± 0.035
1.529IleLys: 1.529 ± 0.035
3.149IleLeu: 3.149 ± 0.054
0.677IleMet: 0.677 ± 0.022
1.401IleAsn: 1.401 ± 0.035
2.053IlePro: 2.053 ± 0.042
1.291IleGln: 1.291 ± 0.031
2.911IleArg: 2.911 ± 0.04
2.573IleSer: 2.573 ± 0.048
2.388IleThr: 2.388 ± 0.041
3.946IleVal: 3.946 ± 0.051
0.545IleTrp: 0.545 ± 0.016
1.058IleTyr: 1.058 ± 0.03
0.0IleXaa: 0.0 ± 0.0
Lys
3.974LysAla: 3.974 ± 0.069
0.176LysCys: 0.176 ± 0.011
1.449LysAsp: 1.449 ± 0.036
1.326LysGlu: 1.326 ± 0.036
0.867LysPhe: 0.867 ± 0.026
2.262LysGly: 2.262 ± 0.051
0.603LysHis: 0.603 ± 0.019
1.469LysIle: 1.469 ± 0.036
1.306LysLys: 1.306 ± 0.045
4.153LysLeu: 4.153 ± 0.057
0.634LysMet: 0.634 ± 0.022
0.878LysAsn: 0.878 ± 0.029
2.476LysPro: 2.476 ± 0.049
1.319LysGln: 1.319 ± 0.031
2.421LysArg: 2.421 ± 0.042
1.86LysSer: 1.86 ± 0.036
1.924LysThr: 1.924 ± 0.043
2.205LysVal: 2.205 ± 0.047
0.306LysTrp: 0.306 ± 0.016
0.582LysTyr: 0.582 ± 0.023
0.0LysXaa: 0.0 ± 0.0
Leu
13.571LeuAla: 13.571 ± 0.138
1.014LeuCys: 1.014 ± 0.023
6.522LeuAsp: 6.522 ± 0.071
5.35LeuGlu: 5.35 ± 0.071
3.574LeuPhe: 3.574 ± 0.046
8.534LeuGly: 8.534 ± 0.08
2.28LeuHis: 2.28 ± 0.043
4.828LeuIle: 4.828 ± 0.069
3.63LeuLys: 3.63 ± 0.062
11.616LeuLeu: 11.616 ± 0.141
2.256LeuMet: 2.256 ± 0.043
2.669LeuAsn: 2.669 ± 0.049
6.126LeuPro: 6.126 ± 0.067
4.139LeuGln: 4.139 ± 0.051
8.785LeuArg: 8.785 ± 0.098
7.065LeuSer: 7.065 ± 0.077
5.095LeuThr: 5.095 ± 0.066
7.427LeuVal: 7.427 ± 0.08
1.263LeuTrp: 1.263 ± 0.037
2.348LeuTyr: 2.348 ± 0.043
0.0LeuXaa: 0.0 ± 0.0
Met
2.35MetAla: 2.35 ± 0.041
0.144MetCys: 0.144 ± 0.01
0.844MetAsp: 0.844 ± 0.025
0.915MetGlu: 0.915 ± 0.025
0.648MetPhe: 0.648 ± 0.019
1.504MetGly: 1.504 ± 0.036
0.431MetHis: 0.431 ± 0.017
1.041MetIle: 1.041 ± 0.03
0.913MetLys: 0.913 ± 0.02
2.407MetLeu: 2.407 ± 0.041
0.468MetMet: 0.468 ± 0.018
0.707MetAsn: 0.707 ± 0.022
1.346MetPro: 1.346 ± 0.035
0.932MetGln: 0.932 ± 0.024
1.686MetArg: 1.686 ± 0.036
1.678MetSer: 1.678 ± 0.033
1.277MetThr: 1.277 ± 0.027
1.254MetVal: 1.254 ± 0.03
0.182MetTrp: 0.182 ± 0.012
0.295MetTyr: 0.295 ± 0.015
0.0MetXaa: 0.0 ± 0.0
Asn
3.26AsnAla: 3.26 ± 0.059
0.277AsnCys: 0.277 ± 0.014
1.398AsnAsp: 1.398 ± 0.034
1.389AsnGlu: 1.389 ± 0.033
1.016AsnPhe: 1.016 ± 0.031
2.507AsnGly: 2.507 ± 0.053
0.5AsnHis: 0.5 ± 0.017
1.141AsnIle: 1.141 ± 0.03
0.755AsnLys: 0.755 ± 0.022
2.905AsnLeu: 2.905 ± 0.053
0.442AsnMet: 0.442 ± 0.02
0.752AsnAsn: 0.752 ± 0.026
1.811AsnPro: 1.811 ± 0.04
0.887AsnGln: 0.887 ± 0.025
1.846AsnArg: 1.846 ± 0.041
1.375AsnSer: 1.375 ± 0.037
1.404AsnThr: 1.404 ± 0.035
1.974AsnVal: 1.974 ± 0.041
0.446AsnTrp: 0.446 ± 0.019
0.762AsnTyr: 0.762 ± 0.027
0.0AsnXaa: 0.0 ± 0.0
Pro
6.577ProAla: 6.577 ± 0.093
0.309ProCys: 0.309 ± 0.015
3.17ProAsp: 3.17 ± 0.046
3.43ProGlu: 3.43 ± 0.045
1.665ProPhe: 1.665 ± 0.038
4.329ProGly: 4.329 ± 0.059
0.944ProHis: 0.944 ± 0.024
2.222ProIle: 2.222 ± 0.041
1.704ProLys: 1.704 ± 0.039
5.176ProLeu: 5.176 ± 0.058
1.302ProMet: 1.302 ± 0.031
1.447ProAsn: 1.447 ± 0.032
2.647ProPro: 2.647 ± 0.068
2.179ProGln: 2.179 ± 0.039
3.103ProArg: 3.103 ± 0.05
2.889ProSer: 2.889 ± 0.047
2.592ProThr: 2.592 ± 0.041
3.882ProVal: 3.882 ± 0.056
0.686ProTrp: 0.686 ± 0.023
1.191ProTyr: 1.191 ± 0.035
0.0ProXaa: 0.0 ± 0.0
Gln
5.105GlnAla: 5.105 ± 0.067
0.251GlnCys: 0.251 ± 0.014
1.541GlnAsp: 1.541 ± 0.036
1.461GlnGlu: 1.461 ± 0.034
1.256GlnPhe: 1.256 ± 0.027
2.712GlnGly: 2.712 ± 0.044
0.843GlnHis: 0.843 ± 0.024
1.99GlnIle: 1.99 ± 0.04
1.236GlnLys: 1.236 ± 0.026
4.395GlnLeu: 4.395 ± 0.064
0.851GlnMet: 0.851 ± 0.027
0.952GlnAsn: 0.952 ± 0.024
2.091GlnPro: 2.091 ± 0.042
1.96GlnGln: 1.96 ± 0.042
3.412GlnArg: 3.412 ± 0.06
2.184GlnSer: 2.184 ± 0.039
1.916GlnThr: 1.916 ± 0.038
2.563GlnVal: 2.563 ± 0.046
0.584GlnTrp: 0.584 ± 0.019
0.906GlnTyr: 0.906 ± 0.026
0.0GlnXaa: 0.0 ± 0.0
Arg
7.473ArgAla: 7.473 ± 0.079
0.667ArgCys: 0.667 ± 0.023
4.087ArgAsp: 4.087 ± 0.058
4.748ArgGlu: 4.748 ± 0.071
3.038ArgPhe: 3.038 ± 0.047
5.026ArgGly: 5.026 ± 0.059
1.782ArgHis: 1.782 ± 0.04
4.004ArgIle: 4.004 ± 0.061
2.404ArgLys: 2.404 ± 0.043
8.784ArgLeu: 8.784 ± 0.092
1.664ArgMet: 1.664 ± 0.033
2.015ArgAsn: 2.015 ± 0.038
3.296ArgPro: 3.296 ± 0.053
2.866ArgGln: 2.866 ± 0.049
5.943ArgArg: 5.943 ± 0.083
4.55ArgSer: 4.55 ± 0.062
3.091ArgThr: 3.091 ± 0.05
5.128ArgVal: 5.128 ± 0.066
1.275ArgTrp: 1.275 ± 0.034
2.471ArgTyr: 2.471 ± 0.041
0.0ArgXaa: 0.0 ± 0.0
Ser
7.338SerAla: 7.338 ± 0.085
0.53SerCys: 0.53 ± 0.021
3.115SerAsp: 3.115 ± 0.044
3.095SerGlu: 3.095 ± 0.044
2.332SerPhe: 2.332 ± 0.041
5.818SerGly: 5.818 ± 0.086
1.035SerHis: 1.035 ± 0.027
2.811SerIle: 2.811 ± 0.047
1.761SerLys: 1.761 ± 0.033
6.583SerLeu: 6.583 ± 0.091
1.335SerMet: 1.335 ± 0.028
1.623SerAsn: 1.623 ± 0.034
3.086SerPro: 3.086 ± 0.049
2.059SerGln: 2.059 ± 0.04
4.168SerArg: 4.168 ± 0.065
3.68SerSer: 3.68 ± 0.058
3.148SerThr: 3.148 ± 0.052
4.329SerVal: 4.329 ± 0.062
0.906SerTrp: 0.906 ± 0.024
1.488SerTyr: 1.488 ± 0.036
0.0SerXaa: 0.0 ± 0.0
Thr
5.832ThrAla: 5.832 ± 0.073
0.337ThrCys: 0.337 ± 0.018
2.483ThrAsp: 2.483 ± 0.044
2.407ThrGlu: 2.407 ± 0.044
1.507ThrPhe: 1.507 ± 0.032
4.361ThrGly: 4.361 ± 0.067
1.049ThrHis: 1.049 ± 0.027
2.253ThrIle: 2.253 ± 0.042
1.183ThrLys: 1.183 ± 0.031
6.155ThrLeu: 6.155 ± 0.073
0.929ThrMet: 0.929 ± 0.024
1.189ThrAsn: 1.189 ± 0.032
3.351ThrPro: 3.351 ± 0.045
2.041ThrGln: 2.041 ± 0.038
3.576ThrArg: 3.576 ± 0.049
2.778ThrSer: 2.778 ± 0.051
2.622ThrThr: 2.622 ± 0.053
3.72ThrVal: 3.72 ± 0.055
0.607ThrTrp: 0.607 ± 0.026
1.151ThrTyr: 1.151 ± 0.034
0.0ThrXaa: 0.0 ± 0.0
Val
8.259ValAla: 8.259 ± 0.081
0.679ValCys: 0.679 ± 0.021
4.294ValAsp: 4.294 ± 0.065
4.391ValGlu: 4.391 ± 0.054
2.509ValPhe: 2.509 ± 0.042
5.581ValGly: 5.581 ± 0.068
1.406ValHis: 1.406 ± 0.032
3.677ValIle: 3.677 ± 0.056
2.174ValLys: 2.174 ± 0.041
7.608ValLeu: 7.608 ± 0.083
1.565ValMet: 1.565 ± 0.03
2.129ValAsn: 2.129 ± 0.042
3.597ValPro: 3.597 ± 0.053
2.468ValGln: 2.468 ± 0.044
4.884ValArg: 4.884 ± 0.061
4.519ValSer: 4.519 ± 0.06
3.748ValThr: 3.748 ± 0.055
5.433ValVal: 5.433 ± 0.074
0.855ValTrp: 0.855 ± 0.028
1.606ValTyr: 1.606 ± 0.033
0.0ValXaa: 0.0 ± 0.0
Trp
1.242TrpAla: 1.242 ± 0.031
0.129TrpCys: 0.129 ± 0.01
0.638TrpAsp: 0.638 ± 0.023
0.553TrpGlu: 0.553 ± 0.023
0.581TrpPhe: 0.581 ± 0.022
0.888TrpGly: 0.888 ± 0.022
0.309TrpHis: 0.309 ± 0.014
0.727TrpIle: 0.727 ± 0.025
0.514TrpLys: 0.514 ± 0.021
1.895TrpLeu: 1.895 ± 0.042
0.355TrpMet: 0.355 ± 0.015
0.545TrpAsn: 0.545 ± 0.023
0.704TrpPro: 0.704 ± 0.022
0.638TrpGln: 0.638 ± 0.023
1.178TrpArg: 1.178 ± 0.029
0.972TrpSer: 0.972 ± 0.028
0.754TrpThr: 0.754 ± 0.024
0.877TrpVal: 0.877 ± 0.028
0.231TrpTrp: 0.231 ± 0.013
0.323TrpTyr: 0.323 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.833TyrAla: 2.833 ± 0.046
0.227TyrCys: 0.227 ± 0.011
1.583TyrAsp: 1.583 ± 0.038
1.429TyrGlu: 1.429 ± 0.032
0.963TyrPhe: 0.963 ± 0.026
2.345TyrGly: 2.345 ± 0.047
0.451TyrHis: 0.451 ± 0.019
0.821TyrIle: 0.821 ± 0.025
0.712TyrLys: 0.712 ± 0.021
2.491TyrLeu: 2.491 ± 0.048
0.396TyrMet: 0.396 ± 0.016
0.763TyrAsn: 0.763 ± 0.028
1.12TyrPro: 1.12 ± 0.035
0.911TyrGln: 0.911 ± 0.027
2.121TyrArg: 2.121 ± 0.037
1.469TyrSer: 1.469 ± 0.035
1.289TyrThr: 1.289 ± 0.035
1.76TyrVal: 1.76 ± 0.037
0.399TyrTrp: 0.399 ± 0.017
0.739TyrTyr: 0.739 ± 0.027
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4474 proteins (1483553 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski