Amino acid dipeptide frequency for Marivita hallyeonensis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
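As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping dipeptides within each protein and reports them in per mille. It is not the database's own code: the function names are hypothetical, and it assumes that dipeptides do not span protein boundaries and that any letter outside the 20 standard one-letter codes is mapped to X (Xaa).

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_counts(protein):
    """Count overlapping dipeptides in a single protein sequence."""
    # Assumption: every non-standard or unknown letter becomes X (Xaa).
    seq = "".join(aa if aa in STANDARD else "X" for aa in protein.upper())
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille over a list of sequences."""
    total = Counter()
    for protein in proteins:
        # Counting per protein keeps dipeptides from spanning two proteins.
        total.update(dipeptide_counts(protein))
    n = sum(total.values())
    if n == 0:
        return {}
    return {pair: 1000.0 * count / n for pair, count in total.items()}


# Toy input, not taken from the proteome.
freqs = dipeptide_frequencies(["MAAL", "MALB"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

Counting per protein rather than over one concatenated string is a design choice that avoids artificial dipeptides at the junction between two sequences.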

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred with equal probability in proteins, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
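The uniform baseline can be checked with a few lines of arithmetic. The sketch below assumes that "expected" means exactly 1/400 of all dipeptides and that a value counts as over- or underrepresented simply by being above or below that baseline; the page does not state its exact colouring rule, so the `representation` helper is hypothetical. The two observed values are copied from the table.

```python
N_STANDARD_AMINO_ACIDS = 20

# Uniform baseline: 1 / 400 of all dipeptides, expressed in per mille.
EXPECTED_PER_MILLE = 1000.0 / N_STANDARD_AMINO_ACIDS ** 2  # 2.5


def representation(observed_per_mille, expected=EXPECTED_PER_MILLE):
    """Classify a dipeptide relative to the uniform baseline (assumed rule)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Values in per mille, taken from the table for this proteome.
observed = {"AlaAla": 14.494, "CysCys": 0.116}
for pair, value in observed.items():
    ratio = value / EXPECTED_PER_MILLE
    print(pair, representation(value), f"{ratio:.2f}x the uniform baseline")
```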

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.
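For readers who prefer fractions or percentages, the conversion is a single multiplication. The helper names below are hypothetical, and the example values are the AlaAla entry from the table and the 2.5‰ uniform baseline.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 14.494  # AlaAla frequency in per mille, from the table
print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(2.5))
```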

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 14.494 ± 0.163
AlaCys: 1.155 ± 0.031
AlaAsp: 6.912 ± 0.087
AlaGlu: 7.783 ± 0.104
AlaPhe: 4.463 ± 0.068
AlaGly: 9.119 ± 0.099
AlaHis: 2.351 ± 0.053
AlaIle: 5.959 ± 0.073
AlaLys: 3.955 ± 0.066
AlaLeu: 13.257 ± 0.139
AlaMet: 3.707 ± 0.058
AlaAsn: 2.863 ± 0.051
AlaPro: 5.319 ± 0.073
AlaGln: 4.437 ± 0.062
AlaArg: 7.884 ± 0.09
AlaSer: 5.736 ± 0.078
AlaThr: 6.044 ± 0.076
AlaVal: 8.241 ± 0.081
AlaTrp: 1.416 ± 0.031
AlaTyr: 2.639 ± 0.049
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.153 ± 0.031
CysCys: 0.116 ± 0.011
CysAsp: 0.627 ± 0.024
CysGlu: 0.44 ± 0.019
CysPhe: 0.352 ± 0.018
CysGly: 0.962 ± 0.033
CysHis: 0.294 ± 0.017
CysIle: 0.429 ± 0.018
CysLys: 0.237 ± 0.013
CysLeu: 0.892 ± 0.028
CysMet: 0.161 ± 0.012
CysAsn: 0.224 ± 0.013
CysPro: 0.506 ± 0.022
CysGln: 0.24 ± 0.015
CysArg: 0.488 ± 0.02
CysSer: 0.448 ± 0.021
CysThr: 0.483 ± 0.021
CysVal: 0.693 ± 0.024
CysTrp: 0.106 ± 0.008
CysTyr: 0.215 ± 0.015
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.946 ± 0.096
AspCys: 0.513 ± 0.021
AspAsp: 3.771 ± 0.085
AspGlu: 3.692 ± 0.06
AspPhe: 2.42 ± 0.046
AspGly: 5.956 ± 0.089
AspHis: 1.468 ± 0.034
AspIle: 3.281 ± 0.052
AspLys: 1.691 ± 0.041
AspLeu: 6.534 ± 0.072
AspMet: 1.796 ± 0.035
AspAsn: 1.372 ± 0.036
AspPro: 3.761 ± 0.052
AspGln: 2.021 ± 0.04
AspArg: 4.337 ± 0.064
AspSer: 2.191 ± 0.041
AspThr: 3.56 ± 0.063
AspVal: 4.921 ± 0.067
AspTrp: 1.227 ± 0.028
AspTyr: 1.524 ± 0.037
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.792 ± 0.093
GluCys: 0.377 ± 0.017
GluAsp: 3.931 ± 0.061
GluGlu: 3.595 ± 0.07
GluPhe: 1.903 ± 0.042
GluGly: 4.524 ± 0.06
GluHis: 1.168 ± 0.032
GluIle: 3.618 ± 0.057
GluLys: 2.154 ± 0.05
GluLeu: 5.028 ± 0.065
GluMet: 1.927 ± 0.038
GluAsn: 1.865 ± 0.039
GluPro: 2.558 ± 0.05
GluGln: 1.893 ± 0.042
GluArg: 4.071 ± 0.063
GluSer: 2.149 ± 0.042
GluThr: 4.229 ± 0.058
GluVal: 4.31 ± 0.056
GluTrp: 0.696 ± 0.024
GluTyr: 1.108 ± 0.03
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.578 ± 0.066
PheCys: 0.454 ± 0.018
PheAsp: 3.163 ± 0.057
PheGlu: 2.454 ± 0.051
PhePhe: 1.6 ± 0.042
PheGly: 3.886 ± 0.055
PheHis: 0.771 ± 0.028
PheIle: 1.797 ± 0.042
PheLys: 0.987 ± 0.029
PheLeu: 3.625 ± 0.064
PheMet: 0.886 ± 0.027
PheAsn: 1.102 ± 0.031
PhePro: 1.625 ± 0.036
PheGln: 1.142 ± 0.031
PheArg: 2.138 ± 0.046
PheSer: 2.137 ± 0.044
PheThr: 2.169 ± 0.045
PheVal: 2.876 ± 0.048
PheTrp: 0.624 ± 0.025
PheTyr: 0.936 ± 0.031
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.076 ± 0.093
GlyCys: 0.853 ± 0.029
GlyAsp: 4.84 ± 0.064
GlyGlu: 4.395 ± 0.062
GlyPhe: 3.818 ± 0.059
GlyGly: 6.921 ± 0.087
GlyHis: 1.868 ± 0.043
GlyIle: 4.585 ± 0.066
GlyLys: 3.021 ± 0.055
GlyLeu: 8.826 ± 0.096
GlyMet: 2.5 ± 0.049
GlyAsn: 2.061 ± 0.043
GlyPro: 3.607 ± 0.054
GlyGln: 3.141 ± 0.054
GlyArg: 5.093 ± 0.065
GlySer: 4.307 ± 0.06
GlyThr: 4.944 ± 0.067
GlyVal: 6.515 ± 0.069
GlyTrp: 1.425 ± 0.031
GlyTyr: 2.259 ± 0.045
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.335 ± 0.047
HisCys: 0.239 ± 0.014
HisAsp: 1.278 ± 0.038
HisGlu: 1.08 ± 0.032
HisPhe: 0.836 ± 0.023
HisGly: 1.882 ± 0.042
HisHis: 0.576 ± 0.022
HisIle: 1.064 ± 0.026
HisLys: 0.568 ± 0.024
HisLeu: 2.172 ± 0.043
HisMet: 0.567 ± 0.021
HisAsn: 0.472 ± 0.019
HisPro: 1.382 ± 0.039
HisGln: 0.614 ± 0.026
HisArg: 1.333 ± 0.036
HisSer: 0.948 ± 0.024
HisThr: 0.909 ± 0.026
HisVal: 1.675 ± 0.033
HisTrp: 0.347 ± 0.018
HisTyr: 0.537 ± 0.021
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.086 ± 0.089
IleCys: 0.576 ± 0.018
IleAsp: 3.777 ± 0.057
IleGlu: 3.582 ± 0.054
IlePhe: 1.823 ± 0.035
IleGly: 5.017 ± 0.071
IleHis: 0.913 ± 0.03
IleIle: 2.322 ± 0.051
IleLys: 1.418 ± 0.037
IleLeu: 4.938 ± 0.08
IleMet: 1.089 ± 0.032
IleAsn: 1.464 ± 0.036
IlePro: 2.42 ± 0.048
IleGln: 1.259 ± 0.034
IleArg: 3.001 ± 0.055
IleSer: 3.067 ± 0.055
IleThr: 3.026 ± 0.051
IleVal: 3.994 ± 0.064
IleTrp: 0.76 ± 0.024
IleTyr: 1.164 ± 0.031
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.007 ± 0.067
LysCys: 0.189 ± 0.013
LysAsp: 1.983 ± 0.047
LysGlu: 1.652 ± 0.042
LysPhe: 0.911 ± 0.026
LysGly: 2.699 ± 0.05
LysHis: 0.69 ± 0.023
LysIle: 1.622 ± 0.042
LysLys: 1.223 ± 0.041
LysLeu: 2.992 ± 0.049
LysMet: 0.912 ± 0.028
LysAsn: 0.873 ± 0.027
LysPro: 1.904 ± 0.041
LysGln: 0.904 ± 0.03
LysArg: 2.346 ± 0.048
LysSer: 2.007 ± 0.043
LysThr: 2.154 ± 0.043
LysVal: 2.268 ± 0.046
LysTrp: 0.392 ± 0.016
LysTyr: 0.655 ± 0.025
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.838 ± 0.127
LeuCys: 0.993 ± 0.032
LeuAsp: 6.1 ± 0.079
LeuGlu: 5.323 ± 0.073
LeuPhe: 3.726 ± 0.059
LeuGly: 8.176 ± 0.106
LeuHis: 1.856 ± 0.042
LeuIle: 5.373 ± 0.084
LeuLys: 3.302 ± 0.049
LeuLeu: 8.678 ± 0.117
LeuMet: 2.678 ± 0.055
LeuAsn: 2.816 ± 0.054
LeuPro: 5.458 ± 0.075
LeuGln: 2.811 ± 0.055
LeuArg: 6.731 ± 0.078
LeuSer: 6.952 ± 0.084
LeuThr: 6.156 ± 0.078
LeuVal: 6.788 ± 0.073
LeuTrp: 1.343 ± 0.039
LeuTyr: 1.954 ± 0.041
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.359 ± 0.047
MetCys: 0.21 ± 0.014
MetAsp: 1.452 ± 0.033
MetGlu: 1.325 ± 0.034
MetPhe: 0.86 ± 0.027
MetGly: 2.254 ± 0.044
MetHis: 0.487 ± 0.019
MetIle: 1.55 ± 0.039
MetLys: 1.056 ± 0.029
MetLeu: 2.559 ± 0.047
MetMet: 0.782 ± 0.029
MetAsn: 0.872 ± 0.026
MetPro: 1.556 ± 0.037
MetGln: 1.015 ± 0.025
MetArg: 1.945 ± 0.045
MetSer: 1.872 ± 0.045
MetThr: 2.132 ± 0.04
MetVal: 1.919 ± 0.042
MetTrp: 0.287 ± 0.015
MetTyr: 0.377 ± 0.019
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.354 ± 0.057
AsnCys: 0.278 ± 0.014
AsnAsp: 1.58 ± 0.037
AsnGlu: 1.326 ± 0.032
AsnPhe: 0.97 ± 0.028
AsnGly: 2.504 ± 0.044
AsnHis: 0.524 ± 0.021
AsnIle: 1.436 ± 0.032
AsnLys: 0.697 ± 0.024
AsnLeu: 2.527 ± 0.045
AsnMet: 0.73 ± 0.025
AsnAsn: 0.727 ± 0.024
AsnPro: 1.915 ± 0.04
AsnGln: 0.764 ± 0.027
AsnArg: 1.748 ± 0.04
AsnSer: 1.234 ± 0.035
AsnThr: 1.459 ± 0.032
AsnVal: 2.033 ± 0.038
AsnTrp: 0.447 ± 0.017
AsnTyr: 0.66 ± 0.025
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.968 ± 0.063
ProCys: 0.355 ± 0.019
ProAsp: 4.219 ± 0.057
ProGlu: 4.02 ± 0.052
ProPhe: 2.045 ± 0.042
ProGly: 4.108 ± 0.063
ProHis: 1.137 ± 0.033
ProIle: 2.385 ± 0.043
ProLys: 1.979 ± 0.048
ProLeu: 4.446 ± 0.069
ProMet: 1.338 ± 0.035
ProAsn: 1.489 ± 0.035
ProPro: 2.169 ± 0.048
ProGln: 1.626 ± 0.041
ProArg: 2.57 ± 0.049
ProSer: 2.72 ± 0.05
ProThr: 2.714 ± 0.045
ProVal: 4.222 ± 0.057
ProTrp: 0.709 ± 0.025
ProTyr: 1.135 ± 0.034
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.847 ± 0.054
GlnCys: 0.216 ± 0.013
GlnAsp: 1.912 ± 0.037
GlnGlu: 1.602 ± 0.031
GlnPhe: 1.148 ± 0.03
GlnGly: 2.467 ± 0.046
GlnHis: 0.629 ± 0.021
GlnIle: 2.023 ± 0.041
GlnLys: 1.152 ± 0.034
GlnLeu: 2.831 ± 0.047
GlnMet: 1.062 ± 0.032
GlnAsn: 1.03 ± 0.03
GlnPro: 1.585 ± 0.037
GlnGln: 1.057 ± 0.037
GlnArg: 2.065 ± 0.04
GlnSer: 1.963 ± 0.039
GlnThr: 1.992 ± 0.048
GlnVal: 2.355 ± 0.044
GlnTrp: 0.424 ± 0.02
GlnTyr: 0.603 ± 0.022
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.287 ± 0.087
ArgCys: 0.49 ± 0.019
ArgAsp: 4.297 ± 0.065
ArgGlu: 3.597 ± 0.06
ArgPhe: 2.743 ± 0.045
ArgGly: 4.311 ± 0.057
ArgHis: 1.458 ± 0.037
ArgIle: 3.646 ± 0.056
ArgLys: 2.316 ± 0.042
ArgLeu: 6.78 ± 0.086
ArgMet: 1.897 ± 0.039
ArgAsn: 1.772 ± 0.038
ArgPro: 3.017 ± 0.05
ArgGln: 2.176 ± 0.043
ArgArg: 4.461 ± 0.07
ArgSer: 3.21 ± 0.057
ArgThr: 3.107 ± 0.053
ArgVal: 4.683 ± 0.073
ArgTrp: 0.939 ± 0.031
ArgTyr: 1.51 ± 0.037
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.882 ± 0.073
SerCys: 0.428 ± 0.019
SerAsp: 3.716 ± 0.055
SerGlu: 3.225 ± 0.055
SerPhe: 2.326 ± 0.045
SerGly: 5.332 ± 0.073
SerHis: 1.104 ± 0.032
SerIle: 2.666 ± 0.049
SerLys: 1.685 ± 0.04
SerLeu: 5.084 ± 0.07
SerMet: 1.408 ± 0.033
SerAsn: 1.515 ± 0.036
SerPro: 2.501 ± 0.043
SerGln: 1.636 ± 0.034
SerArg: 3.137 ± 0.054
SerSer: 2.636 ± 0.054
SerThr: 2.672 ± 0.049
SerVal: 4.108 ± 0.059
SerTrp: 0.716 ± 0.024
SerTyr: 1.23 ± 0.033
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.236 ± 0.078
ThrCys: 0.544 ± 0.022
ThrAsp: 3.567 ± 0.053
ThrGlu: 3.349 ± 0.059
ThrPhe: 2.256 ± 0.042
ThrGly: 5.359 ± 0.067
ThrHis: 1.238 ± 0.033
ThrIle: 2.894 ± 0.058
ThrLys: 1.672 ± 0.036
ThrLeu: 6.303 ± 0.073
ThrMet: 1.334 ± 0.034
ThrAsn: 1.432 ± 0.035
ThrPro: 3.57 ± 0.056
ThrGln: 1.743 ± 0.038
ThrArg: 3.556 ± 0.056
ThrSer: 2.938 ± 0.055
ThrThr: 3.026 ± 0.047
ThrVal: 4.638 ± 0.066
ThrTrp: 0.73 ± 0.026
ThrTyr: 1.32 ± 0.029
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.624 ± 0.087
ValCys: 0.687 ± 0.024
ValAsp: 4.396 ± 0.064
ValGlu: 4.443 ± 0.069
ValPhe: 3.15 ± 0.051
ValGly: 5.499 ± 0.072
ValHis: 1.415 ± 0.031
ValIle: 4.379 ± 0.059
ValLys: 2.187 ± 0.045
ValLeu: 7.645 ± 0.087
ValMet: 2.187 ± 0.043
ValAsn: 1.984 ± 0.037
ValPro: 3.754 ± 0.054
ValGln: 2.225 ± 0.043
ValArg: 4.249 ± 0.062
ValSer: 4.516 ± 0.066
ValThr: 4.897 ± 0.071
ValVal: 5.636 ± 0.077
ValTrp: 1.058 ± 0.03
ValTyr: 1.503 ± 0.033
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.355 ± 0.034
TrpCys: 0.149 ± 0.01
TrpAsp: 0.867 ± 0.024
TrpGlu: 0.712 ± 0.025
TrpPhe: 0.616 ± 0.023
TrpGly: 1.006 ± 0.03
TrpHis: 0.327 ± 0.018
TrpIle: 0.718 ± 0.027
TrpLys: 0.455 ± 0.017
TrpLeu: 1.669 ± 0.047
TrpMet: 0.416 ± 0.018
TrpAsn: 0.425 ± 0.019
TrpPro: 0.701 ± 0.023
TrpGln: 0.565 ± 0.022
TrpArg: 1.061 ± 0.029
TrpSer: 0.833 ± 0.024
TrpThr: 0.852 ± 0.028
TrpVal: 0.992 ± 0.029
TrpTrp: 0.23 ± 0.015
TrpTyr: 0.291 ± 0.014
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.517 ± 0.041
TyrCys: 0.244 ± 0.014
TyrAsp: 1.595 ± 0.035
TyrGlu: 1.297 ± 0.036
TyrPhe: 0.983 ± 0.032
TyrGly: 2.071 ± 0.042
TyrHis: 0.537 ± 0.02
TyrIle: 0.951 ± 0.027
TyrLys: 0.625 ± 0.024
TyrLeu: 2.266 ± 0.04
TyrMet: 0.49 ± 0.02
TyrAsn: 0.56 ± 0.022
TyrPro: 1.048 ± 0.028
TyrGln: 0.695 ± 0.021
TyrArg: 1.483 ± 0.042
TyrSer: 1.142 ± 0.034
TyrThr: 1.181 ± 0.033
TyrVal: 1.569 ± 0.032
TyrTrp: 0.366 ± 0.02
TyrTyr: 0.568 ± 0.021
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4131 proteins (1271774 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
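A minimal sketch of what a protein-level bootstrap could look like is given below. The page only states "bootstrapping (x100) at the protein level", so the details are assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread is reported as the standard deviation across replicates. The function names and the toy sequences are hypothetical.

```python
import random
from collections import Counter
from statistics import mean, stdev


def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_dipeptide(proteins, pair, replicates=100, seed=0):
    """Estimate mean and spread (per mille) of one dipeptide frequency.

    Assumption: each replicate resamples whole proteins with replacement,
    so the resampling unit is the protein, not the single dipeptide.
    """
    rng = random.Random(seed)
    # Pre-compute (hits, total) per protein so each replicate is cheap.
    per_protein = []
    for seq in proteins:
        counts = pair_counts(seq)
        per_protein.append((counts[pair], sum(counts.values())))
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)


# Toy sequences, not taken from the proteome.
toy = ["MAALAA", "MKKLAA", "MAAAAA", "MGGGGG"]
avg, spread = bootstrap_dipeptide(toy, "AA")
print(f"AA: {avg:.3f} ± {spread:.3f} per mille")
```

Resampling proteins rather than individual dipeptides keeps the dependence between neighbouring positions within a protein intact, which is presumably why the error is quoted "at the protein level".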

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski