Amino acid dipeptide frequency for Massariosphaeria phaeospora

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the 400 standard dipeptides occurred randomly and with equal probability in proteins, each would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
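As an illustration of what a dipeptide frequency is, the following is a minimal sketch of counting overlapping adjacent residue pairs in a set of protein sequences. It is not the code used by the database: the function names are hypothetical, sequences are assumed to be given in one-letter code, any non-standard letter is mapped to X (Xaa), and the total number of dipeptides is assumed to be the denominator.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumed input alphabet).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_counts(proteins):
    """Count overlapping dipeptides; pairs never span two proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the standard alphabet becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    return counts


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all counted pairs."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


if __name__ == "__main__":
    toy_proteome = ["MAAL", "MALK"]  # made-up sequences, not real data
    for pair, freq in sorted(dipeptide_per_mille(toy_proteome).items()):
        print(pair, round(freq, 1))
```

Because pairs are counted per sequence, the last residue of one protein is never paired with the first residue of the next.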

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
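To make the units concrete: the uniform expectation of 0.25% corresponds to 2.5‰, so table values can be compared against 2.5 directly. The sketch below converts per mille values to fractions and labels a value as over- or underrepresented relative to that uniform expectation. The function names are hypothetical, and whether the page's red/blue colouring uses exactly this threshold is an assumption; the three sample values are taken from the table.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille frequency relative to the expected value."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


if __name__ == "__main__":
    sample = {"AlaAla": 9.286, "GluGln": 2.505, "CysCys": 0.309}
    for pair, value in sample.items():
        print(pair, per_mille_to_fraction(value), classify(value))
```

A more informative baseline would be the product of the two single amino acid frequencies, but that goes beyond the uniform expectation described here.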

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.286 ± 0.064
AlaCys: 1.17 ± 0.016
AlaAsp: 4.419 ± 0.031
AlaGlu: 5.133 ± 0.039
AlaPhe: 3.188 ± 0.023
AlaGly: 5.848 ± 0.036
AlaHis: 2.043 ± 0.019
AlaIle: 4.119 ± 0.029
AlaLys: 4.145 ± 0.031
AlaLeu: 8.093 ± 0.045
AlaMet: 1.985 ± 0.019
AlaAsn: 3.032 ± 0.024
AlaPro: 5.4 ± 0.043
AlaGln: 3.58 ± 0.027
AlaArg: 5.384 ± 0.031
AlaSer: 7.431 ± 0.045
AlaThr: 5.522 ± 0.036
AlaVal: 5.526 ± 0.033
AlaTrp: 1.265 ± 0.014
AlaTyr: 2.247 ± 0.021
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.069 ± 0.014
CysCys: 0.309 ± 0.007
CysAsp: 0.665 ± 0.011
CysGlu: 0.616 ± 0.011
CysPhe: 0.555 ± 0.011
CysGly: 1.033 ± 0.016
CysHis: 0.36 ± 0.01
CysIle: 0.699 ± 0.01
CysLys: 0.565 ± 0.01
CysLeu: 1.265 ± 0.019
CysMet: 0.302 ± 0.008
CysAsn: 0.444 ± 0.009
CysPro: 0.752 ± 0.012
CysGln: 0.445 ± 0.01
CysArg: 0.875 ± 0.015
CysSer: 0.996 ± 0.013
CysThr: 0.791 ± 0.013
CysVal: 0.853 ± 0.016
CysTrp: 0.237 ± 0.006
CysTyr: 0.388 ± 0.009
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.049 ± 0.033
AspCys: 0.649 ± 0.011
AspAsp: 4.187 ± 0.04
AspGlu: 4.33 ± 0.035
AspPhe: 2.174 ± 0.02
AspGly: 4.011 ± 0.031
AspHis: 1.238 ± 0.016
AspIle: 2.907 ± 0.024
AspLys: 2.356 ± 0.024
AspLeu: 4.797 ± 0.036
AspMet: 1.329 ± 0.014
AspAsn: 1.813 ± 0.022
AspPro: 3.238 ± 0.025
AspGln: 1.791 ± 0.019
AspArg: 3.076 ± 0.026
AspSer: 3.922 ± 0.029
AspThr: 3.066 ± 0.023
AspVal: 3.749 ± 0.026
AspTrp: 0.935 ± 0.012
AspTyr: 1.571 ± 0.016
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.215 ± 0.039
GluCys: 0.662 ± 0.011
GluAsp: 4.202 ± 0.03
GluGlu: 5.078 ± 0.047
GluPhe: 1.887 ± 0.017
GluGly: 3.821 ± 0.032
GluHis: 1.536 ± 0.017
GluIle: 2.769 ± 0.023
GluLys: 3.483 ± 0.031
GluLeu: 5.168 ± 0.03
GluMet: 1.409 ± 0.017
GluAsn: 2.067 ± 0.02
GluPro: 2.818 ± 0.026
GluGln: 2.505 ± 0.022
GluArg: 3.995 ± 0.032
GluSer: 3.959 ± 0.027
GluThr: 3.332 ± 0.027
GluVal: 3.553 ± 0.026
GluTrp: 0.926 ± 0.013
GluTyr: 1.646 ± 0.019
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.185 ± 0.024
PheCys: 0.579 ± 0.011
PheAsp: 2.226 ± 0.021
PheGlu: 2.157 ± 0.02
PhePhe: 1.541 ± 0.015
PheGly: 2.79 ± 0.024
PheHis: 0.914 ± 0.013
PheIle: 1.617 ± 0.019
PheLys: 1.505 ± 0.016
PheLeu: 3.361 ± 0.028
PheMet: 0.727 ± 0.011
PheAsn: 1.335 ± 0.014
PhePro: 1.973 ± 0.017
PheGln: 1.375 ± 0.014
PheArg: 2.023 ± 0.02
PheSer: 2.922 ± 0.023
PheThr: 2.14 ± 0.02
PheVal: 2.389 ± 0.022
PheTrp: 0.656 ± 0.01
PheTyr: 1.057 ± 0.014
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.482 ± 0.035
GlyCys: 0.944 ± 0.014
GlyAsp: 3.571 ± 0.027
GlyGlu: 3.685 ± 0.026
GlyPhe: 2.7 ± 0.025
GlyGly: 5.935 ± 0.057
GlyHis: 1.696 ± 0.02
GlyIle: 3.247 ± 0.026
GlyLys: 3.458 ± 0.03
GlyLeu: 5.876 ± 0.038
GlyMet: 1.619 ± 0.018
GlyAsn: 2.442 ± 0.025
GlyPro: 3.356 ± 0.028
GlyGln: 2.426 ± 0.025
GlyArg: 4.334 ± 0.032
GlySer: 5.319 ± 0.032
GlyThr: 4.008 ± 0.027
GlyVal: 4.419 ± 0.032
GlyTrp: 1.19 ± 0.015
GlyTyr: 2.02 ± 0.019
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.123 ± 0.02
HisCys: 0.391 ± 0.009
HisAsp: 1.402 ± 0.016
HisGlu: 1.371 ± 0.016
HisPhe: 0.975 ± 0.015
HisGly: 1.752 ± 0.02
HisHis: 0.928 ± 0.015
HisIle: 1.266 ± 0.015
HisLys: 0.969 ± 0.014
HisLeu: 2.351 ± 0.019
HisMet: 0.525 ± 0.011
HisAsn: 0.894 ± 0.012
HisPro: 1.796 ± 0.018
HisGln: 1.049 ± 0.015
HisArg: 1.652 ± 0.017
HisSer: 1.987 ± 0.019
HisThr: 1.522 ± 0.017
HisVal: 1.567 ± 0.015
HisTrp: 0.379 ± 0.008
HisTyr: 0.733 ± 0.01
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.202 ± 0.031
IleCys: 0.734 ± 0.011
IleAsp: 2.654 ± 0.022
IleGlu: 2.764 ± 0.023
IlePhe: 1.798 ± 0.019
IleGly: 2.932 ± 0.027
IleHis: 1.165 ± 0.013
IleIle: 2.216 ± 0.022
IleLys: 2.052 ± 0.018
IleLeu: 4.203 ± 0.029
IleMet: 0.979 ± 0.014
IleAsn: 1.638 ± 0.017
IlePro: 2.965 ± 0.025
IleGln: 1.741 ± 0.018
IleArg: 2.741 ± 0.026
IleSer: 3.538 ± 0.027
IleThr: 2.792 ± 0.022
IleVal: 3.061 ± 0.022
IleTrp: 0.688 ± 0.012
IleTyr: 1.344 ± 0.016
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.252 ± 0.032
LysCys: 0.539 ± 0.01
LysAsp: 2.782 ± 0.026
LysGlu: 3.285 ± 0.035
LysPhe: 1.438 ± 0.017
LysGly: 2.958 ± 0.024
LysHis: 1.198 ± 0.015
LysIle: 2.143 ± 0.024
LysLys: 3.29 ± 0.043
LysLeu: 4.043 ± 0.03
LysMet: 1.016 ± 0.013
LysAsn: 1.671 ± 0.018
LysPro: 2.785 ± 0.026
LysGln: 1.886 ± 0.015
LysArg: 3.48 ± 0.028
LysSer: 3.371 ± 0.023
LysThr: 2.854 ± 0.024
LysVal: 2.73 ± 0.023
LysTrp: 0.683 ± 0.013
LysTyr: 1.323 ± 0.017
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.014 ± 0.039
LeuCys: 1.276 ± 0.016
LeuAsp: 5.218 ± 0.036
LeuGlu: 5.408 ± 0.038
LeuPhe: 3.285 ± 0.03
LeuGly: 5.764 ± 0.039
LeuHis: 2.475 ± 0.022
LeuIle: 3.659 ± 0.025
LeuLys: 4.094 ± 0.031
LeuLeu: 8.457 ± 0.058
LeuMet: 1.691 ± 0.018
LeuAsn: 3.027 ± 0.022
LeuPro: 5.761 ± 0.035
LeuGln: 3.87 ± 0.029
LeuArg: 5.989 ± 0.038
LeuSer: 7.073 ± 0.039
LeuThr: 4.828 ± 0.03
LeuVal: 5.434 ± 0.032
LeuTrp: 1.217 ± 0.013
LeuTyr: 2.387 ± 0.022
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.205 ± 0.02
MetCys: 0.263 ± 0.007
MetAsp: 1.232 ± 0.014
MetGlu: 1.232 ± 0.014
MetPhe: 0.769 ± 0.012
MetGly: 1.454 ± 0.016
MetHis: 0.54 ± 0.01
MetIle: 0.888 ± 0.013
MetLys: 0.979 ± 0.013
MetLeu: 1.892 ± 0.019
MetMet: 0.549 ± 0.009
MetAsn: 0.791 ± 0.011
MetPro: 1.32 ± 0.016
MetGln: 0.898 ± 0.013
MetArg: 1.363 ± 0.014
MetSer: 1.825 ± 0.019
MetThr: 1.213 ± 0.014
MetVal: 1.291 ± 0.017
MetTrp: 0.28 ± 0.007
MetTyr: 0.557 ± 0.011
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.159 ± 0.027
AsnCys: 0.441 ± 0.008
AsnAsp: 1.854 ± 0.018
AsnGlu: 1.932 ± 0.018
AsnPhe: 1.321 ± 0.015
AsnGly: 2.829 ± 0.026
AsnHis: 0.869 ± 0.013
AsnIle: 1.862 ± 0.021
AsnLys: 1.52 ± 0.017
AsnLeu: 3.111 ± 0.025
AsnMet: 0.82 ± 0.011
AsnAsn: 1.41 ± 0.017
AsnPro: 2.444 ± 0.023
AsnGln: 1.283 ± 0.017
AsnArg: 1.937 ± 0.017
AsnSer: 2.601 ± 0.021
AsnThr: 2.262 ± 0.023
AsnVal: 2.222 ± 0.021
AsnTrp: 0.556 ± 0.01
AsnTyr: 1.024 ± 0.013
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.749 ± 0.041
ProCys: 0.594 ± 0.012
ProAsp: 3.19 ± 0.024
ProGlu: 3.597 ± 0.032
ProPhe: 2.083 ± 0.021
ProGly: 3.897 ± 0.029
ProHis: 1.594 ± 0.017
ProIle: 2.568 ± 0.025
ProLys: 2.726 ± 0.025
ProLeu: 5.005 ± 0.029
ProMet: 1.101 ± 0.016
ProAsn: 2.291 ± 0.022
ProPro: 5.911 ± 0.062
ProGln: 2.626 ± 0.032
ProArg: 3.783 ± 0.031
ProSer: 6.238 ± 0.048
ProThr: 4.484 ± 0.03
ProVal: 3.57 ± 0.028
ProTrp: 0.785 ± 0.013
ProTyr: 1.558 ± 0.018
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.436 ± 0.026
GlnCys: 0.481 ± 0.009
GlnAsp: 2.108 ± 0.019
GlnGlu: 2.296 ± 0.023
GlnPhe: 1.311 ± 0.015
GlnGly: 2.309 ± 0.023
GlnHis: 1.242 ± 0.018
GlnIle: 1.803 ± 0.022
GlnLys: 1.948 ± 0.02
GlnLeu: 3.527 ± 0.03
GlnMet: 0.852 ± 0.012
GlnAsn: 1.516 ± 0.017
GlnPro: 2.659 ± 0.028
GlnGln: 2.394 ± 0.046
GlnArg: 2.758 ± 0.022
GlnSer: 3.056 ± 0.023
GlnThr: 2.378 ± 0.024
GlnVal: 2.09 ± 0.018
GlnTrp: 0.593 ± 0.01
GlnTyr: 1.203 ± 0.015
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.122 ± 0.035
ArgCys: 0.854 ± 0.015
ArgAsp: 3.385 ± 0.03
ArgGlu: 3.798 ± 0.037
ArgPhe: 2.166 ± 0.021
ArgGly: 3.983 ± 0.033
ArgHis: 1.698 ± 0.018
ArgIle: 2.921 ± 0.023
ArgLys: 3.562 ± 0.028
ArgLeu: 5.669 ± 0.032
ArgMet: 1.383 ± 0.016
ArgAsn: 2.35 ± 0.019
ArgPro: 3.865 ± 0.03
ArgGln: 2.644 ± 0.022
ArgArg: 5.396 ± 0.043
ArgSer: 4.894 ± 0.035
ArgThr: 3.65 ± 0.027
ArgVal: 3.552 ± 0.024
ArgTrp: 0.992 ± 0.011
ArgTyr: 1.696 ± 0.019
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.859 ± 0.036
SerCys: 0.957 ± 0.013
SerAsp: 3.999 ± 0.032
SerGlu: 3.92 ± 0.027
SerPhe: 2.876 ± 0.026
SerGly: 5.371 ± 0.034
SerHis: 1.993 ± 0.019
SerIle: 3.772 ± 0.03
SerLys: 3.583 ± 0.028
SerLeu: 6.997 ± 0.038
SerMet: 1.648 ± 0.017
SerAsn: 2.826 ± 0.026
SerPro: 5.67 ± 0.044
SerGln: 3.079 ± 0.025
SerArg: 5.044 ± 0.034
SerSer: 8.389 ± 0.061
SerThr: 5.77 ± 0.037
SerVal: 4.528 ± 0.029
SerTrp: 1.095 ± 0.014
SerTyr: 2.011 ± 0.019
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.579 ± 0.033
ThrCys: 0.82 ± 0.014
ThrAsp: 2.829 ± 0.02
ThrGlu: 3.061 ± 0.027
ThrPhe: 2.284 ± 0.019
ThrGly: 4.015 ± 0.031
ThrHis: 1.5 ± 0.017
ThrIle: 3.008 ± 0.025
ThrLys: 2.615 ± 0.024
ThrLeu: 5.515 ± 0.037
ThrMet: 1.281 ± 0.016
ThrAsn: 2.125 ± 0.02
ThrPro: 4.781 ± 0.039
ThrGln: 2.251 ± 0.022
ThrArg: 3.403 ± 0.026
ThrSer: 5.349 ± 0.031
ThrThr: 4.497 ± 0.039
ThrVal: 3.735 ± 0.026
ThrTrp: 0.863 ± 0.012
ThrTyr: 1.634 ± 0.015
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.415 ± 0.034
ValCys: 0.892 ± 0.012
ValAsp: 3.716 ± 0.027
ValGlu: 3.969 ± 0.03
ValPhe: 2.435 ± 0.022
ValGly: 4.053 ± 0.032
ValHis: 1.474 ± 0.015
ValIle: 2.656 ± 0.019
ValLys: 2.937 ± 0.025
ValLeu: 5.64 ± 0.033
ValMet: 1.295 ± 0.013
ValAsn: 2.096 ± 0.02
ValPro: 3.661 ± 0.032
ValGln: 2.462 ± 0.02
ValArg: 3.684 ± 0.028
ValSer: 4.447 ± 0.027
ValThr: 3.379 ± 0.024
ValVal: 4.422 ± 0.038
ValTrp: 0.91 ± 0.012
ValTyr: 1.687 ± 0.017
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.169 ± 0.015
TrpCys: 0.229 ± 0.006
TrpAsp: 0.905 ± 0.013
TrpGlu: 0.85 ± 0.014
TrpPhe: 0.535 ± 0.01
TrpGly: 0.98 ± 0.015
TrpHis: 0.395 ± 0.009
TrpIle: 0.759 ± 0.012
TrpLys: 0.825 ± 0.012
TrpLeu: 1.381 ± 0.017
TrpMet: 0.396 ± 0.007
TrpAsn: 0.629 ± 0.011
TrpPro: 0.664 ± 0.012
TrpGln: 0.576 ± 0.009
TrpArg: 1.02 ± 0.013
TrpSer: 1.071 ± 0.014
TrpThr: 0.966 ± 0.015
TrpVal: 0.907 ± 0.012
TrpTrp: 0.299 ± 0.007
TrpTyr: 0.433 ± 0.009
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.248 ± 0.02
TyrCys: 0.435 ± 0.009
TyrAsp: 1.645 ± 0.018
TyrGlu: 1.548 ± 0.019
TyrPhe: 1.154 ± 0.014
TyrGly: 1.993 ± 0.023
TyrHis: 0.773 ± 0.013
TyrIle: 1.324 ± 0.016
TyrLys: 1.114 ± 0.015
TyrLeu: 2.577 ± 0.024
TyrMet: 0.637 ± 0.01
TyrAsn: 1.059 ± 0.014
TyrPro: 1.532 ± 0.018
TyrGln: 1.079 ± 0.015
TyrArg: 1.64 ± 0.019
TyrSer: 1.999 ± 0.021
TyrThr: 1.689 ± 0.016
TyrVal: 1.631 ± 0.017
TyrTrp: 0.439 ± 0.009
TyrTyr: 0.906 ± 0.012
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics are based on 14012 proteins (6065376 amino acids).

Note: The error has been estimated by bootstrapping (×100) at the protein level.
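Bootstrapping at the protein level can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled proteome, and the spread of those estimates serves as the error. This is only an illustrative sketch under stated assumptions, not the database's actual procedure: the function names are hypothetical, sequences are in one-letter code, and the sample standard deviation over 100 replicates is assumed to be the reported error.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide among all overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MAAL", "MALK", "AAAA", "GLLK"]  # made-up sequences
    print(pair_per_mille(toy_proteome, "AA"), bootstrap_error(toy_proteome, "AA"))
```

Resampling proteins rather than individual dipeptides keeps the pairs within a protein together, so the estimate reflects variation between proteins.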

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski