Amino acid dipeptide frequency for Prunus yedoensis var. nudiflora

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
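The uniform expectation and the per mille units can be illustrated with a minimal sketch. It assumes that dipeptides are counted as overlapping residue pairs within each protein and that sequences use one-letter codes; the function names (`dipeptide_permille`, `classify`) and the toy sequences are hypothetical and are not taken from Proteome-pI.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (Xaa is ignored in this sketch).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency as over- or under-represented relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Toy input (assumption, not real proteome data): 10 dipeptides in total.
toy = ["MKLLSA", "MSSLLK"]
freqs = dipeptide_permille(toy)
ll_fraction = freqs["LL"] * 1e-3  # per mille -> fraction
```

With this reading, a table value such as 6.369 for AlaAla lies above 2.5‰ and would be classified as over-represented, while 0.834 for AlaTrp lies below it.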

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.369 ± 0.033
AlaCys: 1.34 ± 0.013
AlaAsp: 3.103 ± 0.018
AlaGlu: 4.249 ± 0.022
AlaPhe: 2.798 ± 0.016
AlaGly: 4.147 ± 0.021
AlaHis: 1.385 ± 0.011
AlaIle: 3.726 ± 0.017
AlaLys: 3.911 ± 0.02
AlaLeu: 6.834 ± 0.027
AlaMet: 1.849 ± 0.012
AlaAsn: 2.621 ± 0.015
AlaPro: 2.918 ± 0.021
AlaGln: 2.286 ± 0.015
AlaArg: 3.389 ± 0.015
AlaSer: 6.112 ± 0.026
AlaThr: 3.668 ± 0.017
AlaVal: 4.765 ± 0.021
AlaTrp: 0.834 ± 0.009
AlaTyr: 1.845 ± 0.013
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.021 ± 0.009
CysCys: 0.547 ± 0.006
CysAsp: 0.845 ± 0.008
CysGlu: 0.876 ± 0.009
CysPhe: 0.897 ± 0.009
CysGly: 1.446 ± 0.012
CysHis: 0.487 ± 0.007
CysIle: 1.003 ± 0.01
CysLys: 1.176 ± 0.012
CysLeu: 1.928 ± 0.014
CysMet: 0.488 ± 0.006
CysAsn: 0.878 ± 0.009
CysPro: 0.956 ± 0.009
CysGln: 0.639 ± 0.007
CysArg: 1.079 ± 0.01
CysSer: 1.869 ± 0.012
CysThr: 0.914 ± 0.009
CysVal: 1.068 ± 0.01
CysTrp: 0.273 ± 0.005
CysTyr: 0.537 ± 0.007
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.386 ± 0.019
AspCys: 0.939 ± 0.009
AspAsp: 3.462 ± 0.019
AspGlu: 3.861 ± 0.02
AspPhe: 2.35 ± 0.014
AspGly: 3.757 ± 0.018
AspHis: 1.302 ± 0.011
AspIle: 2.884 ± 0.015
AspLys: 2.621 ± 0.017
AspLeu: 5.083 ± 0.024
AspMet: 1.359 ± 0.011
AspAsn: 2.02 ± 0.014
AspPro: 2.622 ± 0.016
AspGln: 1.882 ± 0.012
AspArg: 2.333 ± 0.014
AspSer: 4.071 ± 0.023
AspThr: 2.169 ± 0.014
AspVal: 3.594 ± 0.016
AspTrp: 0.735 ± 0.008
AspTyr: 1.517 ± 0.013
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.965 ± 0.023
GluCys: 0.897 ± 0.009
GluAsp: 3.904 ± 0.021
GluGlu: 6.055 ± 0.03
GluPhe: 2.388 ± 0.014
GluGly: 3.9 ± 0.018
GluHis: 1.262 ± 0.01
GluIle: 3.629 ± 0.02
GluLys: 4.586 ± 0.022
GluLeu: 5.943 ± 0.026
GluMet: 1.731 ± 0.011
GluAsn: 3.015 ± 0.019
GluPro: 2.208 ± 0.014
GluGln: 2.149 ± 0.014
GluArg: 3.316 ± 0.02
GluSer: 4.409 ± 0.02
GluThr: 3.01 ± 0.018
GluVal: 4.193 ± 0.02
GluTrp: 0.745 ± 0.007
GluTyr: 1.59 ± 0.011
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.492 ± 0.017
PheCys: 0.924 ± 0.01
PheAsp: 2.335 ± 0.014
PheGlu: 2.298 ± 0.013
PhePhe: 1.91 ± 0.014
PheGly: 3.188 ± 0.018
PheHis: 1.1 ± 0.01
PheIle: 2.015 ± 0.014
PheLys: 2.155 ± 0.013
PheLeu: 4.328 ± 0.018
PheMet: 1.009 ± 0.009
PheAsn: 1.781 ± 0.013
PhePro: 2.046 ± 0.013
PheGln: 1.613 ± 0.012
PheArg: 2.039 ± 0.013
PheSer: 4.029 ± 0.019
PheThr: 2.0 ± 0.013
PheVal: 2.768 ± 0.015
PheTrp: 0.616 ± 0.008
PheTyr: 1.256 ± 0.011
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.959 ± 0.022
GlyCys: 1.359 ± 0.012
GlyAsp: 3.4 ± 0.015
GlyGlu: 3.621 ± 0.017
GlyPhe: 3.324 ± 0.015
GlyGly: 5.378 ± 0.039
GlyHis: 1.602 ± 0.012
GlyIle: 3.587 ± 0.018
GlyLys: 4.084 ± 0.018
GlyLeu: 6.248 ± 0.027
GlyMet: 1.581 ± 0.013
GlyAsn: 3.158 ± 0.019
GlyPro: 2.597 ± 0.015
GlyGln: 2.183 ± 0.014
GlyArg: 3.693 ± 0.02
GlySer: 6.103 ± 0.027
GlyThr: 3.299 ± 0.018
GlyVal: 4.3 ± 0.022
GlyTrp: 0.957 ± 0.008
GlyTyr: 2.051 ± 0.015
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.45 ± 0.011
HisCys: 0.555 ± 0.006
HisAsp: 1.17 ± 0.01
HisGlu: 1.319 ± 0.011
HisPhe: 1.051 ± 0.009
HisGly: 1.815 ± 0.014
HisHis: 1.026 ± 0.013
HisIle: 1.217 ± 0.01
HisLys: 1.218 ± 0.01
HisLeu: 2.507 ± 0.014
HisMet: 0.578 ± 0.007
HisAsn: 1.013 ± 0.01
HisPro: 1.401 ± 0.012
HisGln: 1.093 ± 0.01
HisArg: 1.376 ± 0.01
HisSer: 1.921 ± 0.012
HisThr: 1.003 ± 0.009
HisVal: 1.581 ± 0.011
HisTrp: 0.319 ± 0.005
HisTyr: 0.698 ± 0.008
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.524 ± 0.019
IleCys: 1.084 ± 0.009
IleAsp: 2.763 ± 0.015
IleGlu: 3.105 ± 0.017
IlePhe: 2.193 ± 0.013
IleGly: 3.408 ± 0.016
IleHis: 1.283 ± 0.011
IleIle: 2.645 ± 0.017
IleLys: 2.837 ± 0.015
IleLeu: 5.082 ± 0.022
IleMet: 1.147 ± 0.009
IleAsn: 2.111 ± 0.013
IlePro: 3.025 ± 0.022
IleGln: 1.949 ± 0.012
IleArg: 2.576 ± 0.015
IleSer: 4.803 ± 0.018
IleThr: 2.557 ± 0.016
IleVal: 3.356 ± 0.016
IleTrp: 0.719 ± 0.007
IleTyr: 1.446 ± 0.011
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.139 ± 0.019
LysCys: 0.998 ± 0.01
LysAsp: 3.176 ± 0.018
LysGlu: 4.521 ± 0.024
LysPhe: 2.173 ± 0.014
LysGly: 3.698 ± 0.017
LysHis: 1.361 ± 0.011
LysIle: 3.09 ± 0.015
LysLys: 4.567 ± 0.028
LysLeu: 6.072 ± 0.024
LysMet: 1.487 ± 0.01
LysAsn: 2.622 ± 0.014
LysPro: 2.829 ± 0.017
LysGln: 2.307 ± 0.013
LysArg: 3.623 ± 0.016
LysSer: 4.586 ± 0.023
LysThr: 2.856 ± 0.016
LysVal: 3.839 ± 0.018
LysTrp: 0.8 ± 0.009
LysTyr: 1.609 ± 0.012
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.614 ± 0.028
LeuCys: 1.935 ± 0.014
LeuAsp: 5.15 ± 0.023
LeuGlu: 6.348 ± 0.029
LeuPhe: 3.831 ± 0.019
LeuGly: 6.149 ± 0.025
LeuHis: 2.683 ± 0.014
LeuIle: 4.607 ± 0.023
LeuLys: 6.167 ± 0.025
LeuLeu: 10.07 ± 0.037
LeuMet: 2.222 ± 0.013
LeuAsn: 4.009 ± 0.018
LeuPro: 5.327 ± 0.025
LeuGln: 4.395 ± 0.022
LeuArg: 5.417 ± 0.022
LeuSer: 8.762 ± 0.04
LeuThr: 4.539 ± 0.022
LeuVal: 6.557 ± 0.023
LeuTrp: 1.195 ± 0.01
LeuTyr: 2.488 ± 0.017
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.224 ± 0.014
MetCys: 0.343 ± 0.005
MetAsp: 1.414 ± 0.011
MetGlu: 2.009 ± 0.012
MetPhe: 0.817 ± 0.009
MetGly: 1.686 ± 0.01
MetHis: 0.534 ± 0.007
MetIle: 1.154 ± 0.01
MetLys: 1.639 ± 0.012
MetLeu: 2.248 ± 0.014
MetMet: 0.688 ± 0.008
MetAsn: 1.022 ± 0.008
MetPro: 1.126 ± 0.01
MetGln: 0.941 ± 0.009
MetArg: 1.275 ± 0.01
MetSer: 1.825 ± 0.011
MetThr: 1.066 ± 0.01
MetVal: 1.733 ± 0.011
MetTrp: 0.268 ± 0.005
MetTyr: 0.578 ± 0.008
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.704 ± 0.015
AsnCys: 0.858 ± 0.008
AsnAsp: 2.016 ± 0.012
AsnGlu: 2.465 ± 0.015
AsnPhe: 1.977 ± 0.014
AsnGly: 3.311 ± 0.02
AsnHis: 1.129 ± 0.01
AsnIle: 2.403 ± 0.016
AsnLys: 2.416 ± 0.015
AsnLeu: 4.78 ± 0.028
AsnMet: 1.115 ± 0.011
AsnAsn: 2.399 ± 0.018
AsnPro: 2.47 ± 0.015
AsnGln: 1.753 ± 0.014
AsnArg: 1.974 ± 0.013
AsnSer: 3.965 ± 0.023
AsnThr: 1.957 ± 0.013
AsnVal: 2.747 ± 0.016
AsnTrp: 0.602 ± 0.007
AsnTyr: 1.325 ± 0.011
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.109 ± 0.019
ProCys: 0.809 ± 0.009
ProAsp: 2.498 ± 0.015
ProGlu: 3.131 ± 0.016
ProPhe: 1.983 ± 0.014
ProGly: 2.687 ± 0.016
ProHis: 1.189 ± 0.01
ProIle: 2.402 ± 0.015
ProLys: 2.888 ± 0.014
ProLeu: 4.422 ± 0.021
ProMet: 1.002 ± 0.009
ProAsn: 2.49 ± 0.014
ProPro: 3.841 ± 0.04
ProGln: 1.927 ± 0.016
ProArg: 2.398 ± 0.015
ProSer: 5.259 ± 0.025
ProThr: 2.771 ± 0.015
ProVal: 3.01 ± 0.02
ProTrp: 0.637 ± 0.006
ProTyr: 1.342 ± 0.012
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.516 ± 0.015
GlnCys: 0.606 ± 0.007
GlnAsp: 1.7 ± 0.013
GlnGlu: 2.408 ± 0.015
GlnPhe: 1.431 ± 0.01
GlnGly: 2.264 ± 0.012
GlnHis: 0.957 ± 0.01
GlnIle: 2.016 ± 0.014
GlnLys: 2.333 ± 0.015
GlnLeu: 3.823 ± 0.02
GlnMet: 1.005 ± 0.01
GlnAsn: 1.835 ± 0.015
GlnPro: 1.849 ± 0.015
GlnGln: 2.074 ± 0.023
GlnArg: 2.143 ± 0.013
GlnSer: 2.97 ± 0.017
GlnThr: 1.84 ± 0.012
GlnVal: 2.446 ± 0.013
GlnTrp: 0.493 ± 0.006
GlnTyr: 0.932 ± 0.008
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.321 ± 0.017
ArgCys: 0.98 ± 0.009
ArgAsp: 2.569 ± 0.015
ArgGlu: 3.265 ± 0.02
ArgPhe: 2.195 ± 0.014
ArgGly: 3.216 ± 0.018
ArgHis: 1.276 ± 0.011
ArgIle: 2.755 ± 0.015
ArgLys: 3.757 ± 0.021
ArgLeu: 5.073 ± 0.02
ArgMet: 1.358 ± 0.011
ArgAsn: 2.433 ± 0.013
ArgPro: 2.423 ± 0.017
ArgGln: 1.844 ± 0.012
ArgArg: 3.92 ± 0.024
ArgSer: 4.276 ± 0.025
ArgThr: 2.461 ± 0.016
ArgVal: 3.378 ± 0.018
ArgTrp: 0.738 ± 0.007
ArgTyr: 1.398 ± 0.011
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.459 ± 0.022
SerCys: 1.742 ± 0.015
SerAsp: 4.315 ± 0.019
SerGlu: 4.793 ± 0.022
SerPhe: 3.959 ± 0.017
SerGly: 5.95 ± 0.024
SerHis: 2.06 ± 0.014
SerIle: 4.434 ± 0.019
SerLys: 4.912 ± 0.022
SerLeu: 8.788 ± 0.035
SerMet: 2.136 ± 0.016
SerAsn: 4.145 ± 0.022
SerPro: 4.534 ± 0.028
SerGln: 3.123 ± 0.017
SerArg: 4.347 ± 0.023
SerSer: 11.03 ± 0.044
SerThr: 4.801 ± 0.021
SerVal: 5.232 ± 0.022
SerTrp: 1.245 ± 0.011
SerTyr: 2.304 ± 0.015
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.356 ± 0.017
ThrCys: 0.956 ± 0.01
ThrAsp: 2.229 ± 0.012
ThrGlu: 2.792 ± 0.019
ThrPhe: 2.057 ± 0.013
ThrGly: 3.242 ± 0.019
ThrHis: 1.105 ± 0.01
ThrIle: 2.695 ± 0.017
ThrLys: 2.752 ± 0.015
ThrLeu: 4.752 ± 0.02
ThrMet: 1.207 ± 0.01
ThrAsn: 2.151 ± 0.013
ThrPro: 2.591 ± 0.014
ThrGln: 1.705 ± 0.013
ThrArg: 2.378 ± 0.014
ThrSer: 4.681 ± 0.023
ThrThr: 3.093 ± 0.017
ThrVal: 3.296 ± 0.017
ThrTrp: 0.699 ± 0.008
ThrTyr: 1.412 ± 0.012
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.896 ± 0.019
ValCys: 1.2 ± 0.011
ValAsp: 3.709 ± 0.018
ValGlu: 4.434 ± 0.019
ValPhe: 2.737 ± 0.016
ValGly: 4.316 ± 0.02
ValHis: 1.558 ± 0.012
ValIle: 3.267 ± 0.014
ValLys: 3.845 ± 0.02
ValLeu: 6.406 ± 0.023
ValMet: 1.549 ± 0.011
ValAsn: 2.579 ± 0.014
ValPro: 3.285 ± 0.019
ValGln: 2.376 ± 0.013
ValArg: 3.109 ± 0.016
ValSer: 5.394 ± 0.02
ValThr: 3.163 ± 0.016
ValVal: 5.062 ± 0.026
ValTrp: 0.809 ± 0.008
ValTyr: 1.903 ± 0.013
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.819 ± 0.009
TrpCys: 0.249 ± 0.004
TrpAsp: 0.72 ± 0.008
TrpGlu: 0.783 ± 0.008
TrpPhe: 0.58 ± 0.007
TrpGly: 0.803 ± 0.008
TrpHis: 0.3 ± 0.005
TrpIle: 0.68 ± 0.007
TrpLys: 0.967 ± 0.009
TrpLeu: 1.32 ± 0.011
TrpMet: 0.345 ± 0.005
TrpAsn: 0.737 ± 0.009
TrpPro: 0.528 ± 0.007
TrpGln: 0.451 ± 0.007
TrpArg: 0.863 ± 0.008
TrpSer: 1.015 ± 0.009
TrpThr: 0.688 ± 0.007
TrpVal: 0.91 ± 0.009
TrpTrp: 0.258 ± 0.005
TrpTyr: 0.335 ± 0.005
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.825 ± 0.014
TyrCys: 0.607 ± 0.006
TyrAsp: 1.468 ± 0.012
TyrGlu: 1.56 ± 0.012
TyrPhe: 1.249 ± 0.011
TyrGly: 2.125 ± 0.016
TyrHis: 0.714 ± 0.007
TyrIle: 1.4 ± 0.011
TyrLys: 1.521 ± 0.011
TyrLeu: 2.76 ± 0.016
TyrMet: 0.737 ± 0.009
TyrAsn: 1.345 ± 0.012
TyrPro: 1.221 ± 0.01
TyrGln: 0.95 ± 0.01
TyrArg: 1.386 ± 0.01
TyrSer: 2.23 ± 0.015
TyrThr: 1.294 ± 0.01
TyrVal: 1.761 ± 0.013
TyrTrp: 0.412 ± 0.006
TyrTyr: 0.948 ± 0.013
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 40375 proteins (12929275 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
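A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the standard deviation across replicates is reported as the error. The exact procedure used by Proteome-pI is not described here, and the names `bootstrap_permille`, `n_boot` and `seed` are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_permille(sequences, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each replicate resamples whole proteins with replacement,
    so the protein is the unit of resampling, not the single residue.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(s) for s in sequences]
    totals = [max(len(s) - 1, 0) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        idx = [rng.randrange(len(sequences)) for _ in sequences]
        hits = sum(per_protein[i][dipeptide] for i in idx)
        total = sum(totals[i] for i in idx)
        if total:  # skip replicates that contain no dipeptides at all
            estimates.append(1000.0 * hits / total)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (assumption, not real proteome data): five identical proteins,
# so every replicate yields the same frequency and the spread is zero.
mean_al, sd_al = bootstrap_permille(["ALAL"] * 5, "AL")
```

Resampling whole proteins rather than individual residues keeps the dipeptides within each protein together, which is one plausible reading of "at the protein level".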

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski