Amino acid dipeptide frequency for Helianthus annuus (Common sunflower)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
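
As a rough illustration of how such frequencies can be obtained, the following Python sketch counts overlapping residue pairs in a set of protein sequences and reports them in per mille. It is not the database's own code: the function name `dipeptide_permille`, the toy sequences, the mapping of every non-standard letter to `X` (Xaa), and the choice to count overlapping pairs within each protein are all assumptions made for this example.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Pairs never span two proteins. Any letter outside the 20 standard
    ones is mapped to 'X' (Xaa); this mapping is an assumption.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation for the 400 standard dipeptides: 1000 / 400 = 2.5 per mille.
expected_permille = 1000.0 / len(STANDARD) ** 2

# Hypothetical toy input, not taken from the sunflower proteome.
toy = ["MKLLA", "LLK"]
freqs = dipeptide_permille(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f} per mille")
```

Because every pair contributes exactly once to the total, the reported values always sum to 1000‰, which is a convenient sanity check on real data.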

All values are given in per mille (‰); to obtain fractions, multiply them by 10⁻³.
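
The unit conversion and the comparison with the uniform expectation can be sketched in a few lines of Python. The helper names `permille_to_fraction` and `classify` are hypothetical, and the rule used here (a plain comparison against 2.5‰, i.e. 0.25%) is an assumption based on the description above; the exact rule behind the red and blue marking is not stated on this page.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille (0.25%).
UNIFORM_PERMILLE = 1000.0 / 400

def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def classify(value_permille, expected=UNIFORM_PERMILLE):
    """Label a dipeptide relative to the expected frequency (assumed rule)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"

# Two values taken from the table: LeuLeu and TrpTrp.
for name, value in [("LeuLeu", 9.723), ("TrpTrp", 0.291)]:
    print(name, permille_to_fraction(value), classify(value))
```

With this rule, LeuLeu (9.723‰) falls well above the uniform expectation and TrpTrp (0.291‰) well below it.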

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.364 ± 0.027
AlaCys: 1.176 ± 0.009
AlaAsp: 2.883 ± 0.016
AlaGlu: 3.456 ± 0.018
AlaPhe: 2.583 ± 0.013
AlaGly: 3.695 ± 0.017
AlaHis: 1.282 ± 0.009
AlaIle: 3.583 ± 0.016
AlaLys: 3.68 ± 0.014
AlaLeu: 5.866 ± 0.02
AlaMet: 1.677 ± 0.012
AlaAsn: 2.534 ± 0.014
AlaPro: 2.591 ± 0.015
AlaGln: 1.914 ± 0.011
AlaArg: 3.126 ± 0.015
AlaSer: 5.163 ± 0.017
AlaThr: 3.599 ± 0.014
AlaVal: 4.404 ± 0.017
AlaTrp: 0.718 ± 0.006
AlaTyr: 1.855 ± 0.01
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.919 ± 0.008
CysCys: 0.565 ± 0.011
CysAsp: 0.945 ± 0.009
CysGlu: 0.912 ± 0.008
CysPhe: 0.989 ± 0.008
CysGly: 1.412 ± 0.01
CysHis: 0.473 ± 0.005
CysIle: 1.063 ± 0.009
CysLys: 1.211 ± 0.009
CysLeu: 1.923 ± 0.011
CysMet: 0.581 ± 0.019
CysAsn: 0.919 ± 0.008
CysPro: 0.88 ± 0.011
CysGln: 0.551 ± 0.006
CysArg: 1.036 ± 0.008
CysSer: 1.721 ± 0.011
CysThr: 0.923 ± 0.007
CysVal: 1.245 ± 0.009
CysTrp: 0.265 ± 0.004
CysTyr: 0.656 ± 0.007
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.222 ± 0.015
AspCys: 0.947 ± 0.009
AspAsp: 4.047 ± 0.039
AspGlu: 4.022 ± 0.019
AspPhe: 2.378 ± 0.013
AspGly: 3.73 ± 0.015
AspHis: 1.354 ± 0.009
AspIle: 3.088 ± 0.014
AspLys: 2.743 ± 0.014
AspLeu: 5.142 ± 0.019
AspMet: 1.37 ± 0.008
AspAsn: 2.21 ± 0.012
AspPro: 2.609 ± 0.013
AspGln: 1.819 ± 0.01
AspArg: 2.344 ± 0.014
AspSer: 3.896 ± 0.017
AspThr: 2.368 ± 0.013
AspVal: 3.906 ± 0.018
AspTrp: 0.739 ± 0.007
AspTyr: 1.643 ± 0.01
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.143 ± 0.021
GluCys: 0.924 ± 0.008
GluAsp: 3.692 ± 0.021
GluGlu: 5.209 ± 0.034
GluPhe: 2.409 ± 0.011
GluGly: 3.283 ± 0.015
GluHis: 1.25 ± 0.008
GluIle: 3.702 ± 0.014
GluLys: 4.461 ± 0.024
GluLeu: 5.716 ± 0.023
GluMet: 1.737 ± 0.011
GluAsn: 3.041 ± 0.014
GluPro: 2.082 ± 0.013
GluGln: 1.979 ± 0.012
GluArg: 3.019 ± 0.017
GluSer: 4.425 ± 0.021
GluThr: 3.144 ± 0.014
GluVal: 4.048 ± 0.017
GluTrp: 0.745 ± 0.007
GluTyr: 1.753 ± 0.01
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.401 ± 0.013
PheCys: 0.919 ± 0.007
PheAsp: 2.532 ± 0.012
PheGlu: 2.413 ± 0.013
PhePhe: 2.144 ± 0.016
PheGly: 3.266 ± 0.016
PheHis: 1.189 ± 0.008
PheIle: 2.376 ± 0.012
PheLys: 2.371 ± 0.012
PheLeu: 4.35 ± 0.018
PheMet: 1.101 ± 0.008
PheAsn: 1.976 ± 0.012
PhePro: 1.961 ± 0.013
PheGln: 1.559 ± 0.009
PheArg: 2.087 ± 0.012
PheSer: 4.017 ± 0.017
PheThr: 2.324 ± 0.012
PheVal: 2.983 ± 0.015
PheTrp: 0.605 ± 0.006
PheTyr: 1.392 ± 0.01
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.468 ± 0.019
GlyCys: 1.382 ± 0.011
GlyAsp: 3.39 ± 0.016
GlyGlu: 3.367 ± 0.013
GlyPhe: 3.279 ± 0.015
GlyGly: 5.474 ± 0.049
GlyHis: 1.474 ± 0.01
GlyIle: 3.496 ± 0.018
GlyLys: 3.856 ± 0.016
GlyLeu: 5.837 ± 0.022
GlyMet: 1.493 ± 0.01
GlyAsn: 3.028 ± 0.016
GlyPro: 2.433 ± 0.014
GlyGln: 1.995 ± 0.011
GlyArg: 3.34 ± 0.014
GlySer: 5.746 ± 0.021
GlyThr: 3.199 ± 0.014
GlyVal: 4.398 ± 0.017
GlyTrp: 0.912 ± 0.008
GlyTyr: 2.28 ± 0.012
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.337 ± 0.01
HisCys: 0.505 ± 0.005
HisAsp: 1.263 ± 0.009
HisGlu: 1.425 ± 0.009
HisPhe: 1.123 ± 0.009
HisGly: 1.692 ± 0.01
HisHis: 1.083 ± 0.012
HisIle: 1.352 ± 0.009
HisLys: 1.334 ± 0.01
HisLeu: 2.603 ± 0.016
HisMet: 0.635 ± 0.006
HisAsn: 1.115 ± 0.009
HisPro: 1.441 ± 0.011
HisGln: 1.054 ± 0.008
HisArg: 1.445 ± 0.01
HisSer: 1.799 ± 0.011
HisThr: 1.218 ± 0.008
HisVal: 1.655 ± 0.011
HisTrp: 0.307 ± 0.004
HisTyr: 0.742 ± 0.007
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.434 ± 0.016
IleCys: 1.201 ± 0.01
IleAsp: 3.044 ± 0.014
IleGlu: 3.163 ± 0.015
IlePhe: 2.439 ± 0.013
IleGly: 3.52 ± 0.017
IleHis: 1.502 ± 0.009
IleIle: 3.194 ± 0.016
IleLys: 3.253 ± 0.016
IleLeu: 5.391 ± 0.02
IleMet: 1.307 ± 0.01
IleAsn: 2.472 ± 0.015
IlePro: 3.075 ± 0.019
IleGln: 2.035 ± 0.011
IleArg: 2.779 ± 0.011
IleSer: 4.832 ± 0.018
IleThr: 3.018 ± 0.013
IleVal: 3.635 ± 0.015
IleTrp: 0.769 ± 0.007
IleTyr: 1.727 ± 0.011
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.809 ± 0.019
LysCys: 1.075 ± 0.009
LysAsp: 3.309 ± 0.018
LysGlu: 4.381 ± 0.021
LysPhe: 2.255 ± 0.012
LysGly: 3.61 ± 0.015
LysHis: 1.534 ± 0.01
LysIle: 3.479 ± 0.014
LysLys: 5.036 ± 0.025
LysLeu: 6.063 ± 0.021
LysMet: 1.635 ± 0.011
LysAsn: 2.889 ± 0.012
LysPro: 2.844 ± 0.016
LysGln: 2.309 ± 0.013
LysArg: 3.628 ± 0.015
LysSer: 4.773 ± 0.02
LysThr: 3.311 ± 0.016
LysVal: 4.031 ± 0.016
LysTrp: 0.87 ± 0.008
LysTyr: 1.735 ± 0.012
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.95 ± 0.021
LeuCys: 1.86 ± 0.01
LeuAsp: 5.054 ± 0.02
LeuGlu: 5.971 ± 0.027
LeuPhe: 4.104 ± 0.016
LeuGly: 5.541 ± 0.021
LeuHis: 2.728 ± 0.015
LeuIle: 5.001 ± 0.018
LeuLys: 6.462 ± 0.021
LeuLeu: 9.723 ± 0.035
LeuMet: 2.241 ± 0.012
LeuAsn: 4.239 ± 0.015
LeuPro: 4.905 ± 0.018
LeuGln: 4.027 ± 0.02
LeuArg: 4.966 ± 0.02
LeuSer: 8.351 ± 0.03
LeuThr: 5.075 ± 0.019
LeuVal: 6.511 ± 0.02
LeuTrp: 1.196 ± 0.009
LeuTyr: 2.734 ± 0.014
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.991 ± 0.012
MetCys: 0.405 ± 0.005
MetAsp: 1.451 ± 0.009
MetGlu: 1.82 ± 0.011
MetPhe: 1.027 ± 0.007
MetGly: 1.587 ± 0.01
MetHis: 0.577 ± 0.005
MetIle: 1.485 ± 0.009
MetLys: 1.821 ± 0.01
MetLeu: 2.371 ± 0.011
MetMet: 0.856 ± 0.021
MetAsn: 1.215 ± 0.009
MetPro: 1.004 ± 0.008
MetGln: 0.886 ± 0.007
MetArg: 1.15 ± 0.01
MetSer: 1.897 ± 0.01
MetThr: 1.212 ± 0.009
MetVal: 1.832 ± 0.011
MetTrp: 0.296 ± 0.004
MetTyr: 0.797 ± 0.02
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.531 ± 0.015
AsnCys: 0.844 ± 0.007
AsnAsp: 2.364 ± 0.012
AsnGlu: 2.672 ± 0.015
AsnPhe: 1.978 ± 0.012
AsnGly: 3.354 ± 0.014
AsnHis: 1.3 ± 0.009
AsnIle: 2.742 ± 0.015
AsnLys: 2.739 ± 0.014
AsnLeu: 4.929 ± 0.024
AsnMet: 1.286 ± 0.009
AsnAsn: 2.874 ± 0.022
AsnPro: 2.476 ± 0.013
AsnGln: 1.826 ± 0.012
AsnArg: 2.233 ± 0.013
AsnSer: 3.718 ± 0.015
AsnThr: 2.295 ± 0.012
AsnVal: 3.001 ± 0.014
AsnTrp: 0.605 ± 0.007
AsnTyr: 1.397 ± 0.009
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.693 ± 0.014
ProCys: 0.729 ± 0.008
ProAsp: 2.432 ± 0.014
ProGlu: 3.01 ± 0.015
ProPhe: 2.052 ± 0.012
ProGly: 2.459 ± 0.013
ProHis: 1.157 ± 0.008
ProIle: 2.423 ± 0.013
ProLys: 2.83 ± 0.015
ProLeu: 4.162 ± 0.017
ProMet: 1.01 ± 0.008
ProAsn: 2.489 ± 0.012
ProPro: 3.848 ± 0.045
ProGln: 1.736 ± 0.012
ProArg: 2.168 ± 0.013
ProSer: 4.525 ± 0.021
ProThr: 2.882 ± 0.015
ProVal: 3.281 ± 0.016
ProTrp: 0.594 ± 0.006
ProTyr: 1.369 ± 0.01
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.108 ± 0.011
GlnCys: 0.554 ± 0.006
GlnAsp: 1.652 ± 0.011
GlnGlu: 2.26 ± 0.014
GlnPhe: 1.37 ± 0.008
GlnGly: 1.957 ± 0.011
GlnHis: 0.963 ± 0.009
GlnIle: 2.041 ± 0.011
GlnLys: 2.286 ± 0.015
GlnLeu: 3.502 ± 0.017
GlnMet: 0.98 ± 0.008
GlnAsn: 1.764 ± 0.013
GlnPro: 1.765 ± 0.014
GlnGln: 1.975 ± 0.022
GlnArg: 1.968 ± 0.011
GlnSer: 2.696 ± 0.015
GlnThr: 1.931 ± 0.012
GlnVal: 2.326 ± 0.012
GlnTrp: 0.463 ± 0.005
GlnTyr: 0.9 ± 0.007
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.888 ± 0.014
ArgCys: 1.019 ± 0.01
ArgAsp: 2.476 ± 0.013
ArgGlu: 2.941 ± 0.016
ArgPhe: 2.442 ± 0.012
ArgGly: 2.954 ± 0.015
ArgHis: 1.269 ± 0.009
ArgIle: 2.87 ± 0.014
ArgLys: 3.583 ± 0.017
ArgLeu: 5.057 ± 0.021
ArgMet: 1.369 ± 0.009
ArgAsn: 2.424 ± 0.012
ArgPro: 2.142 ± 0.013
ArgGln: 1.657 ± 0.01
ArgArg: 3.591 ± 0.018
ArgSer: 4.215 ± 0.019
ArgThr: 2.45 ± 0.012
ArgVal: 3.45 ± 0.015
ArgTrp: 0.777 ± 0.007
ArgTyr: 1.58 ± 0.01
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.462 ± 0.016
SerCys: 1.694 ± 0.011
SerAsp: 4.297 ± 0.017
SerGlu: 4.266 ± 0.018
SerPhe: 4.097 ± 0.016
SerGly: 5.713 ± 0.02
SerHis: 1.982 ± 0.011
SerIle: 4.579 ± 0.019
SerLys: 4.856 ± 0.02
SerLeu: 8.383 ± 0.029
SerMet: 2.115 ± 0.012
SerAsn: 4.128 ± 0.018
SerPro: 4.148 ± 0.023
SerGln: 2.814 ± 0.013
SerArg: 4.226 ± 0.018
SerSer: 9.695 ± 0.036
SerThr: 4.783 ± 0.02
SerVal: 5.251 ± 0.019
SerTrp: 1.182 ± 0.008
SerTyr: 2.462 ± 0.011
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.186 ± 0.016
ThrCys: 1.06 ± 0.009
ThrAsp: 2.465 ± 0.011
ThrGlu: 2.803 ± 0.015
ThrPhe: 2.298 ± 0.01
ThrGly: 3.482 ± 0.015
ThrHis: 1.323 ± 0.009
ThrIle: 3.141 ± 0.014
ThrLys: 3.01 ± 0.015
ThrLeu: 5.067 ± 0.019
ThrMet: 1.335 ± 0.009
ThrAsn: 2.526 ± 0.013
ThrPro: 2.815 ± 0.017
ThrGln: 1.703 ± 0.01
ThrArg: 2.638 ± 0.011
ThrSer: 4.829 ± 0.018
ThrThr: 3.695 ± 0.018
ThrVal: 3.495 ± 0.015
ThrTrp: 0.716 ± 0.007
ThrTyr: 1.604 ± 0.011
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.628 ± 0.017
ValCys: 1.382 ± 0.019
ValAsp: 3.925 ± 0.017
ValGlu: 4.267 ± 0.017
ValPhe: 3.003 ± 0.015
ValGly: 4.159 ± 0.018
ValHis: 1.561 ± 0.009
ValIle: 3.708 ± 0.017
ValLys: 4.295 ± 0.016
ValLeu: 6.29 ± 0.021
ValMet: 1.693 ± 0.009
ValAsn: 3.06 ± 0.014
ValPro: 3.029 ± 0.014
ValGln: 2.226 ± 0.012
ValArg: 3.058 ± 0.013
ValSer: 5.495 ± 0.021
ValThr: 3.598 ± 0.016
ValVal: 5.494 ± 0.062
ValTrp: 0.897 ± 0.008
ValTyr: 2.205 ± 0.011
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.77 ± 0.007
TrpCys: 0.279 ± 0.004
TrpAsp: 0.692 ± 0.007
TrpGlu: 0.736 ± 0.007
TrpPhe: 0.628 ± 0.005
TrpGly: 0.766 ± 0.007
TrpHis: 0.3 ± 0.004
TrpIle: 0.767 ± 0.007
TrpLys: 0.966 ± 0.007
TrpLeu: 1.295 ± 0.009
TrpMet: 0.392 ± 0.004
TrpAsn: 0.721 ± 0.006
TrpPro: 0.474 ± 0.005
TrpGln: 0.425 ± 0.005
TrpArg: 0.855 ± 0.007
TrpSer: 1.04 ± 0.008
TrpThr: 0.635 ± 0.006
TrpVal: 0.942 ± 0.008
TrpTrp: 0.291 ± 0.007
TrpTyr: 0.404 ± 0.005
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.845 ± 0.01
TyrCys: 0.67 ± 0.007
TyrAsp: 1.664 ± 0.011
TyrGlu: 1.663 ± 0.012
TyrPhe: 1.366 ± 0.009
TyrGly: 2.193 ± 0.014
TyrHis: 0.826 ± 0.007
TyrIle: 1.701 ± 0.01
TyrLys: 1.783 ± 0.012
TyrLeu: 2.968 ± 0.014
TyrMet: 0.865 ± 0.007
TyrAsn: 1.541 ± 0.011
TyrPro: 1.329 ± 0.009
TyrGln: 0.973 ± 0.007
TyrArg: 1.498 ± 0.01
TyrSer: 2.299 ± 0.011
TyrThr: 1.531 ± 0.011
TyrVal: 2.089 ± 0.021
TyrTrp: 0.432 ± 0.005
TyrTyr: 1.088 ± 0.009
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.072 ± 0.024
Statistics based on 51240 proteins (17848194 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
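
A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates serves as the error. This is only an illustration under stated assumptions. The page does not say which spread statistic is reported, so the use of the sample standard deviation is an assumption, as are the function names `pair_counts` and `bootstrap_error`, the fixed seed, and the toy sequences.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping residue pairs within a single protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each replicate resamples whole proteins with replacement, so pairs
    from one protein always stay together (protein-level bootstrap).
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Hypothetical toy input, not taken from the sunflower proteome.
toy = ["MKLLA", "LLK", "AAAA", "MLLLK"]
mean, err = bootstrap_error(toy, "LL")
print(f"LL: {mean:.3f} ± {err:.3f} per mille")
```

Resampling proteins rather than individual residue pairs keeps the pairs within each protein together, so proteins with unusual composition contribute to the estimated error.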

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski