Amino acid dipeptide frequency for Bicyclus anynana (Squinting bush brown butterfly)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
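As a minimal sketch of how such frequencies could be computed, the snippet below counts overlapping pairs of adjacent residues in a list of one-letter protein sequences and reports them in per mille. The function name `dipeptide_permille`, the toy sequences, and the choice to map every non-standard letter to `X` (Xaa) are assumptions made for illustration; the page does not describe the exact counting procedure used by the database.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping adjacent pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input: two short sequences, one with a non-standard residue (U).
toy = ["MKLLA", "CAGU"]
freqs = dipeptide_permille(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

Counting per protein, rather than over one concatenated string, avoids creating artificial dipeptides at the boundary between two proteins.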

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each of the 400 dipeptides would have a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
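The arithmetic behind the uniform expectation, and the red/blue marking it implies, can be sketched as follows. The `colour` helper is hypothetical, and the threshold is assumed to be exactly the uniform baseline described above; the page does not state the precise rule it uses. The observed values are copied from the table below.

```python
N_STANDARD = 20

# Uniform expectation: 1 / 400 of all dipeptides, expressed in per mille.
expected_permille = 1000.0 / N_STANDARD ** 2        # 2.5 per mille = 0.25 %
# With Xaa as a 21st symbol there are 441 combinations.
expected_with_xaa = 1000.0 / (N_STANDARD + 1) ** 2  # about 2.27 per mille

# A few observed values (per mille) taken from the table.
observed = {
    "AlaAla": 7.511,
    "LeuLeu": 8.611,
    "AspPro": 2.545,
    "CysCys": 0.511,
    "TrpTrp": 0.207,
}


def colour(value, expected=expected_permille):
    """Assumed marking rule: red above the baseline, blue below it."""
    if value > expected:
        return "red"
    if value < expected:
        return "blue"
    return "none"


labels = {pair: colour(value) for pair, value in observed.items()}
for pair, label in labels.items():
    print(f"{pair}: {observed[pair]} per mille -> {label}")
```

Under this assumed rule, AlaAla, LeuLeu and AspPro come out overrepresented, while CysCys and TrpTrp come out underrepresented.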

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
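A small sketch of the unit conversion, using the AlaAla value from the table. The estimate of absolute occurrences is an assumption layered on top: it supposes that dipeptides are counted as overlapping pairs within each protein, so that the number of pairs equals the number of amino acids minus the number of proteins reported at the end of the table.

```python
def permille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 7.511  # per mille, from the table
fraction = permille_to_fraction(ala_ala)
percent = permille_to_percent(ala_ala)

# Assumption: overlapping pairs within proteins, so each protein of length n
# contributes n - 1 dipeptides.
n_proteins = 19396
n_amino_acids = 11249408
n_dipeptides = n_amino_acids - n_proteins
approx_count = fraction * n_dipeptides

print(f"fraction = {fraction:.6f}, percent = {percent:.4f} %")
print(f"approximate AlaAla occurrences: {approx_count:.0f}")
```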

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
7.511AlaAla: 7.511 ± 0.068
1.376AlaCys: 1.376 ± 0.028
3.563AlaAsp: 3.563 ± 0.025
4.48AlaGlu: 4.48 ± 0.032
2.24AlaPhe: 2.24 ± 0.018
4.259AlaGly: 4.259 ± 0.03
1.87AlaHis: 1.87 ± 0.014
3.192AlaIle: 3.192 ± 0.02
3.746AlaLys: 3.746 ± 0.024
6.837AlaLeu: 6.837 ± 0.044
1.496AlaMet: 1.496 ± 0.014
2.652AlaAsn: 2.652 ± 0.016
4.417AlaPro: 4.417 ± 0.038
2.639AlaGln: 2.639 ± 0.022
4.344AlaArg: 4.344 ± 0.034
5.111AlaSer: 5.111 ± 0.03
3.828AlaThr: 3.828 ± 0.023
4.737AlaVal: 4.737 ± 0.027
0.748AlaTrp: 0.748 ± 0.01
1.858AlaTyr: 1.858 ± 0.016
0.004AlaXaa: 0.004 ± 0.001
Cys
1.444CysAla: 1.444 ± 0.02
0.511CysCys: 0.511 ± 0.008
1.273CysAsp: 1.273 ± 0.019
1.289CysGlu: 1.289 ± 0.019
0.71CysPhe: 0.71 ± 0.011
1.402CysGly: 1.402 ± 0.032
0.479CysHis: 0.479 ± 0.009
0.997CysIle: 0.997 ± 0.026
1.114CysLys: 1.114 ± 0.02
1.704CysLeu: 1.704 ± 0.023
0.375CysMet: 0.375 ± 0.007
0.973CysAsn: 0.973 ± 0.017
0.989CysPro: 0.989 ± 0.027
0.68CysGln: 0.68 ± 0.015
1.16CysArg: 1.16 ± 0.03
1.527CysSer: 1.527 ± 0.029
1.085CysThr: 1.085 ± 0.023
1.422CysVal: 1.422 ± 0.023
0.216CysTrp: 0.216 ± 0.004
0.592CysTyr: 0.592 ± 0.009
0.001CysXaa: 0.001 ± 0.0
Asp
3.646AspAla: 3.646 ± 0.025
1.002AspCys: 1.002 ± 0.018
4.147AspAsp: 4.147 ± 0.038
4.358AspGlu: 4.358 ± 0.026
2.101AspPhe: 2.101 ± 0.017
3.215AspGly: 3.215 ± 0.025
1.171AspHis: 1.171 ± 0.012
3.529AspIle: 3.529 ± 0.023
3.697AspLys: 3.697 ± 0.029
4.746AspLeu: 4.746 ± 0.028
1.271AspMet: 1.271 ± 0.011
2.863AspAsn: 2.863 ± 0.023
2.545AspPro: 2.545 ± 0.027
1.627AspGln: 1.627 ± 0.013
2.752AspArg: 2.752 ± 0.026
4.345AspSer: 4.345 ± 0.031
3.203AspThr: 3.203 ± 0.023
3.796AspVal: 3.796 ± 0.021
0.629AspTrp: 0.629 ± 0.011
1.877AspTyr: 1.877 ± 0.015
0.001AspXaa: 0.001 ± 0.0
Glu
4.47GluAla: 4.47 ± 0.032
1.251GluCys: 1.251 ± 0.034
4.063GluAsp: 4.063 ± 0.026
5.964GluGlu: 5.964 ± 0.056
2.062GluPhe: 2.062 ± 0.017
3.213GluGly: 3.213 ± 0.029
1.524GluHis: 1.524 ± 0.012
3.727GluIle: 3.727 ± 0.026
5.109GluLys: 5.109 ± 0.044
5.935GluLeu: 5.935 ± 0.035
1.53GluMet: 1.53 ± 0.013
3.565GluAsn: 3.565 ± 0.027
3.169GluPro: 3.169 ± 0.038
2.693GluGln: 2.693 ± 0.027
4.142GluArg: 4.142 ± 0.031
4.497GluSer: 4.497 ± 0.028
3.666GluThr: 3.666 ± 0.027
4.029GluVal: 4.029 ± 0.029
0.692GluTrp: 0.692 ± 0.008
2.015GluTyr: 2.015 ± 0.017
0.001GluXaa: 0.001 ± 0.0
Phe
2.18PheAla: 2.18 ± 0.017
0.753PheCys: 0.753 ± 0.009
2.055PheAsp: 2.055 ± 0.016
2.009PheGlu: 2.009 ± 0.017
1.32PhePhe: 1.32 ± 0.014
2.226PheGly: 2.226 ± 0.02
0.882PheHis: 0.882 ± 0.009
2.014PheIle: 2.014 ± 0.018
2.084PheLys: 2.084 ± 0.017
3.127PheLeu: 3.127 ± 0.022
0.803PheMet: 0.803 ± 0.01
1.754PheAsn: 1.754 ± 0.013
1.521PhePro: 1.521 ± 0.014
1.277PheGln: 1.277 ± 0.011
1.888PheArg: 1.888 ± 0.02
2.675PheSer: 2.675 ± 0.018
2.146PheThr: 2.146 ± 0.018
2.378PheVal: 2.378 ± 0.018
0.419PheTrp: 0.419 ± 0.007
1.267PheTyr: 1.267 ± 0.012
0.001PheXaa: 0.001 ± 0.0
Gly
4.34GlyAla: 4.34 ± 0.033
1.015GlyCys: 1.015 ± 0.014
3.065GlyAsp: 3.065 ± 0.024
3.485GlyGlu: 3.485 ± 0.03
2.069GlyPhe: 2.069 ± 0.018
4.527GlyGly: 4.527 ± 0.069
1.357GlyHis: 1.357 ± 0.014
2.687GlyIle: 2.687 ± 0.022
3.205GlyLys: 3.205 ± 0.027
4.468GlyLeu: 4.468 ± 0.023
1.078GlyMet: 1.078 ± 0.012
2.412GlyAsn: 2.412 ± 0.018
2.551GlyPro: 2.551 ± 0.042
2.049GlyGln: 2.049 ± 0.033
3.203GlyArg: 3.203 ± 0.023
4.421GlySer: 4.421 ± 0.039
2.968GlyThr: 2.968 ± 0.022
3.686GlyVal: 3.686 ± 0.025
0.706GlyTrp: 0.706 ± 0.009
2.011GlyTyr: 2.011 ± 0.022
0.002GlyXaa: 0.002 ± 0.001
His
1.7HisAla: 1.7 ± 0.017
0.576HisCys: 0.576 ± 0.01
1.195HisAsp: 1.195 ± 0.011
1.39HisGlu: 1.39 ± 0.012
0.955HisPhe: 0.955 ± 0.011
1.332HisGly: 1.332 ± 0.014
0.947HisHis: 0.947 ± 0.017
1.374HisIle: 1.374 ± 0.013
1.467HisLys: 1.467 ± 0.014
2.284HisLeu: 2.284 ± 0.017
0.747HisMet: 0.747 ± 0.012
1.187HisAsn: 1.187 ± 0.013
1.352HisPro: 1.352 ± 0.011
1.015HisGln: 1.015 ± 0.012
1.453HisArg: 1.453 ± 0.013
1.928HisSer: 1.928 ± 0.014
1.577HisThr: 1.577 ± 0.018
1.551HisVal: 1.551 ± 0.012
0.295HisTrp: 0.295 ± 0.005
0.888HisTyr: 0.888 ± 0.009
0.001HisXaa: 0.001 ± 0.0
Ile
3.332IleAla: 3.332 ± 0.021
1.171IleCys: 1.171 ± 0.022
3.157IleAsp: 3.157 ± 0.025
3.51IleGlu: 3.51 ± 0.026
1.989IlePhe: 1.989 ± 0.018
2.561IleGly: 2.561 ± 0.018
1.271IleHis: 1.271 ± 0.013
3.105IleIle: 3.105 ± 0.021
3.752IleLys: 3.752 ± 0.029
4.647IleLeu: 4.647 ± 0.033
1.112IleMet: 1.112 ± 0.013
2.806IleAsn: 2.806 ± 0.025
2.665IlePro: 2.665 ± 0.02
2.065IleGln: 2.065 ± 0.015
2.506IleArg: 2.506 ± 0.017
3.937IleSer: 3.937 ± 0.024
3.3IleThr: 3.3 ± 0.023
3.417IleVal: 3.417 ± 0.02
0.499IleTrp: 0.499 ± 0.008
1.62IleTyr: 1.62 ± 0.012
0.002IleXaa: 0.002 ± 0.0
Lys
3.635LysAla: 3.635 ± 0.028
1.294LysCys: 1.294 ± 0.023
3.644LysAsp: 3.644 ± 0.029
4.877LysGlu: 4.877 ± 0.043
2.081LysPhe: 2.081 ± 0.015
2.676LysGly: 2.676 ± 0.023
1.614LysHis: 1.614 ± 0.015
3.661LysIle: 3.661 ± 0.025
5.486LysLys: 5.486 ± 0.056
5.659LysLeu: 5.659 ± 0.035
1.484LysMet: 1.484 ± 0.011
3.274LysAsn: 3.274 ± 0.025
3.608LysPro: 3.608 ± 0.048
2.568LysGln: 2.568 ± 0.024
3.713LysArg: 3.713 ± 0.027
4.769LysSer: 4.769 ± 0.036
3.69LysThr: 3.69 ± 0.03
3.728LysVal: 3.728 ± 0.022
0.664LysTrp: 0.664 ± 0.013
2.212LysTyr: 2.212 ± 0.019
0.002LysXaa: 0.002 ± 0.0
Leu
6.528LeuAla: 6.528 ± 0.039
1.876LeuCys: 1.876 ± 0.021
4.829LeuAsp: 4.829 ± 0.028
5.937LeuGlu: 5.937 ± 0.037
2.979LeuPhe: 2.979 ± 0.025
4.372LeuGly: 4.372 ± 0.029
2.346LeuHis: 2.346 ± 0.018
4.126LeuIle: 4.126 ± 0.027
5.882LeuLys: 5.882 ± 0.028
8.611LeuLeu: 8.611 ± 0.057
1.9LeuMet: 1.9 ± 0.016
4.156LeuAsn: 4.156 ± 0.026
4.771LeuPro: 4.771 ± 0.026
4.308LeuGln: 4.308 ± 0.027
5.515LeuArg: 5.515 ± 0.029
6.831LeuSer: 6.831 ± 0.033
4.929LeuThr: 4.929 ± 0.021
5.323LeuVal: 5.323 ± 0.026
0.918LeuTrp: 0.918 ± 0.013
2.666LeuTyr: 2.666 ± 0.018
0.003LeuXaa: 0.003 ± 0.0
Met
1.564MetAla: 1.564 ± 0.014
0.436MetCys: 0.436 ± 0.009
1.226MetAsp: 1.226 ± 0.01
1.525MetGlu: 1.525 ± 0.015
0.876MetPhe: 0.876 ± 0.01
1.143MetGly: 1.143 ± 0.015
0.542MetHis: 0.542 ± 0.008
1.0MetIle: 1.0 ± 0.01
1.417MetLys: 1.417 ± 0.013
1.931MetLeu: 1.931 ± 0.019
0.602MetMet: 0.602 ± 0.008
0.994MetAsn: 0.994 ± 0.01
1.118MetPro: 1.118 ± 0.012
0.934MetGln: 0.934 ± 0.011
1.251MetArg: 1.251 ± 0.013
1.798MetSer: 1.798 ± 0.013
1.208MetThr: 1.208 ± 0.011
1.25MetVal: 1.25 ± 0.013
0.25MetTrp: 0.25 ± 0.005
0.7MetTyr: 0.7 ± 0.009
0.001MetXaa: 0.001 ± 0.0
Asn
2.825AsnAla: 2.825 ± 0.02
0.879AsnCys: 0.879 ± 0.014
2.646AsnAsp: 2.646 ± 0.021
3.169AsnGlu: 3.169 ± 0.022
1.781AsnPhe: 1.781 ± 0.016
2.803AsnGly: 2.803 ± 0.019
1.074AsnHis: 1.074 ± 0.011
3.305AsnIle: 3.305 ± 0.026
3.359AsnLys: 3.359 ± 0.028
4.169AsnLeu: 4.169 ± 0.027
1.202AsnMet: 1.202 ± 0.012
3.063AsnAsn: 3.063 ± 0.026
2.147AsnPro: 2.147 ± 0.025
1.789AsnGln: 1.789 ± 0.019
2.22AsnArg: 2.22 ± 0.018
3.633AsnSer: 3.633 ± 0.028
2.852AsnThr: 2.852 ± 0.019
3.234AsnVal: 3.234 ± 0.022
0.476AsnTrp: 0.476 ± 0.007
1.69AsnTyr: 1.69 ± 0.015
0.001AsnXaa: 0.001 ± 0.0
Pro
4.393ProAla: 4.393 ± 0.032
0.794ProCys: 0.794 ± 0.039
2.962ProAsp: 2.962 ± 0.019
3.712ProGlu: 3.712 ± 0.033
1.667ProPhe: 1.667 ± 0.018
3.053ProGly: 3.053 ± 0.06
1.474ProHis: 1.474 ± 0.014
2.505ProIle: 2.505 ± 0.022
3.168ProLys: 3.168 ± 0.026
4.372ProLeu: 4.372 ± 0.027
0.981ProMet: 0.981 ± 0.012
2.324ProAsn: 2.324 ± 0.022
5.32ProPro: 5.32 ± 0.075
2.308ProGln: 2.308 ± 0.021
3.073ProArg: 3.073 ± 0.028
4.375ProSer: 4.375 ± 0.057
3.314ProThr: 3.314 ± 0.034
3.557ProVal: 3.557 ± 0.028
0.511ProTrp: 0.511 ± 0.007
1.702ProTyr: 1.702 ± 0.017
0.003ProXaa: 0.003 ± 0.001
Gln
2.595GlnAla: 2.595 ± 0.022
0.805GlnCys: 0.805 ± 0.016
1.849GlnAsp: 1.849 ± 0.014
2.59GlnGlu: 2.59 ± 0.021
1.356GlnPhe: 1.356 ± 0.011
1.834GlnGly: 1.834 ± 0.021
1.171GlnHis: 1.171 ± 0.012
2.079GlnIle: 2.079 ± 0.016
2.447GlnLys: 2.447 ± 0.02
3.789GlnLeu: 3.789 ± 0.027
0.963GlnMet: 0.963 ± 0.013
2.164GlnAsn: 2.164 ± 0.022
2.32GlnPro: 2.32 ± 0.03
2.509GlnGln: 2.509 ± 0.053
2.473GlnArg: 2.473 ± 0.02
2.81GlnSer: 2.81 ± 0.023
2.186GlnThr: 2.186 ± 0.018
2.202GlnVal: 2.202 ± 0.018
0.476GlnTrp: 0.476 ± 0.007
1.337GlnTyr: 1.337 ± 0.014
0.001GlnXaa: 0.001 ± 0.0
Arg
4.432ArgAla: 4.432 ± 0.032
1.152ArgCys: 1.152 ± 0.023
3.206ArgAsp: 3.206 ± 0.025
3.565ArgGlu: 3.565 ± 0.024
1.892ArgPhe: 1.892 ± 0.013
3.129ArgGly: 3.129 ± 0.026
1.638ArgHis: 1.638 ± 0.014
2.634ArgIle: 2.634 ± 0.02
3.602ArgLys: 3.602 ± 0.026
5.158ArgLeu: 5.158 ± 0.031
1.13ArgMet: 1.13 ± 0.01
2.581ArgAsn: 2.581 ± 0.017
3.114ArgPro: 3.114 ± 0.033
2.242ArgGln: 2.242 ± 0.02
4.784ArgArg: 4.784 ± 0.037
4.305ArgSer: 4.305 ± 0.034
2.97ArgThr: 2.97 ± 0.017
3.376ArgVal: 3.376 ± 0.026
0.693ArgTrp: 0.693 ± 0.01
1.694ArgTyr: 1.694 ± 0.015
0.003ArgXaa: 0.003 ± 0.001
Ser
5.248SerAla: 5.248 ± 0.031
1.481SerCys: 1.481 ± 0.03
4.529SerAsp: 4.529 ± 0.027
4.959SerGlu: 4.959 ± 0.033
2.563SerPhe: 2.563 ± 0.018
4.602SerGly: 4.602 ± 0.041
1.733SerHis: 1.733 ± 0.015
3.788SerIle: 3.788 ± 0.023
4.578SerLys: 4.578 ± 0.031
6.62SerLeu: 6.62 ± 0.032
1.607SerMet: 1.607 ± 0.016
3.65SerAsn: 3.65 ± 0.026
4.807SerPro: 4.807 ± 0.066
2.933SerGln: 2.933 ± 0.022
4.039SerArg: 4.039 ± 0.033
7.415SerSer: 7.415 ± 0.058
4.767SerThr: 4.767 ± 0.033
4.767SerVal: 4.767 ± 0.023
0.816SerTrp: 0.816 ± 0.01
2.186SerTyr: 2.186 ± 0.016
0.003SerXaa: 0.003 ± 0.001
Thr
3.944ThrAla: 3.944 ± 0.025
1.134ThrCys: 1.134 ± 0.02
3.152ThrAsp: 3.152 ± 0.023
3.784ThrGlu: 3.784 ± 0.029
2.111ThrPhe: 2.111 ± 0.015
3.298ThrGly: 3.298 ± 0.028
1.464ThrHis: 1.464 ± 0.017
3.099ThrIle: 3.099 ± 0.02
3.492ThrLys: 3.492 ± 0.03
5.181ThrLeu: 5.181 ± 0.028
1.145ThrMet: 1.145 ± 0.012
2.741ThrAsn: 2.741 ± 0.021
3.764ThrPro: 3.764 ± 0.029
2.159ThrGln: 2.159 ± 0.02
2.718ThrArg: 2.718 ± 0.019
4.754ThrSer: 4.754 ± 0.035
4.156ThrThr: 4.156 ± 0.078
3.998ThrVal: 3.998 ± 0.024
0.613ThrTrp: 0.613 ± 0.01
1.713ThrTyr: 1.713 ± 0.014
0.002ThrXaa: 0.002 ± 0.0
Val
4.558ValAla: 4.558 ± 0.028
1.485ValCys: 1.485 ± 0.022
3.508ValAsp: 3.508 ± 0.02
4.076ValGlu: 4.076 ± 0.034
2.3ValPhe: 2.3 ± 0.017
3.176ValGly: 3.176 ± 0.021
1.549ValHis: 1.549 ± 0.014
3.359ValIle: 3.359 ± 0.023
4.019ValLys: 4.019 ± 0.027
5.698ValLeu: 5.698 ± 0.032
1.357ValMet: 1.357 ± 0.012
2.994ValAsn: 2.994 ± 0.022
3.489ValPro: 3.489 ± 0.028
2.468ValGln: 2.468 ± 0.017
3.465ValArg: 3.465 ± 0.024
4.751ValSer: 4.751 ± 0.021
4.14ValThr: 4.14 ± 0.033
4.424ValVal: 4.424 ± 0.026
0.705ValTrp: 0.705 ± 0.009
1.964ValTyr: 1.964 ± 0.017
0.002ValXaa: 0.002 ± 0.0
Trp
0.667TrpAla: 0.667 ± 0.009
0.228TrpCys: 0.228 ± 0.004
0.609TrpAsp: 0.609 ± 0.009
0.65TrpGlu: 0.65 ± 0.01
0.435TrpPhe: 0.435 ± 0.007
0.58TrpGly: 0.58 ± 0.009
0.26TrpHis: 0.26 ± 0.005
0.546TrpIle: 0.546 ± 0.009
0.661TrpLys: 0.661 ± 0.01
1.134TrpLeu: 1.134 ± 0.013
0.274TrpMet: 0.274 ± 0.006
0.493TrpAsn: 0.493 ± 0.007
0.454TrpPro: 0.454 ± 0.007
0.463TrpGln: 0.463 ± 0.007
0.815TrpArg: 0.815 ± 0.013
0.841TrpSer: 0.841 ± 0.01
0.601TrpThr: 0.601 ± 0.009
0.636TrpVal: 0.636 ± 0.011
0.207TrpTrp: 0.207 ± 0.005
0.36TrpTyr: 0.36 ± 0.008
0.001TrpXaa: 0.001 ± 0.0
Tyr
1.893TyrAla: 1.893 ± 0.015
0.726TyrCys: 0.726 ± 0.01
1.842TyrAsp: 1.842 ± 0.015
1.985TyrGlu: 1.985 ± 0.016
1.291TyrPhe: 1.291 ± 0.013
1.899TyrGly: 1.899 ± 0.017
0.844TyrHis: 0.844 ± 0.009
1.693TyrIle: 1.693 ± 0.016
2.051TyrLys: 2.051 ± 0.019
2.736TyrLeu: 2.736 ± 0.018
0.71TyrMet: 0.71 ± 0.009
1.672TyrAsn: 1.672 ± 0.016
1.444TyrPro: 1.444 ± 0.016
1.229TyrGln: 1.229 ± 0.013
1.784TyrArg: 1.784 ± 0.014
2.366TyrSer: 2.366 ± 0.019
1.833TyrThr: 1.833 ± 0.015
1.976TyrVal: 1.976 ± 0.016
0.38TyrTrp: 0.38 ± 0.006
1.23TyrTyr: 1.23 ± 0.013
0.001TyrXaa: 0.001 ± 0.0
Xaa
0.004XaaAla: 0.004 ± 0.001
0.001XaaCys: 0.001 ± 0.0
0.002XaaAsp: 0.002 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.003XaaGly: 0.003 ± 0.001
0.001XaaHis: 0.001 ± 0.0
0.001XaaIle: 0.001 ± 0.0
0.002XaaLys: 0.002 ± 0.0
0.003XaaLeu: 0.003 ± 0.0
0.001XaaMet: 0.001 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.002XaaPro: 0.002 ± 0.0
0.002XaaGln: 0.002 ± 0.0
0.003XaaArg: 0.003 ± 0.001
0.002XaaSer: 0.002 ± 0.001
0.002XaaThr: 0.002 ± 0.0
0.002XaaVal: 0.002 ± 0.0
0.001XaaTrp: 0.001 ± 0.0
0.001XaaTyr: 0.001 ± 0.0
0.007XaaXaa: 0.007 ± 0.003
Statistics based on 19396 proteins (11249408 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
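A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the replicates is taken as the error. The function names, the toy sequences, and the use of the sample standard deviation as the error measure are assumptions; the note above does not specify these details.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)
    # Count each protein once; resampling then only recombines these counts.
    per_protein = [pair_counts(s) for s in sequences]
    totals = [sum(c.values()) for c in per_protein]
    n = len(sequences)
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins (not residues) with replacement.
        idx = [rng.randrange(n) for _ in range(n)]
        hits = sum(per_protein[i][pair] for i in idx)
        total = sum(totals[i] for i in idx)
        if total:
            estimates.append(1000.0 * hits / total)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy proteome.
toy = ["MKLLALLA", "AAGLL", "MLLK", "GAVLA"]
mean_ll, err_ll = bootstrap_error(toy, "LL")
print(f"LeuLeu: {mean_ll:.3f} ± {err_ll:.3f} per mille")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the error reflects variation between proteins.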

You can download the dipeptide statistics above (along with other statistics for this proteome) from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski