Amino acid dipepetide frequency for Heliothis virescens (Tobacco budworm moth)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.365AlaAla: 8.365 ± 0.094
1.59AlaCys: 1.59 ± 0.049
3.63AlaAsp: 3.63 ± 0.032
4.556AlaGlu: 4.556 ± 0.035
2.175AlaPhe: 2.175 ± 0.023
4.58AlaGly: 4.58 ± 0.058
1.963AlaHis: 1.963 ± 0.033
3.253AlaIle: 3.253 ± 0.027
3.777AlaLys: 3.777 ± 0.034
7.188AlaLeu: 7.188 ± 0.068
1.558AlaMet: 1.558 ± 0.017
2.631AlaAsn: 2.631 ± 0.029
4.866AlaPro: 4.866 ± 0.073
2.791AlaGln: 2.791 ± 0.029
4.755AlaArg: 4.755 ± 0.062
5.208AlaSer: 5.208 ± 0.042
4.109AlaThr: 4.109 ± 0.043
4.882AlaVal: 4.882 ± 0.047
0.79AlaTrp: 0.79 ± 0.018
1.864AlaTyr: 1.864 ± 0.025
0.004AlaXaa: 0.004 ± 0.001
Cys
1.6CysAla: 1.6 ± 0.034
0.531CysCys: 0.531 ± 0.018
1.199CysAsp: 1.199 ± 0.02
1.223CysGlu: 1.223 ± 0.026
0.744CysPhe: 0.744 ± 0.021
1.557CysGly: 1.557 ± 0.051
0.505CysHis: 0.505 ± 0.013
1.027CysIle: 1.027 ± 0.029
1.085CysLys: 1.085 ± 0.024
1.747CysLeu: 1.747 ± 0.031
0.4CysMet: 0.4 ± 0.008
0.877CysAsn: 0.877 ± 0.017
1.272CysPro: 1.272 ± 0.056
0.753CysGln: 0.753 ± 0.023
1.346CysArg: 1.346 ± 0.046
1.665CysSer: 1.665 ± 0.044
1.213CysThr: 1.213 ± 0.033
1.509CysVal: 1.509 ± 0.04
0.244CysTrp: 0.244 ± 0.008
0.662CysTyr: 0.662 ± 0.027
0.001CysXaa: 0.001 ± 0.0
Asp
3.739AspAla: 3.739 ± 0.034
1.074AspCys: 1.074 ± 0.025
4.12AspAsp: 4.12 ± 0.054
4.405AspGlu: 4.405 ± 0.087
2.041AspPhe: 2.041 ± 0.023
3.254AspGly: 3.254 ± 0.026
1.167AspHis: 1.167 ± 0.014
3.278AspIle: 3.278 ± 0.025
3.646AspLys: 3.646 ± 0.067
4.564AspLeu: 4.564 ± 0.031
1.315AspMet: 1.315 ± 0.014
2.724AspAsn: 2.724 ± 0.035
2.714AspPro: 2.714 ± 0.053
1.667AspGln: 1.667 ± 0.017
2.72AspArg: 2.72 ± 0.036
4.273AspSer: 4.273 ± 0.04
3.294AspThr: 3.294 ± 0.038
3.772AspVal: 3.772 ± 0.028
0.615AspTrp: 0.615 ± 0.013
1.844AspTyr: 1.844 ± 0.023
0.001AspXaa: 0.001 ± 0.0
Glu
4.438GluAla: 4.438 ± 0.039
1.354GluCys: 1.354 ± 0.052
4.014GluAsp: 4.014 ± 0.039
5.761GluGlu: 5.761 ± 0.068
1.981GluPhe: 1.981 ± 0.019
3.154GluGly: 3.154 ± 0.036
1.579GluHis: 1.579 ± 0.024
3.489GluIle: 3.489 ± 0.033
5.023GluLys: 5.023 ± 0.109
5.73GluLeu: 5.73 ± 0.042
1.54GluMet: 1.54 ± 0.017
3.334GluAsn: 3.334 ± 0.03
3.382GluPro: 3.382 ± 0.042
2.62GluGln: 2.62 ± 0.031
4.116GluArg: 4.116 ± 0.055
4.343GluSer: 4.343 ± 0.042
3.638GluThr: 3.638 ± 0.038
4.008GluVal: 4.008 ± 0.037
0.686GluTrp: 0.686 ± 0.016
1.988GluTyr: 1.988 ± 0.026
0.003GluXaa: 0.003 ± 0.001
Phe
2.128PheAla: 2.128 ± 0.023
0.718PheCys: 0.718 ± 0.014
1.96PheAsp: 1.96 ± 0.021
1.979PheGlu: 1.979 ± 0.024
1.262PhePhe: 1.262 ± 0.017
2.148PheGly: 2.148 ± 0.026
0.816PheHis: 0.816 ± 0.014
1.869PheIle: 1.869 ± 0.02
1.949PheLys: 1.949 ± 0.019
2.973PheLeu: 2.973 ± 0.031
0.792PheMet: 0.792 ± 0.011
1.66PheAsn: 1.66 ± 0.018
1.55PhePro: 1.55 ± 0.026
1.234PheGln: 1.234 ± 0.017
1.727PheArg: 1.727 ± 0.017
2.529PheSer: 2.529 ± 0.025
1.974PheThr: 1.974 ± 0.021
2.242PheVal: 2.242 ± 0.023
0.392PheTrp: 0.392 ± 0.008
1.214PheTyr: 1.214 ± 0.015
0.001PheXaa: 0.001 ± 0.0
Gly
4.517GlyAla: 4.517 ± 0.06
1.075GlyCys: 1.075 ± 0.023
3.149GlyAsp: 3.149 ± 0.035
3.321GlyGlu: 3.321 ± 0.038
2.012GlyPhe: 2.012 ± 0.023
4.907GlyGly: 4.907 ± 0.091
1.4GlyHis: 1.4 ± 0.02
2.617GlyIle: 2.617 ± 0.027
3.078GlyLys: 3.078 ± 0.027
4.46GlyLeu: 4.46 ± 0.043
1.173GlyMet: 1.173 ± 0.017
2.373GlyAsn: 2.373 ± 0.03
2.763GlyPro: 2.763 ± 0.052
2.165GlyGln: 2.165 ± 0.043
3.334GlyArg: 3.334 ± 0.041
4.641GlySer: 4.641 ± 0.061
3.18GlyThr: 3.18 ± 0.032
3.776GlyVal: 3.776 ± 0.036
0.712GlyTrp: 0.712 ± 0.015
2.022GlyTyr: 2.022 ± 0.031
0.004GlyXaa: 0.004 ± 0.001
His
1.958HisAla: 1.958 ± 0.04
0.655HisCys: 0.655 ± 0.021
1.285HisAsp: 1.285 ± 0.023
1.409HisGlu: 1.409 ± 0.02
0.876HisPhe: 0.876 ± 0.012
1.374HisGly: 1.374 ± 0.022
1.148HisHis: 1.148 ± 0.06
1.304HisIle: 1.304 ± 0.018
1.396HisLys: 1.396 ± 0.018
2.242HisLeu: 2.242 ± 0.027
0.613HisMet: 0.613 ± 0.013
1.104HisAsn: 1.104 ± 0.015
1.356HisPro: 1.356 ± 0.022
1.015HisGln: 1.015 ± 0.025
1.471HisArg: 1.471 ± 0.018
1.928HisSer: 1.928 ± 0.021
1.551HisThr: 1.551 ± 0.03
1.704HisVal: 1.704 ± 0.032
0.289HisTrp: 0.289 ± 0.007
0.923HisTyr: 0.923 ± 0.016
0.001HisXaa: 0.001 ± 0.0
Ile
3.373IleAla: 3.373 ± 0.027
1.129IleCys: 1.129 ± 0.026
2.949IleAsp: 2.949 ± 0.026
3.359IleGlu: 3.359 ± 0.031
1.842IlePhe: 1.842 ± 0.023
2.515IleGly: 2.515 ± 0.029
1.199IleHis: 1.199 ± 0.019
2.822IleIle: 2.822 ± 0.027
3.44IleLys: 3.44 ± 0.044
4.335IleLeu: 4.335 ± 0.036
1.093IleMet: 1.093 ± 0.013
2.494IleAsn: 2.494 ± 0.026
2.697IlePro: 2.697 ± 0.028
1.99IleGln: 1.99 ± 0.024
2.448IleArg: 2.448 ± 0.024
3.786IleSer: 3.786 ± 0.029
3.145IleThr: 3.145 ± 0.032
3.307IleVal: 3.307 ± 0.029
0.473IleTrp: 0.473 ± 0.01
1.555IleTyr: 1.555 ± 0.02
0.002IleXaa: 0.002 ± 0.0
Lys
3.694LysAla: 3.694 ± 0.051
1.234LysCys: 1.234 ± 0.029
3.487LysAsp: 3.487 ± 0.039
4.765LysGlu: 4.765 ± 0.062
1.912LysPhe: 1.912 ± 0.02
2.633LysGly: 2.633 ± 0.03
1.527LysHis: 1.527 ± 0.02
3.363LysIle: 3.363 ± 0.031
5.219LysLys: 5.219 ± 0.059
5.406LysLeu: 5.406 ± 0.047
1.446LysMet: 1.446 ± 0.018
2.926LysAsn: 2.926 ± 0.032
3.673LysPro: 3.673 ± 0.073
2.505LysGln: 2.505 ± 0.029
3.556LysArg: 3.556 ± 0.033
4.378LysSer: 4.378 ± 0.043
3.616LysThr: 3.616 ± 0.047
3.691LysVal: 3.691 ± 0.037
0.594LysTrp: 0.594 ± 0.013
2.051LysTyr: 2.051 ± 0.02
0.003LysXaa: 0.003 ± 0.001
Leu
6.789LeuAla: 6.789 ± 0.058
1.841LeuCys: 1.841 ± 0.03
4.796LeuAsp: 4.796 ± 0.044
5.71LeuGlu: 5.71 ± 0.049
2.908LeuPhe: 2.908 ± 0.031
4.518LeuGly: 4.518 ± 0.044
2.38LeuHis: 2.38 ± 0.026
3.929LeuIle: 3.929 ± 0.031
5.556LeuLys: 5.556 ± 0.047
8.418LeuLeu: 8.418 ± 0.061
1.942LeuMet: 1.942 ± 0.024
3.863LeuAsn: 3.863 ± 0.033
4.998LeuPro: 4.998 ± 0.049
4.114LeuGln: 4.114 ± 0.036
5.374LeuArg: 5.374 ± 0.041
6.566LeuSer: 6.566 ± 0.046
4.889LeuThr: 4.889 ± 0.043
5.413LeuVal: 5.413 ± 0.043
0.876LeuTrp: 0.876 ± 0.014
2.562LeuTyr: 2.562 ± 0.026
0.003LeuXaa: 0.003 ± 0.001
Met
1.651MetAla: 1.651 ± 0.018
0.454MetCys: 0.454 ± 0.012
1.239MetAsp: 1.239 ± 0.015
1.52MetGlu: 1.52 ± 0.017
0.884MetPhe: 0.884 ± 0.012
1.127MetGly: 1.127 ± 0.022
0.528MetHis: 0.528 ± 0.011
0.976MetIle: 0.976 ± 0.013
1.446MetLys: 1.446 ± 0.016
1.953MetLeu: 1.953 ± 0.022
0.63MetMet: 0.63 ± 0.011
0.996MetAsn: 0.996 ± 0.013
1.179MetPro: 1.179 ± 0.019
0.951MetGln: 0.951 ± 0.015
1.232MetArg: 1.232 ± 0.016
1.771MetSer: 1.771 ± 0.018
1.214MetThr: 1.214 ± 0.014
1.274MetVal: 1.274 ± 0.017
0.251MetTrp: 0.251 ± 0.005
0.737MetTyr: 0.737 ± 0.012
0.0MetXaa: 0.0 ± 0.0
Asn
2.804AsnAla: 2.804 ± 0.032
0.881AsnCys: 0.881 ± 0.02
2.478AsnAsp: 2.478 ± 0.028
2.907AsnGlu: 2.907 ± 0.025
1.635AsnPhe: 1.635 ± 0.02
2.63AsnGly: 2.63 ± 0.027
0.972AsnHis: 0.972 ± 0.017
2.914AsnIle: 2.914 ± 0.028
3.053AsnLys: 3.053 ± 0.032
3.881AsnLeu: 3.881 ± 0.04
1.163AsnMet: 1.163 ± 0.018
2.711AsnAsn: 2.711 ± 0.03
2.216AsnPro: 2.216 ± 0.036
1.696AsnGln: 1.696 ± 0.023
2.083AsnArg: 2.083 ± 0.026
3.407AsnSer: 3.407 ± 0.034
2.761AsnThr: 2.761 ± 0.033
3.122AsnVal: 3.122 ± 0.029
0.454AsnTrp: 0.454 ± 0.011
1.565AsnTyr: 1.565 ± 0.019
0.001AsnXaa: 0.001 ± 0.0
Pro
4.808ProAla: 4.808 ± 0.059
0.982ProCys: 0.982 ± 0.052
3.218ProAsp: 3.218 ± 0.06
3.982ProGlu: 3.982 ± 0.041
1.57ProPhe: 1.57 ± 0.018
3.371ProGly: 3.371 ± 0.07
1.6ProHis: 1.6 ± 0.031
2.548ProIle: 2.548 ± 0.045
3.215ProLys: 3.215 ± 0.055
4.374ProLeu: 4.374 ± 0.038
1.033ProMet: 1.033 ± 0.016
2.306ProAsn: 2.306 ± 0.03
5.657ProPro: 5.657 ± 0.093
2.36ProGln: 2.36 ± 0.027
3.509ProArg: 3.509 ± 0.095
4.695ProSer: 4.695 ± 0.114
3.782ProThr: 3.782 ± 0.097
3.88ProVal: 3.88 ± 0.051
0.504ProTrp: 0.504 ± 0.009
1.623ProTyr: 1.623 ± 0.02
0.003ProXaa: 0.003 ± 0.001
Gln
2.811GlnAla: 2.811 ± 0.029
0.856GlnCys: 0.856 ± 0.023
1.837GlnAsp: 1.837 ± 0.021
2.533GlnGlu: 2.533 ± 0.025
1.264GlnPhe: 1.264 ± 0.017
1.925GlnGly: 1.925 ± 0.031
1.202GlnHis: 1.202 ± 0.023
1.966GlnIle: 1.966 ± 0.018
2.343GlnLys: 2.343 ± 0.024
3.77GlnLeu: 3.77 ± 0.035
1.01GlnMet: 1.01 ± 0.018
1.961GlnAsn: 1.961 ± 0.023
2.46GlnPro: 2.46 ± 0.036
2.558GlnGln: 2.558 ± 0.056
2.452GlnArg: 2.452 ± 0.033
2.728GlnSer: 2.728 ± 0.03
2.119GlnThr: 2.119 ± 0.025
2.342GlnVal: 2.342 ± 0.025
0.455GlnTrp: 0.455 ± 0.013
1.429GlnTyr: 1.429 ± 0.03
0.002GlnXaa: 0.002 ± 0.0
Arg
4.847ArgAla: 4.847 ± 0.075
1.448ArgCys: 1.448 ± 0.077
3.147ArgAsp: 3.147 ± 0.035
3.537ArgGlu: 3.537 ± 0.046
1.726ArgPhe: 1.726 ± 0.019
3.28ArgGly: 3.28 ± 0.038
1.709ArgHis: 1.709 ± 0.029
2.532ArgIle: 2.532 ± 0.027
3.525ArgLys: 3.525 ± 0.036
5.196ArgLeu: 5.196 ± 0.05
1.148ArgMet: 1.148 ± 0.014
2.423ArgAsn: 2.423 ± 0.027
3.274ArgPro: 3.274 ± 0.043
2.307ArgGln: 2.307 ± 0.025
5.082ArgArg: 5.082 ± 0.071
4.223ArgSer: 4.223 ± 0.045
3.057ArgThr: 3.057 ± 0.036
3.493ArgVal: 3.493 ± 0.039
0.661ArgTrp: 0.661 ± 0.01
1.784ArgTyr: 1.784 ± 0.027
0.003ArgXaa: 0.003 ± 0.001
Ser
5.319SerAla: 5.319 ± 0.045
1.519SerCys: 1.519 ± 0.044
4.506SerAsp: 4.506 ± 0.037
4.724SerGlu: 4.724 ± 0.035
2.4SerPhe: 2.4 ± 0.025
4.673SerGly: 4.673 ± 0.042
1.735SerHis: 1.735 ± 0.023
3.545SerIle: 3.545 ± 0.033
4.29SerLys: 4.29 ± 0.038
6.381SerLeu: 6.381 ± 0.045
1.565SerMet: 1.565 ± 0.017
3.482SerAsn: 3.482 ± 0.033
5.064SerPro: 5.064 ± 0.119
2.874SerGln: 2.874 ± 0.035
4.156SerArg: 4.156 ± 0.04
7.456SerSer: 7.456 ± 0.069
4.893SerThr: 4.893 ± 0.059
4.775SerVal: 4.775 ± 0.038
0.777SerTrp: 0.777 ± 0.011
2.157SerTyr: 2.157 ± 0.027
0.003SerXaa: 0.003 ± 0.001
Thr
4.282ThrAla: 4.282 ± 0.045
1.28ThrCys: 1.28 ± 0.029
3.326ThrAsp: 3.326 ± 0.037
3.827ThrGlu: 3.827 ± 0.053
2.019ThrPhe: 2.019 ± 0.022
3.229ThrGly: 3.229 ± 0.034
1.446ThrHis: 1.446 ± 0.025
3.021ThrIle: 3.021 ± 0.029
3.31ThrLys: 3.31 ± 0.035
5.134ThrLeu: 5.134 ± 0.035
1.2ThrMet: 1.2 ± 0.014
2.549ThrAsn: 2.549 ± 0.026
4.105ThrPro: 4.105 ± 0.057
2.249ThrGln: 2.249 ± 0.04
2.94ThrArg: 2.94 ± 0.037
4.801ThrSer: 4.801 ± 0.047
4.792ThrThr: 4.792 ± 0.213
4.09ThrVal: 4.09 ± 0.038
0.621ThrTrp: 0.621 ± 0.013
1.728ThrTyr: 1.728 ± 0.024
0.001ThrXaa: 0.001 ± 0.0
Val
4.826ValAla: 4.826 ± 0.044
1.543ValCys: 1.543 ± 0.031
3.498ValAsp: 3.498 ± 0.032
4.109ValGlu: 4.109 ± 0.034
2.225ValPhe: 2.225 ± 0.025
3.294ValGly: 3.294 ± 0.036
1.616ValHis: 1.616 ± 0.026
3.278ValIle: 3.278 ± 0.033
3.885ValLys: 3.885 ± 0.064
5.887ValLeu: 5.887 ± 0.049
1.384ValMet: 1.384 ± 0.018
2.873ValAsn: 2.873 ± 0.028
3.734ValPro: 3.734 ± 0.048
2.521ValGln: 2.521 ± 0.026
3.538ValArg: 3.538 ± 0.037
4.786ValSer: 4.786 ± 0.034
4.214ValThr: 4.214 ± 0.039
4.475ValVal: 4.475 ± 0.039
0.731ValTrp: 0.731 ± 0.014
1.933ValTyr: 1.933 ± 0.02
0.001ValXaa: 0.001 ± 0.0
Trp
0.687TrpAla: 0.687 ± 0.012
0.252TrpCys: 0.252 ± 0.008
0.57TrpAsp: 0.57 ± 0.01
0.624TrpGlu: 0.624 ± 0.012
0.401TrpPhe: 0.401 ± 0.008
0.6TrpGly: 0.6 ± 0.014
0.259TrpHis: 0.259 ± 0.007
0.532TrpIle: 0.532 ± 0.012
0.621TrpLys: 0.621 ± 0.012
1.077TrpLeu: 1.077 ± 0.018
0.263TrpMet: 0.263 ± 0.005
0.475TrpAsn: 0.475 ± 0.008
0.476TrpPro: 0.476 ± 0.011
0.428TrpGln: 0.428 ± 0.007
0.854TrpArg: 0.854 ± 0.022
0.803TrpSer: 0.803 ± 0.012
0.599TrpThr: 0.599 ± 0.012
0.62TrpVal: 0.62 ± 0.01
0.204TrpTrp: 0.204 ± 0.006
0.359TrpTyr: 0.359 ± 0.008
0.001TrpXaa: 0.001 ± 0.0
Tyr
1.895TyrAla: 1.895 ± 0.026
0.743TyrCys: 0.743 ± 0.012
1.816TyrAsp: 1.816 ± 0.021
1.927TyrGlu: 1.927 ± 0.022
1.238TyrPhe: 1.238 ± 0.017
1.91TyrGly: 1.91 ± 0.027
0.852TyrHis: 0.852 ± 0.017
1.686TyrIle: 1.686 ± 0.025
1.819TyrLys: 1.819 ± 0.019
2.725TyrLeu: 2.725 ± 0.026
0.745TyrMet: 0.745 ± 0.012
1.573TyrAsn: 1.573 ± 0.018
1.579TyrPro: 1.579 ± 0.035
1.222TyrGln: 1.222 ± 0.017
1.706TyrArg: 1.706 ± 0.017
2.303TyrSer: 2.303 ± 0.026
1.91TyrThr: 1.91 ± 0.026
1.977TyrVal: 1.977 ± 0.021
0.372TyrTrp: 0.372 ± 0.008
1.219TyrTyr: 1.219 ± 0.017
0.001TyrXaa: 0.001 ± 0.0
Xaa
0.003XaaAla: 0.003 ± 0.001
0.001XaaCys: 0.001 ± 0.0
0.001XaaAsp: 0.001 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.002XaaPhe: 0.002 ± 0.0
0.002XaaGly: 0.002 ± 0.0
0.001XaaHis: 0.001 ± 0.0
0.002XaaIle: 0.002 ± 0.001
0.003XaaLys: 0.003 ± 0.001
0.004XaaLeu: 0.004 ± 0.001
0.0XaaMet: 0.0 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.003XaaPro: 0.003 ± 0.001
0.001XaaGln: 0.001 ± 0.0
0.003XaaArg: 0.003 ± 0.0
0.003XaaSer: 0.003 ± 0.001
0.002XaaThr: 0.002 ± 0.0
0.002XaaVal: 0.002 ± 0.001
0.0XaaTrp: 0.0 ± 0.0
0.001XaaTyr: 0.001 ± 0.0
0.085XaaXaa: 0.085 ± 0.018
Statistics based on 17055 proteins (9040933 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski