Amino acid dipeptide frequency for Photinus pyralis (Common eastern firefly) (Lampyris pyralis)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
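The counting itself can be sketched in a few lines of Python. This is only an illustration, assuming one-letter sequence codes and overlapping two-residue windows counted within each protein; the function name `dipeptide_frequencies` and the toy sequences are hypothetical and not taken from Proteome-pI.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_frequencies(sequences, per=1000.0):
    """Return overlapping dipeptide frequencies scaled by `per` (1000 -> per mille)."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues is mapped to 'X' (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: per * n / total for pair, n in counts.items()}


# Number of possible dipeptides and the uniform expectation.
n_pairs = len(list(product(STANDARD, repeat=2)))  # 20 * 20 = 400
uniform_per_mille = 1000.0 / n_pairs              # 2.5 per mille, i.e. 0.25 %

# Toy input: "MKLLA" gives MK, KL, LL, LA and "LLK" gives LL, LK (6 dipeptides).
freqs = dipeptide_frequencies(["MKLLA", "LLK"])
```

Windows are never taken across protein boundaries here, so a protein of length n contributes n - 1 dipeptides.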

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
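As a small worked example, the snippet below converts per-mille values to fractions and compares them with the uniform expectation of 1/400. It is a sketch that assumes the red/blue marking uses exactly this uniform expectation as its threshold, which the page implies but does not state; the helper names are hypothetical. The three values are taken from the table.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a dipeptide relative to the (assumed) uniform expectation."""
    if value_per_mille > expected:
        return "over"      # would be marked red
    if value_per_mille < expected:
        return "under"     # would be marked blue
    return "expected"


# Values copied from the table (per mille).
examples = {"LeuLeu": 9.502, "TrpTrp": 0.177, "AlaAla": 4.198}
labels = {name: classify(v) for name, v in examples.items()}
fractions = {name: per_mille_to_fraction(v) for name, v in examples.items()}
```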

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰, grouped by first residue; each entry gives the dipeptide, its frequency and the bootstrap error.
Second-residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 4.198 ± 0.037
AlaCys: 1.168 ± 0.023
AlaAsp: 2.902 ± 0.022
AlaGlu: 3.558 ± 0.031
AlaPhe: 2.433 ± 0.018
AlaGly: 2.938 ± 0.028
AlaHis: 1.388 ± 0.017
AlaIle: 3.784 ± 0.027
AlaLys: 3.661 ± 0.029
AlaLeu: 5.735 ± 0.041
AlaMet: 1.334 ± 0.015
AlaAsn: 2.777 ± 0.022
AlaPro: 2.618 ± 0.03
AlaGln: 2.221 ± 0.023
AlaArg: 2.723 ± 0.022
AlaSer: 4.316 ± 0.026
AlaThr: 3.563 ± 0.028
AlaVal: 4.086 ± 0.031
AlaTrp: 0.583 ± 0.01
AlaTyr: 1.726 ± 0.018
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.184 ± 0.016
CysCys: 0.518 ± 0.012
CysAsp: 1.313 ± 0.02
CysGlu: 1.275 ± 0.018
CysPhe: 0.863 ± 0.011
CysGly: 1.382 ± 0.035
CysHis: 0.542 ± 0.011
CysIle: 1.305 ± 0.025
CysLys: 1.349 ± 0.019
CysLeu: 2.035 ± 0.028
CysMet: 0.438 ± 0.009
CysAsn: 1.167 ± 0.018
CysPro: 1.004 ± 0.032
CysGln: 0.777 ± 0.019
CysArg: 1.031 ± 0.032
CysSer: 1.685 ± 0.031
CysThr: 1.239 ± 0.02
CysVal: 1.424 ± 0.028
CysTrp: 0.204 ± 0.004
CysTyr: 0.656 ± 0.01
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.981 ± 0.024
AspCys: 1.142 ± 0.02
AspAsp: 3.52 ± 0.027
AspGlu: 4.03 ± 0.035
AspPhe: 2.29 ± 0.019
AspGly: 3.011 ± 0.03
AspHis: 1.224 ± 0.013
AspIle: 3.615 ± 0.027
AspLys: 3.17 ± 0.029
AspLeu: 5.111 ± 0.031
AspMet: 1.171 ± 0.013
AspAsn: 2.715 ± 0.024
AspPro: 2.372 ± 0.034
AspGln: 1.745 ± 0.015
AspArg: 2.399 ± 0.024
AspSer: 4.14 ± 0.029
AspThr: 2.784 ± 0.019
AspVal: 3.787 ± 0.025
AspTrp: 0.641 ± 0.011
AspTyr: 1.988 ± 0.017
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.608 ± 0.033
GluCys: 1.317 ± 0.039
GluAsp: 3.946 ± 0.031
GluGlu: 5.558 ± 0.089
GluPhe: 2.403 ± 0.02
GluGly: 2.956 ± 0.027
GluHis: 1.502 ± 0.017
GluIle: 4.223 ± 0.034
GluLys: 4.835 ± 0.043
GluLeu: 5.94 ± 0.04
GluMet: 1.597 ± 0.018
GluAsn: 3.878 ± 0.028
GluPro: 2.34 ± 0.025
GluGln: 2.514 ± 0.027
GluArg: 3.425 ± 0.03
GluSer: 4.377 ± 0.031
GluThr: 3.633 ± 0.036
GluVal: 4.043 ± 0.055
GluTrp: 0.689 ± 0.008
GluTyr: 2.018 ± 0.02
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.329 ± 0.024
PheCys: 0.91 ± 0.012
PheAsp: 2.292 ± 0.02
PheGlu: 2.417 ± 0.022
PhePhe: 1.717 ± 0.021
PheGly: 2.498 ± 0.026
PheHis: 1.051 ± 0.013
PheIle: 2.502 ± 0.023
PheLys: 2.562 ± 0.022
PheLeu: 3.986 ± 0.033
PheMet: 0.924 ± 0.012
PheAsn: 2.196 ± 0.018
PhePro: 1.861 ± 0.017
PheGln: 1.543 ± 0.015
PheArg: 1.973 ± 0.019
PheSer: 3.264 ± 0.022
PheThr: 2.375 ± 0.021
PheVal: 2.775 ± 0.022
PheTrp: 0.492 ± 0.009
PheTyr: 1.544 ± 0.018
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.117 ± 0.031
GlyCys: 1.08 ± 0.018
GlyAsp: 2.862 ± 0.024
GlyGlu: 3.094 ± 0.029
GlyPhe: 2.324 ± 0.024
GlyGly: 3.556 ± 0.042
GlyHis: 1.389 ± 0.015
GlyIle: 3.386 ± 0.026
GlyLys: 3.441 ± 0.029
GlyLeu: 4.638 ± 0.035
GlyMet: 1.14 ± 0.015
GlyAsn: 2.666 ± 0.024
GlyPro: 2.161 ± 0.039
GlyGln: 1.918 ± 0.036
GlyArg: 2.746 ± 0.024
GlySer: 4.172 ± 0.034
GlyThr: 3.021 ± 0.021
GlyVal: 3.48 ± 0.027
GlyTrp: 0.666 ± 0.011
GlyTyr: 2.04 ± 0.028
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.269 ± 0.015
HisCys: 0.646 ± 0.012
HisAsp: 1.115 ± 0.013
HisGlu: 1.317 ± 0.014
HisPhe: 1.229 ± 0.013
HisGly: 1.321 ± 0.017
HisHis: 0.836 ± 0.015
HisIle: 1.627 ± 0.016
HisLys: 1.491 ± 0.016
HisLeu: 2.658 ± 0.026
HisMet: 0.616 ± 0.011
HisAsn: 1.364 ± 0.014
HisPro: 1.298 ± 0.014
HisGln: 0.995 ± 0.015
HisArg: 1.291 ± 0.015
HisSer: 2.052 ± 0.019
HisThr: 1.415 ± 0.016
HisVal: 1.576 ± 0.017
HisTrp: 0.304 ± 0.007
HisTyr: 0.956 ± 0.014
HisXaa: 0.0 ± 0.0

Ile
IleAla: 3.753 ± 0.028
IleCys: 1.403 ± 0.019
IleAsp: 3.304 ± 0.023
IleGlu: 3.749 ± 0.03
IlePhe: 2.614 ± 0.026
IleGly: 3.126 ± 0.026
IleHis: 1.478 ± 0.016
IleIle: 3.785 ± 0.029
IleLys: 3.993 ± 0.033
IleLeu: 5.835 ± 0.033
IleMet: 1.303 ± 0.015
IleAsn: 3.302 ± 0.022
IlePro: 3.18 ± 0.021
IleGln: 2.418 ± 0.021
IleArg: 2.992 ± 0.021
IleSer: 4.851 ± 0.03
IleThr: 3.651 ± 0.026
IleVal: 3.985 ± 0.029
IleTrp: 0.612 ± 0.011
IleTyr: 1.955 ± 0.02
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.482 ± 0.024
LysCys: 1.521 ± 0.022
LysAsp: 3.381 ± 0.026
LysGlu: 4.648 ± 0.043
LysPhe: 2.507 ± 0.021
LysGly: 3.044 ± 0.03
LysHis: 1.756 ± 0.018
LysIle: 4.089 ± 0.032
LysLys: 5.186 ± 0.063
LysLeu: 6.376 ± 0.041
LysMet: 1.616 ± 0.017
LysAsn: 3.439 ± 0.028
LysPro: 3.157 ± 0.064
LysGln: 2.771 ± 0.023
LysArg: 3.92 ± 0.038
LysSer: 4.863 ± 0.036
LysThr: 3.713 ± 0.029
LysVal: 4.048 ± 0.03
LysTrp: 0.769 ± 0.012
LysTyr: 2.381 ± 0.019
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.618 ± 0.039
LeuCys: 1.966 ± 0.018
LeuAsp: 4.922 ± 0.03
LeuGlu: 6.203 ± 0.044
LeuPhe: 3.636 ± 0.032
LeuGly: 4.706 ± 0.037
LeuHis: 2.722 ± 0.031
LeuIle: 5.248 ± 0.035
LeuLys: 6.809 ± 0.046
LeuLeu: 9.502 ± 0.063
LeuMet: 2.067 ± 0.02
LeuAsn: 4.942 ± 0.03
LeuPro: 4.915 ± 0.029
LeuGln: 4.73 ± 0.037
LeuArg: 5.102 ± 0.036
LeuSer: 7.382 ± 0.043
LeuThr: 5.321 ± 0.027
LeuVal: 5.345 ± 0.034
LeuTrp: 0.963 ± 0.012
LeuTyr: 2.999 ± 0.025
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.487 ± 0.017
MetCys: 0.484 ± 0.009
MetAsp: 1.259 ± 0.013
MetGlu: 1.585 ± 0.015
MetPhe: 0.926 ± 0.012
MetGly: 1.207 ± 0.015
MetHis: 0.54 ± 0.009
MetIle: 1.114 ± 0.014
MetLys: 1.509 ± 0.016
MetLeu: 2.031 ± 0.019
MetMet: 0.537 ± 0.009
MetAsn: 1.051 ± 0.012
MetPro: 0.987 ± 0.012
MetGln: 0.963 ± 0.014
MetArg: 1.099 ± 0.014
MetSer: 1.693 ± 0.017
MetThr: 1.134 ± 0.011
MetVal: 1.379 ± 0.015
MetTrp: 0.265 ± 0.006
MetTyr: 0.807 ± 0.011
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.979 ± 0.025
AsnCys: 1.205 ± 0.017
AsnAsp: 2.654 ± 0.024
AsnGlu: 3.298 ± 0.026
AsnPhe: 2.321 ± 0.024
AsnGly: 2.993 ± 0.031
AsnHis: 1.211 ± 0.015
AsnIle: 3.517 ± 0.024
AsnLys: 3.339 ± 0.026
AsnLeu: 5.214 ± 0.04
AsnMet: 1.204 ± 0.014
AsnAsn: 3.076 ± 0.031
AsnPro: 2.371 ± 0.026
AsnGln: 1.89 ± 0.017
AsnArg: 2.505 ± 0.021
AsnSer: 4.168 ± 0.029
AsnThr: 2.799 ± 0.024
AsnVal: 3.802 ± 0.025
AsnTrp: 0.556 ± 0.009
AsnTyr: 2.008 ± 0.018
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.595 ± 0.024
ProCys: 0.817 ± 0.049
ProAsp: 2.453 ± 0.021
ProGlu: 3.217 ± 0.052
ProPhe: 1.877 ± 0.021
ProGly: 2.584 ± 0.07
ProHis: 1.233 ± 0.014
ProIle: 2.865 ± 0.021
ProLys: 3.138 ± 0.039
ProLeu: 4.279 ± 0.028
ProMet: 0.89 ± 0.013
ProAsn: 2.558 ± 0.026
ProPro: 3.753 ± 0.051
ProGln: 2.034 ± 0.027
ProArg: 2.186 ± 0.024
ProSer: 4.121 ± 0.04
ProThr: 3.052 ± 0.028
ProVal: 3.051 ± 0.027
ProTrp: 0.48 ± 0.009
ProTyr: 1.6 ± 0.019
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.203 ± 0.02
GlnCys: 0.893 ± 0.019
GlnAsp: 1.807 ± 0.018
GlnGlu: 2.622 ± 0.029
GlnPhe: 1.658 ± 0.014
GlnGly: 1.759 ± 0.02
GlnHis: 1.128 ± 0.013
GlnIle: 2.468 ± 0.02
GlnLys: 2.642 ± 0.023
GlnLeu: 4.073 ± 0.032
GlnMet: 0.964 ± 0.013
GlnAsn: 2.236 ± 0.02
GlnPro: 1.943 ± 0.039
GlnGln: 2.332 ± 0.033
GlnArg: 2.214 ± 0.021
GlnSer: 2.883 ± 0.022
GlnThr: 2.252 ± 0.023
GlnVal: 2.359 ± 0.02
GlnTrp: 0.47 ± 0.009
GlnTyr: 1.37 ± 0.017
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.682 ± 0.022
ArgCys: 1.037 ± 0.02
ArgAsp: 2.633 ± 0.023
ArgGlu: 3.175 ± 0.024
ArgPhe: 2.099 ± 0.02
ArgGly: 2.554 ± 0.03
ArgHis: 1.384 ± 0.016
ArgIle: 2.945 ± 0.023
ArgLys: 3.993 ± 0.035
ArgLeu: 4.675 ± 0.03
ArgMet: 1.102 ± 0.014
ArgAsn: 2.86 ± 0.025
ArgPro: 2.217 ± 0.026
ArgGln: 2.132 ± 0.019
ArgArg: 3.515 ± 0.031
ArgSer: 3.82 ± 0.04
ArgThr: 2.691 ± 0.021
ArgVal: 2.806 ± 0.022
ArgTrp: 0.578 ± 0.009
ArgTyr: 1.742 ± 0.017
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.393 ± 0.027
SerCys: 1.538 ± 0.033
SerAsp: 4.481 ± 0.03
SerGlu: 4.777 ± 0.029
SerPhe: 3.063 ± 0.024
SerGly: 4.358 ± 0.031
SerHis: 1.901 ± 0.021
SerIle: 4.444 ± 0.029
SerLys: 4.938 ± 0.033
SerLeu: 7.133 ± 0.042
SerMet: 1.556 ± 0.016
SerAsn: 4.135 ± 0.028
SerPro: 4.134 ± 0.051
SerGln: 2.921 ± 0.024
SerArg: 3.686 ± 0.037
SerSer: 7.6 ± 0.061
SerThr: 4.932 ± 0.039
SerVal: 4.903 ± 0.027
SerTrp: 0.798 ± 0.011
SerTyr: 2.45 ± 0.022
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.495 ± 0.027
ThrCys: 1.258 ± 0.029
ThrAsp: 3.027 ± 0.022
ThrGlu: 3.58 ± 0.042
ThrPhe: 2.464 ± 0.022
ThrGly: 3.061 ± 0.026
ThrHis: 1.388 ± 0.017
ThrIle: 3.659 ± 0.026
ThrLys: 3.64 ± 0.033
ThrLeu: 5.389 ± 0.032
ThrMet: 1.142 ± 0.014
ThrAsn: 2.943 ± 0.021
ThrPro: 3.265 ± 0.036
ThrGln: 2.092 ± 0.025
ThrArg: 2.484 ± 0.025
ThrSer: 4.816 ± 0.031
ThrThr: 3.846 ± 0.053
ThrVal: 3.985 ± 0.04
ThrTrp: 0.6 ± 0.011
ThrTyr: 1.831 ± 0.016
ThrXaa: 0.0 ± 0.0

Val
ValAla: 3.991 ± 0.031
ValCys: 1.458 ± 0.021
ValAsp: 3.552 ± 0.027
ValGlu: 4.098 ± 0.044
ValPhe: 2.606 ± 0.024
ValGly: 3.261 ± 0.027
ValHis: 1.567 ± 0.017
ValIle: 3.979 ± 0.026
ValLys: 4.035 ± 0.035
ValLeu: 6.035 ± 0.041
ValMet: 1.373 ± 0.016
ValAsn: 3.273 ± 0.027
ValPro: 3.278 ± 0.031
ValGln: 2.551 ± 0.021
ValArg: 2.975 ± 0.021
ValSer: 4.667 ± 0.026
ValThr: 4.063 ± 0.05
ValVal: 4.329 ± 0.035
ValTrp: 0.717 ± 0.011
ValTyr: 2.074 ± 0.019
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.54 ± 0.01
TrpCys: 0.236 ± 0.005
TrpAsp: 0.586 ± 0.011
TrpGlu: 0.611 ± 0.008
TrpPhe: 0.498 ± 0.007
TrpGly: 0.57 ± 0.012
TrpHis: 0.254 ± 0.007
TrpIle: 0.718 ± 0.012
TrpLys: 0.832 ± 0.013
TrpLeu: 1.116 ± 0.014
TrpMet: 0.295 ± 0.007
TrpAsn: 0.642 ± 0.009
TrpPro: 0.433 ± 0.008
TrpGln: 0.424 ± 0.008
TrpArg: 0.638 ± 0.01
TrpSer: 0.81 ± 0.011
TrpThr: 0.593 ± 0.011
TrpVal: 0.618 ± 0.009
TrpTrp: 0.177 ± 0.005
TrpTyr: 0.377 ± 0.008
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.808 ± 0.02
TyrCys: 0.793 ± 0.011
TyrAsp: 1.827 ± 0.017
TyrGlu: 1.987 ± 0.015
TyrPhe: 1.685 ± 0.018
TyrGly: 2.012 ± 0.021
TyrHis: 0.885 ± 0.012
TyrIle: 2.016 ± 0.019
TyrLys: 2.103 ± 0.016
TyrLeu: 3.337 ± 0.027
TyrMet: 0.786 ± 0.011
TyrAsn: 1.89 ± 0.02
TyrPro: 1.497 ± 0.018
TyrGln: 1.306 ± 0.014
TyrArg: 1.737 ± 0.017
TyrSer: 2.459 ± 0.02
TyrThr: 1.888 ± 0.019
TyrVal: 2.101 ± 0.018
TyrTrp: 0.407 ± 0.009
TyrTyr: 1.353 ± 0.017
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.009 ± 0.006

Statistics based on 14952 proteins (7462682 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
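Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. This is a minimal illustration and not the Proteome-pI code; in particular, reporting the standard deviation of the replicates as the error is an assumption, and the function names, seed and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides in a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of the per-mille frequency of `pair`
    over `n_boot` bootstrap replicates drawn at the protein level."""
    rng = random.Random(seed)
    # Count once per protein so each replicate only re-sums cached counters.
    per_protein = [dipeptide_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        if total:
            estimates.append(1000.0 * hits / total)
    # Assumption: the reported error is the standard deviation of the replicates.
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy proteome; real input would be the full set of protein sequences.
proteins = ["MKLLA", "LLK", "AAAA", "MLLLK"]
mean_ll, err_ll = bootstrap_error(proteins, "LL")
```

Resampling proteins rather than individual dipeptides keeps residues from the same protein together, so the error reflects variation between proteins.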

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski