Amino acid dipepetide frequency for Apis cerana cerana (Oriental honeybee)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.308AlaAla: 4.308 ± 0.049
1.076AlaCys: 1.076 ± 0.014
2.478AlaAsp: 2.478 ± 0.025
3.53AlaGlu: 3.53 ± 0.038
2.034AlaPhe: 2.034 ± 0.021
3.019AlaGly: 3.019 ± 0.032
1.215AlaHis: 1.215 ± 0.019
3.668AlaIle: 3.668 ± 0.031
3.443AlaLys: 3.443 ± 0.03
5.393AlaLeu: 5.393 ± 0.037
1.39AlaMet: 1.39 ± 0.017
2.534AlaAsn: 2.534 ± 0.022
2.311AlaPro: 2.311 ± 0.029
2.192AlaGln: 2.192 ± 0.023
2.949AlaArg: 2.949 ± 0.03
4.239AlaSer: 4.239 ± 0.036
3.474AlaThr: 3.474 ± 0.03
3.416AlaVal: 3.416 ± 0.029
0.585AlaTrp: 0.585 ± 0.012
1.557AlaTyr: 1.557 ± 0.016
0.0AlaXaa: 0.0 ± 0.0
Cys
1.027CysAla: 1.027 ± 0.016
0.474CysCys: 0.474 ± 0.012
1.097CysAsp: 1.097 ± 0.018
1.171CysGlu: 1.171 ± 0.022
0.74CysPhe: 0.74 ± 0.012
1.247CysGly: 1.247 ± 0.02
0.5CysHis: 0.5 ± 0.012
1.325CysIle: 1.325 ± 0.02
1.258CysLys: 1.258 ± 0.022
1.827CysLeu: 1.827 ± 0.023
0.414CysMet: 0.414 ± 0.01
1.095CysAsn: 1.095 ± 0.019
0.999CysPro: 0.999 ± 0.023
0.772CysGln: 0.772 ± 0.016
1.022CysArg: 1.022 ± 0.017
1.574CysSer: 1.574 ± 0.025
1.138CysThr: 1.138 ± 0.016
1.1CysVal: 1.1 ± 0.018
0.226CysTrp: 0.226 ± 0.006
0.641CysTyr: 0.641 ± 0.011
0.0CysXaa: 0.0 ± 0.0
Asp
2.637AspAla: 2.637 ± 0.024
0.963AspCys: 0.963 ± 0.016
3.358AspAsp: 3.358 ± 0.038
4.132AspGlu: 4.132 ± 0.039
2.075AspPhe: 2.075 ± 0.02
2.808AspGly: 2.808 ± 0.03
1.078AspHis: 1.078 ± 0.017
4.143AspIle: 4.143 ± 0.033
3.47AspLys: 3.47 ± 0.032
4.727AspLeu: 4.727 ± 0.031
1.2AspMet: 1.2 ± 0.017
2.925AspAsn: 2.925 ± 0.029
2.287AspPro: 2.287 ± 0.023
1.752AspGln: 1.752 ± 0.02
2.536AspArg: 2.536 ± 0.033
4.154AspSer: 4.154 ± 0.037
2.937AspThr: 2.937 ± 0.026
3.228AspVal: 3.228 ± 0.027
0.604AspTrp: 0.604 ± 0.011
1.838AspTyr: 1.838 ± 0.02
0.0AspXaa: 0.0 ± 0.0
Glu
3.715GluAla: 3.715 ± 0.035
1.27GluCys: 1.27 ± 0.028
4.212GluAsp: 4.212 ± 0.034
7.091GluGlu: 7.091 ± 0.1
2.28GluPhe: 2.28 ± 0.022
2.926GluGly: 2.926 ± 0.031
1.487GluHis: 1.487 ± 0.017
4.972GluIle: 4.972 ± 0.045
6.091GluLys: 6.091 ± 0.062
6.05GluLeu: 6.05 ± 0.052
1.647GluMet: 1.647 ± 0.021
4.768GluAsn: 4.768 ± 0.045
2.36GluPro: 2.36 ± 0.029
2.928GluGln: 2.928 ± 0.032
3.779GluArg: 3.779 ± 0.047
4.786GluSer: 4.786 ± 0.042
3.943GluThr: 3.943 ± 0.032
3.511GluVal: 3.511 ± 0.028
0.709GluTrp: 0.709 ± 0.013
2.194GluTyr: 2.194 ± 0.022
0.0GluXaa: 0.0 ± 0.0
Phe
1.928PheAla: 1.928 ± 0.019
0.793PheCys: 0.793 ± 0.014
2.048PheAsp: 2.048 ± 0.019
2.273PheGlu: 2.273 ± 0.023
1.492PhePhe: 1.492 ± 0.02
2.162PheGly: 2.162 ± 0.027
0.973PheHis: 0.973 ± 0.014
2.396PheIle: 2.396 ± 0.028
2.294PheLys: 2.294 ± 0.026
3.653PheLeu: 3.653 ± 0.029
0.823PheMet: 0.823 ± 0.011
1.977PheAsn: 1.977 ± 0.02
1.594PhePro: 1.594 ± 0.018
1.469PheGln: 1.469 ± 0.015
1.765PheArg: 1.765 ± 0.021
2.876PheSer: 2.876 ± 0.023
2.076PheThr: 2.076 ± 0.021
2.233PheVal: 2.233 ± 0.023
0.424PheTrp: 0.424 ± 0.011
1.348PheTyr: 1.348 ± 0.017
0.0PheXaa: 0.0 ± 0.0
Gly
2.758GlyAla: 2.758 ± 0.036
0.974GlyCys: 0.974 ± 0.018
2.562GlyAsp: 2.562 ± 0.027
3.026GlyGlu: 3.026 ± 0.03
2.048GlyPhe: 2.048 ± 0.025
3.862GlyGly: 3.862 ± 0.058
1.277GlyHis: 1.277 ± 0.019
3.513GlyIle: 3.513 ± 0.03
3.417GlyLys: 3.417 ± 0.03
4.279GlyLeu: 4.279 ± 0.035
1.144GlyMet: 1.144 ± 0.016
2.759GlyAsn: 2.759 ± 0.026
2.188GlyPro: 2.188 ± 0.042
1.97GlyGln: 1.97 ± 0.029
2.751GlyArg: 2.751 ± 0.029
4.296GlySer: 4.296 ± 0.042
3.109GlyThr: 3.109 ± 0.03
2.845GlyVal: 2.845 ± 0.03
0.622GlyTrp: 0.622 ± 0.013
1.877GlyTyr: 1.877 ± 0.027
0.0GlyXaa: 0.0 ± 0.0
His
1.236HisAla: 1.236 ± 0.016
0.573HisCys: 0.573 ± 0.011
1.06HisAsp: 1.06 ± 0.014
1.405HisGlu: 1.405 ± 0.017
0.941HisPhe: 0.941 ± 0.014
1.298HisGly: 1.298 ± 0.016
1.025HisHis: 1.025 ± 0.031
1.582HisIle: 1.582 ± 0.018
1.426HisLys: 1.426 ± 0.014
2.308HisLeu: 2.308 ± 0.023
0.585HisMet: 0.585 ± 0.011
1.244HisAsn: 1.244 ± 0.018
1.27HisPro: 1.27 ± 0.019
1.125HisGln: 1.125 ± 0.018
1.4HisArg: 1.4 ± 0.019
1.887HisSer: 1.887 ± 0.023
1.326HisThr: 1.326 ± 0.017
1.401HisVal: 1.401 ± 0.015
0.295HisTrp: 0.295 ± 0.008
0.863HisTyr: 0.863 ± 0.014
0.0HisXaa: 0.0 ± 0.0
Ile
3.768IleAla: 3.768 ± 0.029
1.466IleCys: 1.466 ± 0.024
3.762IleAsp: 3.762 ± 0.032
4.828IleGlu: 4.828 ± 0.05
2.692IlePhe: 2.692 ± 0.031
3.255IleGly: 3.255 ± 0.032
1.613IleHis: 1.613 ± 0.016
4.679IleIle: 4.679 ± 0.047
4.772IleLys: 4.772 ± 0.042
6.456IleLeu: 6.456 ± 0.052
1.41IleMet: 1.41 ± 0.018
3.95IleAsn: 3.95 ± 0.045
3.28IlePro: 3.28 ± 0.027
2.844IleGln: 2.844 ± 0.027
3.051IleArg: 3.051 ± 0.024
5.439IleSer: 5.439 ± 0.039
3.923IleThr: 3.923 ± 0.033
3.857IleVal: 3.857 ± 0.032
0.677IleTrp: 0.677 ± 0.013
2.16IleTyr: 2.16 ± 0.025
0.0IleXaa: 0.0 ± 0.0
Lys
3.178LysAla: 3.178 ± 0.03
1.324LysCys: 1.324 ± 0.021
3.799LysAsp: 3.799 ± 0.036
5.803LysGlu: 5.803 ± 0.062
2.375LysPhe: 2.375 ± 0.025
2.669LysGly: 2.669 ± 0.028
1.633LysHis: 1.633 ± 0.021
5.157LysIle: 5.157 ± 0.051
6.395LysLys: 6.395 ± 0.077
6.33LysLeu: 6.33 ± 0.048
1.646LysMet: 1.646 ± 0.019
4.52LysAsn: 4.52 ± 0.048
2.768LysPro: 2.768 ± 0.026
3.028LysGln: 3.028 ± 0.028
3.883LysArg: 3.883 ± 0.034
5.004LysSer: 5.004 ± 0.04
3.779LysThr: 3.779 ± 0.03
3.475LysVal: 3.475 ± 0.026
0.714LysTrp: 0.714 ± 0.011
2.516LysTyr: 2.516 ± 0.027
0.0LysXaa: 0.0 ± 0.0
Leu
5.431LeuAla: 5.431 ± 0.038
1.816LeuCys: 1.816 ± 0.024
4.735LeuAsp: 4.735 ± 0.031
6.516LeuGlu: 6.516 ± 0.047
3.209LeuPhe: 3.209 ± 0.032
4.284LeuGly: 4.284 ± 0.031
2.401LeuHis: 2.401 ± 0.023
5.458LeuIle: 5.458 ± 0.046
6.663LeuLys: 6.663 ± 0.057
8.813LeuLeu: 8.813 ± 0.068
2.024LeuMet: 2.024 ± 0.021
4.924LeuAsn: 4.924 ± 0.043
4.536LeuPro: 4.536 ± 0.037
4.535LeuGln: 4.535 ± 0.037
5.056LeuArg: 5.056 ± 0.042
7.122LeuSer: 7.122 ± 0.046
5.072LeuThr: 5.072 ± 0.037
4.628LeuVal: 4.628 ± 0.037
0.943LeuTrp: 0.943 ± 0.015
2.911LeuTyr: 2.911 ± 0.028
0.0LeuXaa: 0.0 ± 0.0
Met
1.48MetAla: 1.48 ± 0.018
0.423MetCys: 0.423 ± 0.01
1.323MetAsp: 1.323 ± 0.015
1.78MetGlu: 1.78 ± 0.019
0.838MetPhe: 0.838 ± 0.013
1.089MetGly: 1.089 ± 0.016
0.52MetHis: 0.52 ± 0.009
1.412MetIle: 1.412 ± 0.018
1.711MetLys: 1.711 ± 0.021
2.009MetLeu: 2.009 ± 0.021
0.592MetMet: 0.592 ± 0.011
1.258MetAsn: 1.258 ± 0.016
1.014MetPro: 1.014 ± 0.014
1.05MetGln: 1.05 ± 0.017
1.092MetArg: 1.092 ± 0.014
1.721MetSer: 1.721 ± 0.021
1.263MetThr: 1.263 ± 0.016
1.178MetVal: 1.178 ± 0.014
0.25MetTrp: 0.25 ± 0.006
0.722MetTyr: 0.722 ± 0.012
0.0MetXaa: 0.0 ± 0.0
Asn
2.779AsnAla: 2.779 ± 0.024
1.011AsnCys: 1.011 ± 0.02
3.077AsnAsp: 3.077 ± 0.033
4.115AsnGlu: 4.115 ± 0.04
2.149AsnPhe: 2.149 ± 0.023
2.937AsnGly: 2.937 ± 0.03
1.241AsnHis: 1.241 ± 0.014
4.875AsnIle: 4.875 ± 0.054
4.025AsnLys: 4.025 ± 0.042
5.079AsnLeu: 5.079 ± 0.039
1.312AsnMet: 1.312 ± 0.019
4.331AsnAsn: 4.331 ± 0.05
2.282AsnPro: 2.282 ± 0.023
2.212AsnGln: 2.212 ± 0.028
2.463AsnArg: 2.463 ± 0.02
4.587AsnSer: 4.587 ± 0.04
3.297AsnThr: 3.297 ± 0.031
3.506AsnVal: 3.506 ± 0.029
0.57AsnTrp: 0.57 ± 0.011
1.923AsnTyr: 1.923 ± 0.024
0.0AsnXaa: 0.0 ± 0.0
Pro
2.538ProAla: 2.538 ± 0.033
0.796ProCys: 0.796 ± 0.021
2.301ProAsp: 2.301 ± 0.024
3.096ProGlu: 3.096 ± 0.034
1.573ProPhe: 1.573 ± 0.018
2.587ProGly: 2.587 ± 0.062
1.125ProHis: 1.125 ± 0.015
2.957ProIle: 2.957 ± 0.026
2.678ProLys: 2.678 ± 0.028
4.003ProLeu: 4.003 ± 0.033
0.946ProMet: 0.946 ± 0.014
2.313ProAsn: 2.313 ± 0.025
3.881ProPro: 3.881 ± 0.075
1.999ProGln: 1.999 ± 0.03
2.402ProArg: 2.402 ± 0.032
3.997ProSer: 3.997 ± 0.04
2.907ProThr: 2.907 ± 0.03
2.837ProVal: 2.837 ± 0.033
0.496ProTrp: 0.496 ± 0.011
1.495ProTyr: 1.495 ± 0.018
0.0ProXaa: 0.0 ± 0.0
Gln
2.247GlnAla: 2.247 ± 0.024
0.822GlnCys: 0.822 ± 0.017
1.978GlnAsp: 1.978 ± 0.018
3.141GlnGlu: 3.141 ± 0.031
1.44GlnPhe: 1.44 ± 0.018
1.815GlnGly: 1.815 ± 0.024
1.183GlnHis: 1.183 ± 0.019
2.751GlnIle: 2.751 ± 0.027
2.927GlnLys: 2.927 ± 0.033
3.991GlnLeu: 3.991 ± 0.038
0.999GlnMet: 0.999 ± 0.018
2.68GlnAsn: 2.68 ± 0.036
1.904GlnPro: 1.904 ± 0.029
3.625GlnGln: 3.625 ± 0.092
2.348GlnArg: 2.348 ± 0.028
3.113GlnSer: 3.113 ± 0.03
2.438GlnThr: 2.438 ± 0.026
2.172GlnVal: 2.172 ± 0.024
0.47GlnTrp: 0.47 ± 0.009
1.395GlnTyr: 1.395 ± 0.02
0.0GlnXaa: 0.0 ± 0.0
Arg
2.737ArgAla: 2.737 ± 0.025
1.021ArgCys: 1.021 ± 0.02
2.742ArgAsp: 2.742 ± 0.035
3.542ArgGlu: 3.542 ± 0.038
1.845ArgPhe: 1.845 ± 0.019
2.668ArgGly: 2.668 ± 0.032
1.421ArgHis: 1.421 ± 0.016
3.252ArgIle: 3.252 ± 0.028
3.948ArgLys: 3.948 ± 0.033
4.703ArgLeu: 4.703 ± 0.038
1.161ArgMet: 1.161 ± 0.014
2.919ArgAsn: 2.919 ± 0.026
2.23ArgPro: 2.23 ± 0.024
2.221ArgGln: 2.221 ± 0.023
3.797ArgArg: 3.797 ± 0.048
4.046ArgSer: 4.046 ± 0.046
2.671ArgThr: 2.671 ± 0.025
2.659ArgVal: 2.659 ± 0.026
0.56ArgTrp: 0.56 ± 0.012
1.758ArgTyr: 1.758 ± 0.019
0.0ArgXaa: 0.0 ± 0.0
Ser
3.988SerAla: 3.988 ± 0.039
1.477SerCys: 1.477 ± 0.02
4.117SerAsp: 4.117 ± 0.032
4.973SerGlu: 4.973 ± 0.041
2.736SerPhe: 2.736 ± 0.026
4.425SerGly: 4.425 ± 0.043
1.801SerHis: 1.801 ± 0.021
5.052SerIle: 5.052 ± 0.04
5.111SerLys: 5.111 ± 0.042
7.033SerLeu: 7.033 ± 0.045
1.797SerMet: 1.797 ± 0.018
4.8SerAsn: 4.8 ± 0.047
4.18SerPro: 4.18 ± 0.051
3.282SerGln: 3.282 ± 0.036
3.989SerArg: 3.989 ± 0.047
8.401SerSer: 8.401 ± 0.077
5.295SerThr: 5.295 ± 0.046
4.351SerVal: 4.351 ± 0.032
0.831SerTrp: 0.831 ± 0.011
2.339SerTyr: 2.339 ± 0.024
0.0SerXaa: 0.0 ± 0.0
Thr
3.332ThrAla: 3.332 ± 0.031
1.185ThrCys: 1.185 ± 0.024
2.894ThrAsp: 2.894 ± 0.027
3.858ThrGlu: 3.858 ± 0.037
2.193ThrPhe: 2.193 ± 0.023
3.148ThrGly: 3.148 ± 0.026
1.232ThrHis: 1.232 ± 0.013
4.103ThrIle: 4.103 ± 0.034
3.746ThrLys: 3.746 ± 0.031
5.29ThrLeu: 5.29 ± 0.036
1.343ThrMet: 1.343 ± 0.019
3.324ThrAsn: 3.324 ± 0.032
3.095ThrPro: 3.095 ± 0.034
2.151ThrGln: 2.151 ± 0.026
2.625ThrArg: 2.625 ± 0.024
5.204ThrSer: 5.204 ± 0.048
4.247ThrThr: 4.247 ± 0.057
3.554ThrVal: 3.554 ± 0.028
0.624ThrTrp: 0.624 ± 0.013
1.77ThrTyr: 1.77 ± 0.018
0.0ThrXaa: 0.0 ± 0.0
Val
3.498ValAla: 3.498 ± 0.029
1.19ValCys: 1.19 ± 0.019
2.949ValAsp: 2.949 ± 0.028
3.733ValGlu: 3.733 ± 0.035
2.073ValPhe: 2.073 ± 0.02
2.768ValGly: 2.768 ± 0.033
1.383ValHis: 1.383 ± 0.017
3.636ValIle: 3.636 ± 0.033
3.622ValLys: 3.622 ± 0.032
5.055ValLeu: 5.055 ± 0.034
1.255ValMet: 1.255 ± 0.016
2.861ValAsn: 2.861 ± 0.025
2.891ValPro: 2.891 ± 0.032
2.446ValGln: 2.446 ± 0.024
2.673ValArg: 2.673 ± 0.027
4.338ValSer: 4.338 ± 0.035
3.59ValThr: 3.59 ± 0.028
3.396ValVal: 3.396 ± 0.032
0.624ValTrp: 0.624 ± 0.012
1.739ValTyr: 1.739 ± 0.021
0.0ValXaa: 0.0 ± 0.0
Trp
0.537TrpAla: 0.537 ± 0.011
0.226TrpCys: 0.226 ± 0.007
0.572TrpAsp: 0.572 ± 0.011
0.634TrpGlu: 0.634 ± 0.01
0.451TrpPhe: 0.451 ± 0.011
0.526TrpGly: 0.526 ± 0.01
0.272TrpHis: 0.272 ± 0.007
0.778TrpIle: 0.778 ± 0.014
0.81TrpLys: 0.81 ± 0.015
1.071TrpLeu: 1.071 ± 0.018
0.279TrpMet: 0.279 ± 0.008
0.671TrpAsn: 0.671 ± 0.011
0.409TrpPro: 0.409 ± 0.01
0.462TrpGln: 0.462 ± 0.01
0.609TrpArg: 0.609 ± 0.01
0.8TrpSer: 0.8 ± 0.014
0.606TrpThr: 0.606 ± 0.011
0.512TrpVal: 0.512 ± 0.011
0.179TrpTrp: 0.179 ± 0.007
0.38TrpTyr: 0.38 ± 0.008
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.688TyrAla: 1.688 ± 0.019
0.767TyrCys: 0.767 ± 0.014
1.788TyrAsp: 1.788 ± 0.021
2.076TyrGlu: 2.076 ± 0.021
1.417TyrPhe: 1.417 ± 0.019
1.786TyrGly: 1.786 ± 0.022
0.87TyrHis: 0.87 ± 0.014
2.171TyrIle: 2.171 ± 0.029
2.187TyrLys: 2.187 ± 0.024
3.008TyrLeu: 3.008 ± 0.028
0.757TyrMet: 0.757 ± 0.012
1.916TyrAsn: 1.916 ± 0.022
1.447TyrPro: 1.447 ± 0.024
1.373TyrGln: 1.373 ± 0.019
1.711TyrArg: 1.711 ± 0.018
2.393TyrSer: 2.393 ± 0.025
1.83TyrThr: 1.83 ± 0.02
1.86TyrVal: 1.86 ± 0.022
0.379TyrTrp: 0.379 ± 0.01
1.309TyrTyr: 1.309 ± 0.018
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9931 proteins (5621110 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski