Amino acid dipepetide frequency for Operophtera brumata (winter moth)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.92AlaAla: 6.92 ± 0.053
1.531AlaCys: 1.531 ± 0.043
3.532AlaAsp: 3.532 ± 0.026
4.234AlaGlu: 4.234 ± 0.034
2.311AlaPhe: 2.311 ± 0.022
4.305AlaGly: 4.305 ± 0.042
1.827AlaHis: 1.827 ± 0.022
3.398AlaIle: 3.398 ± 0.03
3.697AlaLys: 3.697 ± 0.036
6.891AlaLeu: 6.891 ± 0.051
1.596AlaMet: 1.596 ± 0.02
2.589AlaAsn: 2.589 ± 0.022
3.971AlaPro: 3.971 ± 0.04
2.568AlaGln: 2.568 ± 0.024
4.42AlaArg: 4.42 ± 0.044
5.202AlaSer: 5.202 ± 0.038
3.927AlaThr: 3.927 ± 0.033
4.725AlaVal: 4.725 ± 0.035
0.751AlaTrp: 0.751 ± 0.015
1.901AlaTyr: 1.901 ± 0.02
0.005AlaXaa: 0.005 ± 0.001
Cys
1.509CysAla: 1.509 ± 0.026
0.509CysCys: 0.509 ± 0.014
1.244CysAsp: 1.244 ± 0.023
1.237CysGlu: 1.237 ± 0.024
0.743CysPhe: 0.743 ± 0.019
1.49CysGly: 1.49 ± 0.047
0.541CysHis: 0.541 ± 0.015
1.099CysIle: 1.099 ± 0.031
1.189CysLys: 1.189 ± 0.024
1.829CysLeu: 1.829 ± 0.028
0.45CysMet: 0.45 ± 0.011
0.946CysAsn: 0.946 ± 0.022
1.127CysPro: 1.127 ± 0.045
0.718CysGln: 0.718 ± 0.022
1.266CysArg: 1.266 ± 0.047
1.702CysSer: 1.702 ± 0.043
1.217CysThr: 1.217 ± 0.034
1.434CysVal: 1.434 ± 0.034
0.271CysTrp: 0.271 ± 0.008
0.641CysTyr: 0.641 ± 0.015
0.001CysXaa: 0.001 ± 0.0
Asp
3.511AspAla: 3.511 ± 0.029
1.061AspCys: 1.061 ± 0.024
3.716AspAsp: 3.716 ± 0.041
4.092AspGlu: 4.092 ± 0.038
2.108AspPhe: 2.108 ± 0.024
3.176AspGly: 3.176 ± 0.028
1.209AspHis: 1.209 ± 0.016
3.433AspIle: 3.433 ± 0.029
3.503AspLys: 3.503 ± 0.033
4.872AspLeu: 4.872 ± 0.039
1.334AspMet: 1.334 ± 0.016
2.646AspAsn: 2.646 ± 0.029
2.534AspPro: 2.534 ± 0.04
1.599AspGln: 1.599 ± 0.016
2.641AspArg: 2.641 ± 0.027
4.301AspSer: 4.301 ± 0.034
3.097AspThr: 3.097 ± 0.026
3.743AspVal: 3.743 ± 0.028
0.629AspTrp: 0.629 ± 0.012
1.854AspTyr: 1.854 ± 0.021
0.001AspXaa: 0.001 ± 0.0
Glu
4.216GluAla: 4.216 ± 0.039
1.317GluCys: 1.317 ± 0.049
3.793GluAsp: 3.793 ± 0.038
5.297GluGlu: 5.297 ± 0.057
2.091GluPhe: 2.091 ± 0.02
3.082GluGly: 3.082 ± 0.027
1.581GluHis: 1.581 ± 0.024
3.697GluIle: 3.697 ± 0.034
4.585GluLys: 4.585 ± 0.052
5.91GluLeu: 5.91 ± 0.043
1.537GluMet: 1.537 ± 0.019
3.411GluAsn: 3.411 ± 0.03
2.968GluPro: 2.968 ± 0.031
2.499GluGln: 2.499 ± 0.024
3.954GluArg: 3.954 ± 0.038
4.408GluSer: 4.408 ± 0.027
3.779GluThr: 3.779 ± 0.04
3.859GluVal: 3.859 ± 0.035
0.691GluTrp: 0.691 ± 0.013
2.01GluTyr: 2.01 ± 0.019
0.003GluXaa: 0.003 ± 0.001
Phe
2.184PheAla: 2.184 ± 0.024
0.772PheCys: 0.772 ± 0.013
2.062PheAsp: 2.062 ± 0.023
2.126PheGlu: 2.126 ± 0.022
1.367PhePhe: 1.367 ± 0.019
2.207PheGly: 2.207 ± 0.021
0.922PheHis: 0.922 ± 0.014
2.104PheIle: 2.104 ± 0.022
2.211PheLys: 2.211 ± 0.022
3.277PheLeu: 3.277 ± 0.031
0.869PheMet: 0.869 ± 0.013
1.794PheAsn: 1.794 ± 0.021
1.541PhePro: 1.541 ± 0.019
1.339PheGln: 1.339 ± 0.016
1.862PheArg: 1.862 ± 0.024
2.814PheSer: 2.814 ± 0.024
2.16PheThr: 2.16 ± 0.021
2.35PheVal: 2.35 ± 0.027
0.417PheTrp: 0.417 ± 0.01
1.36PheTyr: 1.36 ± 0.019
0.0PheXaa: 0.0 ± 0.0
Gly
4.308GlyAla: 4.308 ± 0.042
1.074GlyCys: 1.074 ± 0.02
3.045GlyAsp: 3.045 ± 0.029
3.217GlyGlu: 3.217 ± 0.032
2.087GlyPhe: 2.087 ± 0.025
4.104GlyGly: 4.104 ± 0.051
1.478GlyHis: 1.478 ± 0.021
2.835GlyIle: 2.835 ± 0.029
3.288GlyLys: 3.288 ± 0.028
4.619GlyLeu: 4.619 ± 0.037
1.221GlyMet: 1.221 ± 0.019
2.357GlyAsn: 2.357 ± 0.027
2.474GlyPro: 2.474 ± 0.041
1.967GlyGln: 1.967 ± 0.027
3.279GlyArg: 3.279 ± 0.033
4.338GlySer: 4.338 ± 0.04
3.114GlyThr: 3.114 ± 0.03
3.785GlyVal: 3.785 ± 0.03
0.73GlyTrp: 0.73 ± 0.014
1.929GlyTyr: 1.929 ± 0.027
0.003GlyXaa: 0.003 ± 0.001
His
1.799HisAla: 1.799 ± 0.025
0.62HisCys: 0.62 ± 0.013
1.245HisAsp: 1.245 ± 0.015
1.418HisGlu: 1.418 ± 0.019
0.979HisPhe: 0.979 ± 0.013
1.466HisGly: 1.466 ± 0.021
0.972HisHis: 0.972 ± 0.022
1.449HisIle: 1.449 ± 0.017
1.439HisLys: 1.439 ± 0.018
2.476HisLeu: 2.476 ± 0.027
0.692HisMet: 0.692 ± 0.018
1.222HisAsn: 1.222 ± 0.019
1.353HisPro: 1.353 ± 0.022
1.01HisGln: 1.01 ± 0.018
1.586HisArg: 1.586 ± 0.021
2.032HisSer: 2.032 ± 0.023
1.541HisThr: 1.541 ± 0.02
1.658HisVal: 1.658 ± 0.021
0.335HisTrp: 0.335 ± 0.009
0.945HisTyr: 0.945 ± 0.015
0.0HisXaa: 0.0 ± 0.0
Ile
3.483IleAla: 3.483 ± 0.027
1.188IleCys: 1.188 ± 0.028
3.153IleAsp: 3.153 ± 0.029
3.484IleGlu: 3.484 ± 0.032
2.023IlePhe: 2.023 ± 0.026
2.678IleGly: 2.678 ± 0.027
1.34IleHis: 1.34 ± 0.016
3.123IleIle: 3.123 ± 0.034
3.637IleLys: 3.637 ± 0.031
4.787IleLeu: 4.787 ± 0.04
1.177IleMet: 1.177 ± 0.016
2.754IleAsn: 2.754 ± 0.029
2.722IlePro: 2.722 ± 0.027
2.095IleGln: 2.095 ± 0.027
2.66IleArg: 2.66 ± 0.021
4.093IleSer: 4.093 ± 0.031
3.373IleThr: 3.373 ± 0.028
3.491IleVal: 3.491 ± 0.032
0.522IleTrp: 0.522 ± 0.015
1.704IleTyr: 1.704 ± 0.021
0.001IleXaa: 0.001 ± 0.0
Lys
3.611LysAla: 3.611 ± 0.034
1.308LysCys: 1.308 ± 0.031
3.395LysAsp: 3.395 ± 0.037
4.588LysGlu: 4.588 ± 0.052
2.138LysPhe: 2.138 ± 0.024
2.711LysGly: 2.711 ± 0.029
1.653LysHis: 1.653 ± 0.023
3.728LysIle: 3.728 ± 0.034
5.192LysLys: 5.192 ± 0.06
5.755LysLeu: 5.755 ± 0.041
1.535LysMet: 1.535 ± 0.015
3.211LysAsn: 3.211 ± 0.032
3.26LysPro: 3.26 ± 0.041
2.495LysGln: 2.495 ± 0.026
3.672LysArg: 3.672 ± 0.034
4.548LysSer: 4.548 ± 0.038
3.786LysThr: 3.786 ± 0.035
3.699LysVal: 3.699 ± 0.033
0.648LysTrp: 0.648 ± 0.012
2.201LysTyr: 2.201 ± 0.021
0.001LysXaa: 0.001 ± 0.001
Leu
6.67LeuAla: 6.67 ± 0.048
1.951LeuCys: 1.951 ± 0.021
4.865LeuAsp: 4.865 ± 0.037
5.783LeuGlu: 5.783 ± 0.046
3.165LeuPhe: 3.165 ± 0.036
4.675LeuGly: 4.675 ± 0.039
2.579LeuHis: 2.579 ± 0.029
4.258LeuIle: 4.258 ± 0.036
6.047LeuLys: 6.047 ± 0.049
8.773LeuLeu: 8.773 ± 0.066
2.047LeuMet: 2.047 ± 0.023
4.271LeuAsn: 4.271 ± 0.036
4.904LeuPro: 4.904 ± 0.035
4.195LeuGln: 4.195 ± 0.034
5.541LeuArg: 5.541 ± 0.045
7.005LeuSer: 7.005 ± 0.044
5.096LeuThr: 5.096 ± 0.04
5.496LeuVal: 5.496 ± 0.042
0.978LeuTrp: 0.978 ± 0.016
2.805LeuTyr: 2.805 ± 0.03
0.003LeuXaa: 0.003 ± 0.001
Met
1.703MetAla: 1.703 ± 0.018
0.487MetCys: 0.487 ± 0.012
1.283MetAsp: 1.283 ± 0.017
1.55MetGlu: 1.55 ± 0.015
0.944MetPhe: 0.944 ± 0.015
1.168MetGly: 1.168 ± 0.018
0.56MetHis: 0.56 ± 0.012
1.087MetIle: 1.087 ± 0.015
1.535MetLys: 1.535 ± 0.019
2.084MetLeu: 2.084 ± 0.024
0.648MetMet: 0.648 ± 0.014
1.078MetAsn: 1.078 ± 0.016
1.186MetPro: 1.186 ± 0.016
0.934MetGln: 0.934 ± 0.016
1.272MetArg: 1.272 ± 0.018
1.855MetSer: 1.855 ± 0.02
1.327MetThr: 1.327 ± 0.018
1.33MetVal: 1.33 ± 0.017
0.283MetTrp: 0.283 ± 0.008
0.74MetTyr: 0.74 ± 0.014
0.0MetXaa: 0.0 ± 0.0
Asn
2.776AsnAla: 2.776 ± 0.034
0.932AsnCys: 0.932 ± 0.018
2.479AsnAsp: 2.479 ± 0.025
3.084AsnGlu: 3.084 ± 0.033
1.783AsnPhe: 1.783 ± 0.019
2.662AsnGly: 2.662 ± 0.034
1.1AsnHis: 1.1 ± 0.021
3.224AsnIle: 3.224 ± 0.024
3.318AsnLys: 3.318 ± 0.034
4.183AsnLeu: 4.183 ± 0.038
1.208AsnMet: 1.208 ± 0.019
2.773AsnAsn: 2.773 ± 0.03
2.162AsnPro: 2.162 ± 0.037
1.677AsnGln: 1.677 ± 0.023
2.19AsnArg: 2.19 ± 0.024
3.522AsnSer: 3.522 ± 0.032
2.805AsnThr: 2.805 ± 0.03
3.127AsnVal: 3.127 ± 0.028
0.484AsnTrp: 0.484 ± 0.011
1.709AsnTyr: 1.709 ± 0.022
0.0AsnXaa: 0.0 ± 0.0
Pro
4.111ProAla: 4.111 ± 0.037
0.908ProCys: 0.908 ± 0.061
2.813ProAsp: 2.813 ± 0.024
3.574ProGlu: 3.574 ± 0.035
1.667ProPhe: 1.667 ± 0.024
2.977ProGly: 2.977 ± 0.046
1.387ProHis: 1.387 ± 0.02
2.46ProIle: 2.46 ± 0.026
3.035ProLys: 3.035 ± 0.03
4.332ProLeu: 4.332 ± 0.039
1.042ProMet: 1.042 ± 0.018
2.187ProAsn: 2.187 ± 0.031
4.314ProPro: 4.314 ± 0.055
2.155ProGln: 2.155 ± 0.031
3.074ProArg: 3.074 ± 0.037
4.273ProSer: 4.273 ± 0.048
3.208ProThr: 3.208 ± 0.042
3.526ProVal: 3.526 ± 0.036
0.512ProTrp: 0.512 ± 0.012
1.615ProTyr: 1.615 ± 0.023
0.003ProXaa: 0.003 ± 0.001
Gln
2.618GlnAla: 2.618 ± 0.028
0.841GlnCys: 0.841 ± 0.021
1.802GlnAsp: 1.802 ± 0.02
2.409GlnGlu: 2.409 ± 0.028
1.368GlnPhe: 1.368 ± 0.015
1.836GlnGly: 1.836 ± 0.025
1.192GlnHis: 1.192 ± 0.017
2.021GlnIle: 2.021 ± 0.02
2.329GlnLys: 2.329 ± 0.029
3.825GlnLeu: 3.825 ± 0.032
0.979GlnMet: 0.979 ± 0.016
2.0GlnAsn: 2.0 ± 0.024
2.106GlnPro: 2.106 ± 0.029
2.062GlnGln: 2.062 ± 0.04
2.452GlnArg: 2.452 ± 0.027
2.704GlnSer: 2.704 ± 0.025
2.135GlnThr: 2.135 ± 0.026
2.253GlnVal: 2.253 ± 0.027
0.443GlnTrp: 0.443 ± 0.009
1.342GlnTyr: 1.342 ± 0.018
0.001GlnXaa: 0.001 ± 0.0
Arg
4.486ArgAla: 4.486 ± 0.042
1.248ArgCys: 1.248 ± 0.03
3.235ArgAsp: 3.235 ± 0.033
3.434ArgGlu: 3.434 ± 0.031
1.855ArgPhe: 1.855 ± 0.023
3.291ArgGly: 3.291 ± 0.033
1.764ArgHis: 1.764 ± 0.021
2.66ArgIle: 2.66 ± 0.023
3.591ArgLys: 3.591 ± 0.036
5.245ArgLeu: 5.245 ± 0.039
1.256ArgMet: 1.256 ± 0.013
2.583ArgAsn: 2.583 ± 0.021
3.037ArgPro: 3.037 ± 0.038
2.28ArgGln: 2.28 ± 0.026
4.716ArgArg: 4.716 ± 0.052
4.223ArgSer: 4.223 ± 0.036
3.04ArgThr: 3.04 ± 0.028
3.484ArgVal: 3.484 ± 0.03
0.685ArgTrp: 0.685 ± 0.015
1.872ArgTyr: 1.872 ± 0.021
0.004ArgXaa: 0.004 ± 0.001
Ser
5.201SerAla: 5.201 ± 0.036
1.552SerCys: 1.552 ± 0.047
4.398SerAsp: 4.398 ± 0.034
4.778SerGlu: 4.778 ± 0.035
2.739SerPhe: 2.739 ± 0.026
4.554SerGly: 4.554 ± 0.039
1.909SerHis: 1.909 ± 0.022
3.925SerIle: 3.925 ± 0.034
4.574SerLys: 4.574 ± 0.037
6.822SerLeu: 6.822 ± 0.048
1.685SerMet: 1.685 ± 0.019
3.593SerAsn: 3.593 ± 0.034
4.505SerPro: 4.505 ± 0.061
2.856SerGln: 2.856 ± 0.026
4.178SerArg: 4.178 ± 0.035
7.175SerSer: 7.175 ± 0.06
4.739SerThr: 4.739 ± 0.043
4.89SerVal: 4.89 ± 0.034
0.856SerTrp: 0.856 ± 0.013
2.334SerTyr: 2.334 ± 0.027
0.003SerXaa: 0.003 ± 0.001
Thr
4.088ThrAla: 4.088 ± 0.031
1.313ThrCys: 1.313 ± 0.033
3.163ThrAsp: 3.163 ± 0.031
3.771ThrGlu: 3.771 ± 0.04
2.164ThrPhe: 2.164 ± 0.024
3.192ThrGly: 3.192 ± 0.032
1.468ThrHis: 1.468 ± 0.021
3.177ThrIle: 3.177 ± 0.026
3.385ThrLys: 3.385 ± 0.034
5.349ThrLeu: 5.349 ± 0.042
1.287ThrMet: 1.287 ± 0.017
2.647ThrAsn: 2.647 ± 0.026
3.628ThrPro: 3.628 ± 0.029
2.172ThrGln: 2.172 ± 0.025
2.985ThrArg: 2.985 ± 0.03
4.796ThrSer: 4.796 ± 0.04
3.962ThrThr: 3.962 ± 0.069
3.949ThrVal: 3.949 ± 0.039
0.638ThrTrp: 0.638 ± 0.012
1.798ThrTyr: 1.798 ± 0.027
0.002ThrXaa: 0.002 ± 0.001
Val
4.544ValAla: 4.544 ± 0.038
1.521ValCys: 1.521 ± 0.033
3.42ValAsp: 3.42 ± 0.026
3.984ValGlu: 3.984 ± 0.039
2.422ValPhe: 2.422 ± 0.026
3.162ValGly: 3.162 ± 0.029
1.599ValHis: 1.599 ± 0.022
3.429ValIle: 3.429 ± 0.029
3.857ValLys: 3.857 ± 0.033
5.94ValLeu: 5.94 ± 0.043
1.421ValMet: 1.421 ± 0.019
2.883ValAsn: 2.883 ± 0.031
3.484ValPro: 3.484 ± 0.035
2.437ValGln: 2.437 ± 0.028
3.575ValArg: 3.575 ± 0.029
4.992ValSer: 4.992 ± 0.036
4.063ValThr: 4.063 ± 0.036
4.326ValVal: 4.326 ± 0.035
0.732ValTrp: 0.732 ± 0.017
2.009ValTyr: 2.009 ± 0.022
0.001ValXaa: 0.001 ± 0.0
Trp
0.682TrpAla: 0.682 ± 0.014
0.239TrpCys: 0.239 ± 0.007
0.631TrpAsp: 0.631 ± 0.015
0.639TrpGlu: 0.639 ± 0.011
0.425TrpPhe: 0.425 ± 0.01
0.604TrpGly: 0.604 ± 0.012
0.285TrpHis: 0.285 ± 0.008
0.579TrpIle: 0.579 ± 0.011
0.698TrpLys: 0.698 ± 0.012
1.153TrpLeu: 1.153 ± 0.018
0.296TrpMet: 0.296 ± 0.009
0.53TrpAsn: 0.53 ± 0.013
0.459TrpPro: 0.459 ± 0.011
0.45TrpGln: 0.45 ± 0.01
0.842TrpArg: 0.842 ± 0.015
0.846TrpSer: 0.846 ± 0.015
0.658TrpThr: 0.658 ± 0.013
0.648TrpVal: 0.648 ± 0.014
0.194TrpTrp: 0.194 ± 0.006
0.358TrpTyr: 0.358 ± 0.01
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.877TyrAla: 1.877 ± 0.019
0.79TyrCys: 0.79 ± 0.014
1.784TyrAsp: 1.784 ± 0.022
1.986TyrGlu: 1.986 ± 0.023
1.361TyrPhe: 1.361 ± 0.021
1.909TyrGly: 1.909 ± 0.022
0.867TyrHis: 0.867 ± 0.015
1.801TyrIle: 1.801 ± 0.021
2.02TyrLys: 2.02 ± 0.025
2.981TyrLeu: 2.981 ± 0.03
0.774TyrMet: 0.774 ± 0.013
1.724TyrAsn: 1.724 ± 0.022
1.433TyrPro: 1.433 ± 0.024
1.209TyrGln: 1.209 ± 0.017
1.822TyrArg: 1.822 ± 0.021
2.437TyrSer: 2.437 ± 0.026
1.908TyrThr: 1.908 ± 0.023
2.029TyrVal: 2.029 ± 0.024
0.418TyrTrp: 0.418 ± 0.011
1.303TyrTyr: 1.303 ± 0.021
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.004XaaAla: 0.004 ± 0.001
0.002XaaCys: 0.002 ± 0.001
0.001XaaAsp: 0.001 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.001XaaGly: 0.001 ± 0.001
0.001XaaHis: 0.001 ± 0.0
0.002XaaIle: 0.002 ± 0.001
0.001XaaLys: 0.001 ± 0.0
0.004XaaLeu: 0.004 ± 0.001
0.001XaaMet: 0.001 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.004XaaPro: 0.004 ± 0.001
0.001XaaGln: 0.001 ± 0.0
0.002XaaArg: 0.002 ± 0.001
0.002XaaSer: 0.002 ± 0.001
0.002XaaThr: 0.002 ± 0.001
0.002XaaVal: 0.002 ± 0.001
0.0XaaTrp: 0.0 ± 0.0
0.001XaaTyr: 0.001 ± 0.0
0.017XaaXaa: 0.017 ± 0.006
Statistics based on 16814 proteins (6022392 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski