Amino acid dipeptide frequency for Penicillium nalgiovense

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
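As an illustration of what such a calculation might look like, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein and that any non-standard letter is mapped to X (Xaa); the page does not state the exact counting procedure, so both points are assumptions. The function name dipeptide_per_mille and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

# One-letter codes for the 20 standard amino acids; anything else becomes "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping adjacent pairs."""
    counts = Counter()
    for seq in sequences:
        # Normalise: upper-case and map non-standard letters to X (assumption).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = STANDARD + "X"
    # Report all 441 combinations, including those that never occur.
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(alphabet, repeat=2)
    }


# Toy input (hypothetical sequences, not taken from the proteome).
freqs = dipeptide_per_mille(["AAAC", "ACU"])
```

In the toy input the letter U is not one of the 20 standard codes, so the pair CU is counted as CX.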

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
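The arithmetic behind the expected value and the unit conversion can be sketched in a few lines of Python. The helper names are hypothetical, and comparing each value directly against the uniform expectation of 2.5‰ is an assumption about how over- and underrepresentation is decided; the page does not specify the exact rule. The four observed values are copied from the table on this page.

```python
# Expected frequency if all 400 standard dipeptides were equally likely.
N_DIPEPTIDES = 20 * 20
expected_per_mille = 1000.0 / N_DIPEPTIDES  # 2.5 per mille, i.e. 0.25%


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=expected_per_mille):
    """Label a per mille frequency relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# A few values from the table, in per mille.
observed = {"AlaAla": 8.355, "CysCys": 0.241, "SerSer": 8.708, "TrpTrp": 0.28}
labels = {name: classify(value) for name, value in observed.items()}
```

Under this rule AlaAla and SerSer come out as overrepresented, while CysCys and TrpTrp come out as underrepresented.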

For more information, see the sequence space article on Wikipedia.

Rows are grouped by the first residue; the second residue runs in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.355 ± 0.064
AlaCys: 1.078 ± 0.017
AlaAsp: 4.224 ± 0.032
AlaGlu: 5.09 ± 0.048
AlaPhe: 3.126 ± 0.03
AlaGly: 5.595 ± 0.036
AlaHis: 1.805 ± 0.018
AlaIle: 4.269 ± 0.034
AlaLys: 3.877 ± 0.028
AlaLeu: 7.613 ± 0.045
AlaMet: 1.987 ± 0.019
AlaAsn: 2.968 ± 0.025
AlaPro: 4.792 ± 0.049
AlaGln: 3.39 ± 0.028
AlaArg: 4.842 ± 0.034
AlaSer: 7.097 ± 0.046
AlaThr: 5.147 ± 0.037
AlaVal: 5.248 ± 0.036
AlaTrp: 1.122 ± 0.017
AlaTyr: 2.098 ± 0.02
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.979 ± 0.014
CysCys: 0.241 ± 0.009
CysAsp: 0.694 ± 0.013
CysGlu: 0.623 ± 0.011
CysPhe: 0.562 ± 0.01
CysGly: 0.945 ± 0.017
CysHis: 0.36 ± 0.009
CysIle: 0.72 ± 0.014
CysLys: 0.497 ± 0.01
CysLeu: 1.297 ± 0.017
CysMet: 0.276 ± 0.008
CysAsn: 0.438 ± 0.01
CysPro: 0.692 ± 0.015
CysGln: 0.488 ± 0.01
CysArg: 0.767 ± 0.012
CysSer: 0.928 ± 0.016
CysThr: 0.707 ± 0.013
CysVal: 0.842 ± 0.016
CysTrp: 0.2 ± 0.006
CysTyr: 0.35 ± 0.008
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.477 ± 0.033
AspCys: 0.644 ± 0.014
AspAsp: 3.939 ± 0.042
AspGlu: 4.319 ± 0.037
AspPhe: 2.161 ± 0.022
AspGly: 3.849 ± 0.029
AspHis: 1.37 ± 0.022
AspIle: 3.185 ± 0.026
AspLys: 2.289 ± 0.02
AspLeu: 5.189 ± 0.037
AspMet: 1.345 ± 0.017
AspAsn: 1.98 ± 0.019
AspPro: 3.411 ± 0.022
AspGln: 2.056 ± 0.021
AspArg: 3.16 ± 0.03
AspSer: 4.334 ± 0.039
AspThr: 3.115 ± 0.025
AspVal: 3.596 ± 0.024
AspTrp: 0.89 ± 0.013
AspTyr: 1.619 ± 0.018
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.113 ± 0.04
GluCys: 0.653 ± 0.013
GluAsp: 4.147 ± 0.036
GluGlu: 5.195 ± 0.052
GluPhe: 2.007 ± 0.019
GluGly: 3.627 ± 0.028
GluHis: 1.438 ± 0.018
GluIle: 3.206 ± 0.027
GluLys: 3.502 ± 0.031
GluLeu: 5.114 ± 0.037
GluMet: 1.489 ± 0.016
GluAsn: 2.432 ± 0.023
GluPro: 2.961 ± 0.045
GluGln: 2.522 ± 0.025
GluArg: 3.79 ± 0.034
GluSer: 4.536 ± 0.035
GluThr: 3.671 ± 0.03
GluVal: 3.454 ± 0.03
GluTrp: 0.898 ± 0.014
GluTyr: 1.698 ± 0.019
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.034 ± 0.029
PheCys: 0.578 ± 0.012
PheAsp: 2.309 ± 0.023
PheGlu: 2.17 ± 0.023
PhePhe: 1.686 ± 0.021
PheGly: 2.84 ± 0.028
PheHis: 0.952 ± 0.013
PheIle: 1.889 ± 0.022
PheLys: 1.451 ± 0.019
PheLeu: 3.495 ± 0.028
PheMet: 0.848 ± 0.015
PheAsn: 1.493 ± 0.017
PhePro: 2.011 ± 0.019
PheGln: 1.455 ± 0.016
PheArg: 1.998 ± 0.02
PheSer: 3.016 ± 0.026
PheThr: 2.162 ± 0.023
PheVal: 2.402 ± 0.025
PheTrp: 0.628 ± 0.011
PheTyr: 1.145 ± 0.016
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.148 ± 0.041
GlyCys: 0.895 ± 0.016
GlyAsp: 3.565 ± 0.032
GlyGlu: 3.511 ± 0.03
GlyPhe: 2.822 ± 0.028
GlyGly: 5.227 ± 0.051
GlyHis: 1.731 ± 0.02
GlyIle: 3.594 ± 0.034
GlyLys: 3.226 ± 0.025
GlyLeu: 6.081 ± 0.038
GlyMet: 1.636 ± 0.018
GlyAsn: 2.545 ± 0.027
GlyPro: 3.382 ± 0.031
GlyGln: 2.586 ± 0.022
GlyArg: 3.933 ± 0.031
GlySer: 5.747 ± 0.038
GlyThr: 3.94 ± 0.034
GlyVal: 4.279 ± 0.031
GlyTrp: 1.128 ± 0.017
GlyTyr: 2.124 ± 0.024
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.868 ± 0.014
HisCys: 0.352 ± 0.009
HisAsp: 1.339 ± 0.015
HisGlu: 1.391 ± 0.018
HisPhe: 0.961 ± 0.014
HisGly: 1.769 ± 0.02
HisHis: 0.839 ± 0.016
HisIle: 1.268 ± 0.017
HisLys: 0.907 ± 0.013
HisLeu: 2.304 ± 0.022
HisMet: 0.532 ± 0.01
HisAsn: 0.908 ± 0.013
HisPro: 1.7 ± 0.02
HisGln: 1.037 ± 0.014
HisArg: 1.61 ± 0.021
HisSer: 1.942 ± 0.023
HisThr: 1.346 ± 0.018
HisVal: 1.472 ± 0.016
HisTrp: 0.366 ± 0.009
HisTyr: 0.716 ± 0.012
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.234 ± 0.033
IleCys: 0.804 ± 0.013
IleAsp: 2.968 ± 0.028
IleGlu: 2.916 ± 0.025
IlePhe: 2.093 ± 0.023
IleGly: 3.315 ± 0.031
IleHis: 1.287 ± 0.016
IleIle: 2.567 ± 0.027
IleLys: 2.114 ± 0.021
IleLeu: 4.664 ± 0.038
IleMet: 1.069 ± 0.013
IleAsn: 1.838 ± 0.02
IlePro: 3.242 ± 0.03
IleGln: 2.018 ± 0.02
IleArg: 2.837 ± 0.026
IleSer: 4.042 ± 0.028
IleThr: 2.833 ± 0.024
IleVal: 3.169 ± 0.03
IleTrp: 0.753 ± 0.012
IleTyr: 1.446 ± 0.017
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.948 ± 0.037
LysCys: 0.526 ± 0.011
LysAsp: 2.709 ± 0.024
LysGlu: 3.182 ± 0.034
LysPhe: 1.456 ± 0.016
LysGly: 2.806 ± 0.027
LysHis: 1.086 ± 0.014
LysIle: 2.233 ± 0.02
LysLys: 3.034 ± 0.038
LysLeu: 3.896 ± 0.033
LysMet: 1.003 ± 0.013
LysAsn: 1.751 ± 0.019
LysPro: 2.687 ± 0.028
LysGln: 1.873 ± 0.022
LysArg: 3.25 ± 0.031
LysSer: 3.507 ± 0.031
LysThr: 2.77 ± 0.022
LysVal: 2.697 ± 0.022
LysTrp: 0.686 ± 0.013
LysTyr: 1.338 ± 0.017
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.731 ± 0.043
LeuCys: 1.246 ± 0.016
LeuAsp: 5.207 ± 0.035
LeuGlu: 5.466 ± 0.04
LeuPhe: 3.365 ± 0.035
LeuGly: 5.942 ± 0.036
LeuHis: 2.3 ± 0.024
LeuIle: 4.009 ± 0.033
LeuLys: 4.039 ± 0.028
LeuLeu: 8.342 ± 0.063
LeuMet: 1.868 ± 0.02
LeuAsn: 3.261 ± 0.026
LeuPro: 5.462 ± 0.038
LeuGln: 3.935 ± 0.032
LeuArg: 5.753 ± 0.04
LeuSer: 7.48 ± 0.043
LeuThr: 4.816 ± 0.034
LeuVal: 5.505 ± 0.041
LeuTrp: 1.22 ± 0.016
LeuTyr: 2.344 ± 0.022
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.243 ± 0.021
MetCys: 0.262 ± 0.007
MetAsp: 1.349 ± 0.017
MetGlu: 1.362 ± 0.015
MetPhe: 0.802 ± 0.013
MetGly: 1.546 ± 0.019
MetHis: 0.518 ± 0.008
MetIle: 1.089 ± 0.015
MetLys: 1.011 ± 0.014
MetLeu: 1.879 ± 0.022
MetMet: 0.602 ± 0.011
MetAsn: 0.863 ± 0.015
MetPro: 1.278 ± 0.016
MetGln: 0.935 ± 0.015
MetArg: 1.278 ± 0.016
MetSer: 1.984 ± 0.02
MetThr: 1.389 ± 0.018
MetVal: 1.393 ± 0.017
MetTrp: 0.27 ± 0.007
MetTyr: 0.521 ± 0.009
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.129 ± 0.022
AsnCys: 0.473 ± 0.012
AsnAsp: 2.012 ± 0.021
AsnGlu: 2.135 ± 0.022
AsnPhe: 1.395 ± 0.015
AsnGly: 2.89 ± 0.028
AsnHis: 0.883 ± 0.012
AsnIle: 2.129 ± 0.02
AsnLys: 1.559 ± 0.019
AsnLeu: 3.313 ± 0.028
AsnMet: 0.911 ± 0.015
AsnAsn: 1.455 ± 0.02
AsnPro: 2.624 ± 0.026
AsnGln: 1.436 ± 0.015
AsnArg: 2.014 ± 0.02
AsnSer: 2.806 ± 0.022
AsnThr: 2.262 ± 0.021
AsnVal: 2.36 ± 0.023
AsnTrp: 0.584 ± 0.01
AsnTyr: 1.065 ± 0.015
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.231 ± 0.045
ProCys: 0.557 ± 0.012
ProAsp: 3.308 ± 0.027
ProGlu: 3.983 ± 0.039
ProPhe: 2.104 ± 0.021
ProGly: 3.935 ± 0.034
ProHis: 1.388 ± 0.018
ProIle: 2.661 ± 0.026
ProLys: 2.643 ± 0.028
ProLeu: 4.746 ± 0.033
ProMet: 1.185 ± 0.017
ProAsn: 2.268 ± 0.024
ProPro: 4.729 ± 0.072
ProGln: 2.498 ± 0.03
ProArg: 3.498 ± 0.032
ProSer: 6.246 ± 0.056
ProThr: 4.158 ± 0.032
ProVal: 3.722 ± 0.031
ProTrp: 0.776 ± 0.013
ProTyr: 1.508 ± 0.018
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.403 ± 0.028
GlnCys: 0.477 ± 0.009
GlnAsp: 2.104 ± 0.022
GlnGlu: 2.48 ± 0.026
GlnPhe: 1.374 ± 0.015
GlnGly: 2.466 ± 0.022
GlnHis: 1.07 ± 0.015
GlnIle: 2.0 ± 0.018
GlnLys: 2.005 ± 0.022
GlnLeu: 3.515 ± 0.026
GlnMet: 0.966 ± 0.013
GlnAsn: 1.655 ± 0.017
GlnPro: 2.656 ± 0.032
GlnGln: 2.204 ± 0.035
GlnArg: 2.665 ± 0.023
GlnSer: 3.364 ± 0.031
GlnThr: 2.47 ± 0.022
GlnVal: 2.223 ± 0.022
GlnTrp: 0.599 ± 0.01
GlnTyr: 1.217 ± 0.014
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.628 ± 0.032
ArgCys: 0.723 ± 0.012
ArgAsp: 3.343 ± 0.03
ArgGlu: 3.747 ± 0.032
ArgPhe: 2.179 ± 0.021
ArgGly: 3.701 ± 0.035
ArgHis: 1.582 ± 0.02
ArgIle: 2.903 ± 0.024
ArgLys: 3.319 ± 0.031
ArgLeu: 5.551 ± 0.043
ArgMet: 1.348 ± 0.016
ArgAsn: 2.273 ± 0.018
ArgPro: 3.552 ± 0.034
ArgGln: 2.614 ± 0.025
ArgArg: 4.952 ± 0.044
ArgSer: 4.863 ± 0.042
ArgThr: 3.276 ± 0.03
ArgVal: 3.453 ± 0.027
ArgTrp: 0.946 ± 0.015
ArgTyr: 1.627 ± 0.017
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.689 ± 0.042
SerCys: 0.908 ± 0.015
SerAsp: 4.475 ± 0.035
SerGlu: 4.396 ± 0.034
SerPhe: 3.07 ± 0.027
SerGly: 5.527 ± 0.038
SerHis: 2.075 ± 0.021
SerIle: 4.091 ± 0.03
SerLys: 3.74 ± 0.031
SerLeu: 7.415 ± 0.041
SerMet: 1.814 ± 0.021
SerAsn: 3.111 ± 0.027
SerPro: 5.725 ± 0.058
SerGln: 3.474 ± 0.029
SerArg: 5.031 ± 0.039
SerSer: 8.708 ± 0.081
SerThr: 5.691 ± 0.041
SerVal: 4.824 ± 0.034
SerTrp: 1.181 ± 0.015
SerTyr: 2.102 ± 0.02
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.091 ± 0.032
ThrCys: 0.747 ± 0.014
ThrAsp: 2.987 ± 0.026
ThrGlu: 3.364 ± 0.025
ThrPhe: 2.207 ± 0.025
ThrGly: 4.206 ± 0.035
ThrHis: 1.344 ± 0.019
ThrIle: 3.123 ± 0.024
ThrLys: 2.601 ± 0.029
ThrLeu: 5.258 ± 0.035
ThrMet: 1.258 ± 0.015
ThrAsn: 2.142 ± 0.022
ThrPro: 4.454 ± 0.043
ThrGln: 2.222 ± 0.022
ThrArg: 3.225 ± 0.027
ThrSer: 5.346 ± 0.042
ThrThr: 4.135 ± 0.04
ThrVal: 3.788 ± 0.026
ThrTrp: 0.851 ± 0.013
ThrTyr: 1.597 ± 0.019
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.172 ± 0.033
ValCys: 0.849 ± 0.013
ValAsp: 3.737 ± 0.028
ValGlu: 3.739 ± 0.027
ValPhe: 2.512 ± 0.025
ValGly: 3.943 ± 0.034
ValHis: 1.451 ± 0.017
ValIle: 3.082 ± 0.025
ValLys: 2.707 ± 0.023
ValLeu: 5.558 ± 0.04
ValMet: 1.39 ± 0.015
ValAsn: 2.292 ± 0.024
ValPro: 3.66 ± 0.029
ValGln: 2.445 ± 0.022
ValArg: 3.398 ± 0.031
ValSer: 4.865 ± 0.035
ValThr: 3.523 ± 0.03
ValVal: 4.149 ± 0.033
ValTrp: 0.854 ± 0.015
ValTyr: 1.737 ± 0.019
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.15 ± 0.016
TrpCys: 0.19 ± 0.006
TrpAsp: 0.899 ± 0.017
TrpGlu: 0.857 ± 0.014
TrpPhe: 0.517 ± 0.011
TrpGly: 0.921 ± 0.014
TrpHis: 0.364 ± 0.009
TrpIle: 0.768 ± 0.013
TrpLys: 0.824 ± 0.013
TrpLeu: 1.368 ± 0.019
TrpMet: 0.394 ± 0.009
TrpAsn: 0.639 ± 0.011
TrpPro: 0.599 ± 0.011
TrpGln: 0.566 ± 0.01
TrpArg: 0.947 ± 0.014
TrpSer: 1.115 ± 0.012
TrpThr: 0.932 ± 0.014
TrpVal: 0.892 ± 0.015
TrpTrp: 0.28 ± 0.008
TrpTyr: 0.441 ± 0.01
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.099 ± 0.022
TyrCys: 0.402 ± 0.009
TyrAsp: 1.614 ± 0.018
TyrGlu: 1.527 ± 0.018
TyrPhe: 1.162 ± 0.017
TyrGly: 2.049 ± 0.023
TyrHis: 0.786 ± 0.012
TyrIle: 1.421 ± 0.018
TyrLys: 1.094 ± 0.014
TyrLeu: 2.692 ± 0.024
TyrMet: 0.65 ± 0.012
TyrAsn: 1.124 ± 0.014
TyrPro: 1.525 ± 0.018
TyrGln: 1.159 ± 0.014
TyrArg: 1.631 ± 0.018
TyrSer: 2.122 ± 0.02
TyrThr: 1.604 ± 0.015
TyrVal: 1.598 ± 0.021
TyrTrp: 0.431 ± 0.01
TyrTyr: 0.968 ± 0.017
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.035 ± 0.012
Statistics based on 11728 proteins (5354899 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
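A protein-level bootstrap of this kind could look roughly like the Python sketch below. It assumes that whole proteins are resampled with replacement, that the frequency of one dipeptide is recomputed for each of 100 replicates, and that the reported error is the standard deviation across replicates; the page states only that bootstrapping (×100) at the protein level was used, so these details and all function names are assumptions.

```python
import random
import statistics


def count_pairs(seq, dipeptide):
    """Count overlapping occurrences of a two-letter dipeptide in one sequence."""
    return sum(1 for a, b in zip(seq, seq[1:]) if a + b == dipeptide)


def bootstrap_error(sequences, dipeptide, n_boot=100, seed=0):
    """Standard deviation of the per mille frequency over protein-level resamples."""
    rng = random.Random(seed)
    # Pre-compute (hits, number of pairs) per protein so each replicate is cheap.
    per_protein = [
        (count_pairs(s, dipeptide), max(len(s) - 1, 0)) for s in sequences
    ]
    estimates = []
    for _replicate in range(n_boot):
        # Resample whole proteins with replacement (same number as the input).
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _total in sample)
        total = sum(t for _hits, t in sample)
        if total:
            estimates.append(1000.0 * hits / total)
    return statistics.stdev(estimates) if len(estimates) > 1 else 0.0


# Toy input (hypothetical sequences written in one-letter code).
proteins = ["AAAC", "ACAA", "CCCC", "AAAA"]
err = bootstrap_error(proteins, "AA")
```

Resampling proteins rather than individual residues keeps the pairs within each protein together, which is presumably why the note specifies the protein level.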

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski