Amino acid dipepetide frequency for Aspergillus carbonarius (strain ITEM 5010)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.527AlaAla: 8.527 ± 0.057
1.164AlaCys: 1.164 ± 0.018
4.149AlaAsp: 4.149 ± 0.03
4.801AlaGlu: 4.801 ± 0.036
3.26AlaPhe: 3.26 ± 0.029
5.799AlaGly: 5.799 ± 0.033
1.84AlaHis: 1.84 ± 0.022
4.434AlaIle: 4.434 ± 0.034
3.5AlaLys: 3.5 ± 0.033
8.096AlaLeu: 8.096 ± 0.048
2.04AlaMet: 2.04 ± 0.024
2.778AlaAsn: 2.778 ± 0.025
4.405AlaPro: 4.405 ± 0.032
3.27AlaGln: 3.27 ± 0.032
4.951AlaArg: 4.951 ± 0.036
7.082AlaSer: 7.082 ± 0.04
5.233AlaThr: 5.233 ± 0.035
5.603AlaVal: 5.603 ± 0.047
1.254AlaTrp: 1.254 ± 0.02
2.308AlaTyr: 2.308 ± 0.023
0.0AlaXaa: 0.0 ± 0.0
Cys
0.997CysAla: 0.997 ± 0.015
0.26CysCys: 0.26 ± 0.007
0.711CysAsp: 0.711 ± 0.012
0.634CysGlu: 0.634 ± 0.011
0.57CysPhe: 0.57 ± 0.011
0.973CysGly: 0.973 ± 0.018
0.375CysHis: 0.375 ± 0.009
0.759CysIle: 0.759 ± 0.014
0.456CysLys: 0.456 ± 0.01
1.448CysLeu: 1.448 ± 0.018
0.294CysMet: 0.294 ± 0.008
0.445CysAsn: 0.445 ± 0.01
0.679CysPro: 0.679 ± 0.013
0.483CysGln: 0.483 ± 0.011
0.864CysArg: 0.864 ± 0.015
1.006CysSer: 1.006 ± 0.016
0.743CysThr: 0.743 ± 0.013
0.867CysVal: 0.867 ± 0.015
0.225CysTrp: 0.225 ± 0.007
0.387CysTyr: 0.387 ± 0.01
0.0CysXaa: 0.0 ± 0.0
Asp
4.601AspAla: 4.601 ± 0.034
0.653AspCys: 0.653 ± 0.013
3.797AspAsp: 3.797 ± 0.04
4.067AspGlu: 4.067 ± 0.033
2.174AspPhe: 2.174 ± 0.021
3.983AspGly: 3.983 ± 0.037
1.31AspHis: 1.31 ± 0.016
3.132AspIle: 3.132 ± 0.025
2.071AspLys: 2.071 ± 0.023
5.293AspLeu: 5.293 ± 0.035
1.244AspMet: 1.244 ± 0.019
1.791AspAsn: 1.791 ± 0.022
3.469AspPro: 3.469 ± 0.028
1.934AspGln: 1.934 ± 0.019
3.147AspArg: 3.147 ± 0.029
3.931AspSer: 3.931 ± 0.03
2.975AspThr: 2.975 ± 0.026
3.66AspVal: 3.66 ± 0.029
0.929AspTrp: 0.929 ± 0.014
1.702AspTyr: 1.702 ± 0.019
0.0AspXaa: 0.0 ± 0.0
Glu
5.033GluAla: 5.033 ± 0.037
0.649GluCys: 0.649 ± 0.013
3.878GluAsp: 3.878 ± 0.034
5.033GluGlu: 5.033 ± 0.046
1.977GluPhe: 1.977 ± 0.02
3.792GluGly: 3.792 ± 0.031
1.383GluHis: 1.383 ± 0.018
3.018GluIle: 3.018 ± 0.027
3.252GluLys: 3.252 ± 0.03
5.091GluLeu: 5.091 ± 0.031
1.475GluMet: 1.475 ± 0.017
2.132GluAsn: 2.132 ± 0.019
2.637GluPro: 2.637 ± 0.024
2.378GluGln: 2.378 ± 0.025
3.797GluArg: 3.797 ± 0.032
4.16GluSer: 4.16 ± 0.031
3.463GluThr: 3.463 ± 0.025
3.592GluVal: 3.592 ± 0.03
0.921GluTrp: 0.921 ± 0.016
1.754GluTyr: 1.754 ± 0.018
0.0GluXaa: 0.0 ± 0.0
Phe
3.119PheAla: 3.119 ± 0.027
0.619PheCys: 0.619 ± 0.011
2.266PheAsp: 2.266 ± 0.022
2.07PheGlu: 2.07 ± 0.021
1.708PhePhe: 1.708 ± 0.023
2.861PheGly: 2.861 ± 0.03
1.012PheHis: 1.012 ± 0.014
1.925PheIle: 1.925 ± 0.021
1.338PheLys: 1.338 ± 0.017
3.801PheLeu: 3.801 ± 0.036
0.8PheMet: 0.8 ± 0.013
1.398PheAsn: 1.398 ± 0.016
2.089PhePro: 2.089 ± 0.021
1.479PheGln: 1.479 ± 0.016
2.07PheArg: 2.07 ± 0.02
3.046PheSer: 3.046 ± 0.025
2.281PheThr: 2.281 ± 0.024
2.473PheVal: 2.473 ± 0.024
0.692PheTrp: 0.692 ± 0.013
1.206PheTyr: 1.206 ± 0.018
0.0PheXaa: 0.0 ± 0.0
Gly
5.206GlyAla: 5.206 ± 0.038
0.962GlyCys: 0.962 ± 0.015
3.607GlyAsp: 3.607 ± 0.031
3.686GlyGlu: 3.686 ± 0.032
2.885GlyPhe: 2.885 ± 0.028
5.424GlyGly: 5.424 ± 0.042
1.717GlyHis: 1.717 ± 0.021
3.701GlyIle: 3.701 ± 0.027
3.14GlyLys: 3.14 ± 0.025
6.533GlyLeu: 6.533 ± 0.037
1.624GlyMet: 1.624 ± 0.02
2.406GlyAsn: 2.406 ± 0.025
3.395GlyPro: 3.395 ± 0.032
2.575GlyGln: 2.575 ± 0.023
4.105GlyArg: 4.105 ± 0.033
5.635GlySer: 5.635 ± 0.036
4.03GlyThr: 4.03 ± 0.03
4.707GlyVal: 4.707 ± 0.034
1.259GlyTrp: 1.259 ± 0.015
2.28GlyTyr: 2.28 ± 0.025
0.0GlyXaa: 0.0 ± 0.0
His
1.946HisAla: 1.946 ± 0.02
0.361HisCys: 0.361 ± 0.009
1.363HisAsp: 1.363 ± 0.016
1.328HisGlu: 1.328 ± 0.016
0.944HisPhe: 0.944 ± 0.015
1.783HisGly: 1.783 ± 0.02
0.93HisHis: 0.93 ± 0.017
1.299HisIle: 1.299 ± 0.017
0.814HisLys: 0.814 ± 0.013
2.486HisLeu: 2.486 ± 0.022
0.514HisMet: 0.514 ± 0.012
0.843HisAsn: 0.843 ± 0.012
1.866HisPro: 1.866 ± 0.022
1.008HisGln: 1.008 ± 0.016
1.651HisArg: 1.651 ± 0.018
1.852HisSer: 1.852 ± 0.023
1.367HisThr: 1.367 ± 0.018
1.488HisVal: 1.488 ± 0.014
0.388HisTrp: 0.388 ± 0.009
0.751HisTyr: 0.751 ± 0.012
0.0HisXaa: 0.0 ± 0.0
Ile
4.331IleAla: 4.331 ± 0.028
0.838IleCys: 0.838 ± 0.015
2.832IleAsp: 2.832 ± 0.024
2.753IleGlu: 2.753 ± 0.025
2.125IlePhe: 2.125 ± 0.021
3.363IleGly: 3.363 ± 0.034
1.349IleHis: 1.349 ± 0.014
2.664IleIle: 2.664 ± 0.028
1.937IleLys: 1.937 ± 0.021
4.992IleLeu: 4.992 ± 0.039
1.058IleMet: 1.058 ± 0.017
1.79IleAsn: 1.79 ± 0.02
3.285IlePro: 3.285 ± 0.027
2.02IleGln: 2.02 ± 0.02
2.907IleArg: 2.907 ± 0.025
3.91IleSer: 3.91 ± 0.033
2.971IleThr: 2.971 ± 0.027
3.288IleVal: 3.288 ± 0.028
0.78IleTrp: 0.78 ± 0.013
1.562IleTyr: 1.562 ± 0.018
0.0IleXaa: 0.0 ± 0.0
Lys
3.709LysAla: 3.709 ± 0.03
0.483LysCys: 0.483 ± 0.01
2.419LysAsp: 2.419 ± 0.024
2.918LysGlu: 2.918 ± 0.033
1.282LysPhe: 1.282 ± 0.018
2.703LysGly: 2.703 ± 0.023
1.015LysHis: 1.015 ± 0.014
2.07LysIle: 2.07 ± 0.018
2.536LysLys: 2.536 ± 0.035
3.722LysLeu: 3.722 ± 0.034
0.911LysMet: 0.911 ± 0.015
1.499LysAsn: 1.499 ± 0.018
2.395LysPro: 2.395 ± 0.026
1.651LysGln: 1.651 ± 0.021
3.037LysArg: 3.037 ± 0.029
3.049LysSer: 3.049 ± 0.031
2.435LysThr: 2.435 ± 0.023
2.644LysVal: 2.644 ± 0.026
0.619LysTrp: 0.619 ± 0.011
1.297LysTyr: 1.297 ± 0.021
0.0LysXaa: 0.0 ± 0.0
Leu
8.198LeuAla: 8.198 ± 0.051
1.319LeuCys: 1.319 ± 0.017
5.371LeuAsp: 5.371 ± 0.035
5.574LeuGlu: 5.574 ± 0.042
3.581LeuPhe: 3.581 ± 0.03
6.452LeuGly: 6.452 ± 0.039
2.451LeuHis: 2.451 ± 0.023
4.289LeuIle: 4.289 ± 0.034
3.847LeuLys: 3.847 ± 0.028
9.125LeuLeu: 9.125 ± 0.052
1.88LeuMet: 1.88 ± 0.02
3.191LeuAsn: 3.191 ± 0.026
5.697LeuPro: 5.697 ± 0.035
4.049LeuGln: 4.049 ± 0.034
6.043LeuArg: 6.043 ± 0.038
7.773LeuSer: 7.773 ± 0.046
5.227LeuThr: 5.227 ± 0.036
5.897LeuVal: 5.897 ± 0.043
1.311LeuTrp: 1.311 ± 0.016
2.64LeuTyr: 2.64 ± 0.022
0.0LeuXaa: 0.0 ± 0.0
Met
2.228MetAla: 2.228 ± 0.023
0.248MetCys: 0.248 ± 0.008
1.236MetAsp: 1.236 ± 0.019
1.31MetGlu: 1.31 ± 0.015
0.792MetPhe: 0.792 ± 0.014
1.542MetGly: 1.542 ± 0.021
0.515MetHis: 0.515 ± 0.01
1.069MetIle: 1.069 ± 0.013
0.944MetLys: 0.944 ± 0.015
1.923MetLeu: 1.923 ± 0.024
0.582MetMet: 0.582 ± 0.011
0.792MetAsn: 0.792 ± 0.013
1.212MetPro: 1.212 ± 0.015
0.865MetGln: 0.865 ± 0.013
1.25MetArg: 1.25 ± 0.016
1.908MetSer: 1.908 ± 0.019
1.337MetThr: 1.337 ± 0.017
1.446MetVal: 1.446 ± 0.019
0.274MetTrp: 0.274 ± 0.008
0.561MetTyr: 0.561 ± 0.011
0.0MetXaa: 0.0 ± 0.0
Asn
2.955AsnAla: 2.955 ± 0.025
0.445AsnCys: 0.445 ± 0.01
1.836AsnAsp: 1.836 ± 0.02
1.874AsnGlu: 1.874 ± 0.02
1.309AsnPhe: 1.309 ± 0.017
2.724AsnGly: 2.724 ± 0.022
0.865AsnHis: 0.865 ± 0.014
2.006AsnIle: 2.006 ± 0.02
1.338AsnLys: 1.338 ± 0.016
3.242AsnLeu: 3.242 ± 0.027
0.791AsnMet: 0.791 ± 0.012
1.334AsnAsn: 1.334 ± 0.018
2.501AsnPro: 2.501 ± 0.024
1.301AsnGln: 1.301 ± 0.017
1.912AsnArg: 1.912 ± 0.017
2.525AsnSer: 2.525 ± 0.024
2.116AsnThr: 2.116 ± 0.023
2.305AsnVal: 2.305 ± 0.018
0.564AsnTrp: 0.564 ± 0.01
1.099AsnTyr: 1.099 ± 0.016
0.0AsnXaa: 0.0 ± 0.0
Pro
4.947ProAla: 4.947 ± 0.033
0.591ProCys: 0.591 ± 0.012
3.34ProAsp: 3.34 ± 0.028
3.796ProGlu: 3.796 ± 0.029
2.18ProPhe: 2.18 ± 0.023
3.951ProGly: 3.951 ± 0.029
1.406ProHis: 1.406 ± 0.018
2.627ProIle: 2.627 ± 0.024
2.297ProLys: 2.297 ± 0.022
4.922ProLeu: 4.922 ± 0.033
1.07ProMet: 1.07 ± 0.014
2.104ProAsn: 2.104 ± 0.019
4.483ProPro: 4.483 ± 0.053
2.31ProGln: 2.31 ± 0.029
3.427ProArg: 3.427 ± 0.031
5.924ProSer: 5.924 ± 0.046
4.011ProThr: 4.011 ± 0.034
3.723ProVal: 3.723 ± 0.03
0.823ProTrp: 0.823 ± 0.012
1.587ProTyr: 1.587 ± 0.019
0.0ProXaa: 0.0 ± 0.0
Gln
3.376GlnAla: 3.376 ± 0.029
0.497GlnCys: 0.497 ± 0.012
2.06GlnAsp: 2.06 ± 0.02
2.359GlnGlu: 2.359 ± 0.023
1.337GlnPhe: 1.337 ± 0.016
2.454GlnGly: 2.454 ± 0.023
1.054GlnHis: 1.054 ± 0.014
1.946GlnIle: 1.946 ± 0.019
1.803GlnLys: 1.803 ± 0.022
3.599GlnLeu: 3.599 ± 0.029
0.886GlnMet: 0.886 ± 0.014
1.434GlnAsn: 1.434 ± 0.017
2.457GlnPro: 2.457 ± 0.03
2.12GlnGln: 2.12 ± 0.034
2.631GlnArg: 2.631 ± 0.025
3.132GlnSer: 3.132 ± 0.032
2.383GlnThr: 2.383 ± 0.017
2.27GlnVal: 2.27 ± 0.023
0.624GlnTrp: 0.624 ± 0.01
1.213GlnTyr: 1.213 ± 0.015
0.0GlnXaa: 0.0 ± 0.0
Arg
4.67ArgAla: 4.67 ± 0.035
0.787ArgCys: 0.787 ± 0.015
3.32ArgAsp: 3.32 ± 0.027
3.74ArgGlu: 3.74 ± 0.032
2.287ArgPhe: 2.287 ± 0.019
3.722ArgGly: 3.722 ± 0.03
1.602ArgHis: 1.602 ± 0.017
2.983ArgIle: 2.983 ± 0.022
3.157ArgLys: 3.157 ± 0.029
5.845ArgLeu: 5.845 ± 0.035
1.358ArgMet: 1.358 ± 0.018
2.174ArgAsn: 2.174 ± 0.019
3.47ArgPro: 3.47 ± 0.032
2.626ArgGln: 2.626 ± 0.026
4.96ArgArg: 4.96 ± 0.045
4.7ArgSer: 4.7 ± 0.038
3.343ArgThr: 3.343 ± 0.03
3.689ArgVal: 3.689 ± 0.028
1.017ArgTrp: 1.017 ± 0.014
1.798ArgTyr: 1.798 ± 0.021
0.0ArgXaa: 0.0 ± 0.0
Ser
6.494SerAla: 6.494 ± 0.043
0.927SerCys: 0.927 ± 0.014
4.194SerAsp: 4.194 ± 0.03
4.006SerGlu: 4.006 ± 0.031
3.059SerPhe: 3.059 ± 0.026
5.472SerGly: 5.472 ± 0.036
2.054SerHis: 2.054 ± 0.023
4.052SerIle: 4.052 ± 0.031
3.217SerLys: 3.217 ± 0.029
7.654SerLeu: 7.654 ± 0.041
1.739SerMet: 1.739 ± 0.018
2.82SerAsn: 2.82 ± 0.028
5.368SerPro: 5.368 ± 0.047
3.24SerGln: 3.24 ± 0.031
4.932SerArg: 4.932 ± 0.038
8.352SerSer: 8.352 ± 0.069
5.505SerThr: 5.505 ± 0.041
4.783SerVal: 4.783 ± 0.034
1.202SerTrp: 1.202 ± 0.018
2.185SerTyr: 2.185 ± 0.023
0.0SerXaa: 0.0 ± 0.0
Thr
5.26ThrAla: 5.26 ± 0.033
0.812ThrCys: 0.812 ± 0.014
2.992ThrAsp: 2.992 ± 0.025
3.083ThrGlu: 3.083 ± 0.026
2.303ThrPhe: 2.303 ± 0.023
4.376ThrGly: 4.376 ± 0.028
1.386ThrHis: 1.386 ± 0.017
3.241ThrIle: 3.241 ± 0.024
2.271ThrLys: 2.271 ± 0.025
5.622ThrLeu: 5.622 ± 0.038
1.235ThrMet: 1.235 ± 0.016
2.048ThrAsn: 2.048 ± 0.021
4.143ThrPro: 4.143 ± 0.037
2.093ThrGln: 2.093 ± 0.02
3.14ThrArg: 3.14 ± 0.025
5.145ThrSer: 5.145 ± 0.036
4.436ThrThr: 4.436 ± 0.039
3.979ThrVal: 3.979 ± 0.035
0.909ThrTrp: 0.909 ± 0.014
1.806ThrTyr: 1.806 ± 0.019
0.0ThrXaa: 0.0 ± 0.0
Val
5.357ValAla: 5.357 ± 0.037
0.913ValCys: 0.913 ± 0.015
3.837ValAsp: 3.837 ± 0.029
3.891ValGlu: 3.891 ± 0.028
2.646ValPhe: 2.646 ± 0.025
4.282ValGly: 4.282 ± 0.032
1.49ValHis: 1.49 ± 0.018
3.165ValIle: 3.165 ± 0.026
2.66ValLys: 2.66 ± 0.026
6.085ValLeu: 6.085 ± 0.037
1.443ValMet: 1.443 ± 0.017
2.227ValAsn: 2.227 ± 0.024
3.669ValPro: 3.669 ± 0.028
2.48ValGln: 2.48 ± 0.024
3.654ValArg: 3.654 ± 0.03
4.867ValSer: 4.867 ± 0.031
3.669ValThr: 3.669 ± 0.03
4.576ValVal: 4.576 ± 0.04
0.932ValTrp: 0.932 ± 0.014
1.942ValTyr: 1.942 ± 0.019
0.0ValXaa: 0.0 ± 0.0
Trp
1.205TrpAla: 1.205 ± 0.016
0.205TrpCys: 0.205 ± 0.006
0.946TrpAsp: 0.946 ± 0.013
0.889TrpGlu: 0.889 ± 0.014
0.586TrpPhe: 0.586 ± 0.011
1.006TrpGly: 1.006 ± 0.016
0.379TrpHis: 0.379 ± 0.01
0.821TrpIle: 0.821 ± 0.013
0.8TrpLys: 0.8 ± 0.014
1.458TrpLeu: 1.458 ± 0.016
0.407TrpMet: 0.407 ± 0.008
0.67TrpAsn: 0.67 ± 0.011
0.648TrpPro: 0.648 ± 0.012
0.595TrpGln: 0.595 ± 0.01
1.011TrpArg: 1.011 ± 0.017
1.119TrpSer: 1.119 ± 0.015
1.008TrpThr: 1.008 ± 0.014
0.998TrpVal: 0.998 ± 0.014
0.307TrpTrp: 0.307 ± 0.009
0.461TrpTyr: 0.461 ± 0.011
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.333TyrAla: 2.333 ± 0.024
0.442TyrCys: 0.442 ± 0.011
1.713TyrAsp: 1.713 ± 0.018
1.598TyrGlu: 1.598 ± 0.019
1.25TyrPhe: 1.25 ± 0.018
2.217TyrGly: 2.217 ± 0.027
0.848TyrHis: 0.848 ± 0.014
1.552TyrIle: 1.552 ± 0.017
1.018TyrLys: 1.018 ± 0.016
2.975TyrLeu: 2.975 ± 0.025
0.683TyrMet: 0.683 ± 0.012
1.166TyrAsn: 1.166 ± 0.016
1.65TyrPro: 1.65 ± 0.017
1.158TyrGln: 1.158 ± 0.017
1.759TyrArg: 1.759 ± 0.021
2.137TyrSer: 2.137 ± 0.02
1.747TyrThr: 1.747 ± 0.02
1.797TyrVal: 1.797 ± 0.02
0.492TyrTrp: 0.492 ± 0.009
1.03TyrTyr: 1.03 ± 0.016
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 11037 proteins (5099753 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski