Amino acid dipepetide frequency for Aspergillus wentii DTO 134E9

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.93AlaAla: 7.93 ± 0.046
1.084AlaCys: 1.084 ± 0.017
4.039AlaAsp: 4.039 ± 0.027
4.74AlaGlu: 4.74 ± 0.032
3.199AlaPhe: 3.199 ± 0.024
5.632AlaGly: 5.632 ± 0.041
1.68AlaHis: 1.68 ± 0.017
4.314AlaIle: 4.314 ± 0.027
3.699AlaLys: 3.699 ± 0.03
7.441AlaLeu: 7.441 ± 0.043
1.976AlaMet: 1.976 ± 0.02
2.894AlaAsn: 2.894 ± 0.021
4.282AlaPro: 4.282 ± 0.038
3.147AlaGln: 3.147 ± 0.024
4.486AlaArg: 4.486 ± 0.031
6.951AlaSer: 6.951 ± 0.04
4.808AlaThr: 4.808 ± 0.033
5.26AlaVal: 5.26 ± 0.032
1.125AlaTrp: 1.125 ± 0.016
2.144AlaTyr: 2.144 ± 0.021
0.0AlaXaa: 0.0 ± 0.0
Cys
0.954CysAla: 0.954 ± 0.012
0.285CysCys: 0.285 ± 0.009
0.726CysAsp: 0.726 ± 0.011
0.616CysGlu: 0.616 ± 0.011
0.601CysPhe: 0.601 ± 0.012
0.943CysGly: 0.943 ± 0.015
0.38CysHis: 0.38 ± 0.01
0.781CysIle: 0.781 ± 0.013
0.523CysLys: 0.523 ± 0.009
1.394CysLeu: 1.394 ± 0.017
0.304CysMet: 0.304 ± 0.008
0.482CysAsn: 0.482 ± 0.011
0.694CysPro: 0.694 ± 0.014
0.51CysGln: 0.51 ± 0.011
0.817CysArg: 0.817 ± 0.014
1.026CysSer: 1.026 ± 0.016
0.712CysThr: 0.712 ± 0.012
0.831CysVal: 0.831 ± 0.016
0.213CysTrp: 0.213 ± 0.007
0.388CysTyr: 0.388 ± 0.009
0.0CysXaa: 0.0 ± 0.0
Asp
4.444AspAla: 4.444 ± 0.026
0.646AspCys: 0.646 ± 0.011
4.091AspAsp: 4.091 ± 0.041
4.406AspGlu: 4.406 ± 0.036
2.234AspPhe: 2.234 ± 0.021
4.072AspGly: 4.072 ± 0.031
1.302AspHis: 1.302 ± 0.015
3.276AspIle: 3.276 ± 0.026
2.371AspLys: 2.371 ± 0.025
5.118AspLeu: 5.118 ± 0.033
1.274AspMet: 1.274 ± 0.016
2.006AspAsn: 2.006 ± 0.018
3.277AspPro: 3.277 ± 0.025
2.015AspGln: 2.015 ± 0.021
3.115AspArg: 3.115 ± 0.027
4.316AspSer: 4.316 ± 0.029
2.987AspThr: 2.987 ± 0.022
3.626AspVal: 3.626 ± 0.024
0.901AspTrp: 0.901 ± 0.014
1.673AspTyr: 1.673 ± 0.017
0.0AspXaa: 0.0 ± 0.0
Glu
4.908GluAla: 4.908 ± 0.04
0.663GluCys: 0.663 ± 0.012
4.084GluAsp: 4.084 ± 0.035
5.315GluGlu: 5.315 ± 0.048
2.013GluPhe: 2.013 ± 0.018
3.622GluGly: 3.622 ± 0.027
1.319GluHis: 1.319 ± 0.016
3.164GluIle: 3.164 ± 0.024
3.824GluLys: 3.824 ± 0.033
4.927GluLeu: 4.927 ± 0.035
1.544GluMet: 1.544 ± 0.019
2.51GluAsn: 2.51 ± 0.022
2.715GluPro: 2.715 ± 0.032
2.464GluGln: 2.464 ± 0.028
3.736GluArg: 3.736 ± 0.026
4.493GluSer: 4.493 ± 0.031
3.551GluThr: 3.551 ± 0.027
3.393GluVal: 3.393 ± 0.026
0.906GluTrp: 0.906 ± 0.011
1.81GluTyr: 1.81 ± 0.019
0.0GluXaa: 0.0 ± 0.0
Phe
3.046PheAla: 3.046 ± 0.026
0.632PheCys: 0.632 ± 0.011
2.392PheAsp: 2.392 ± 0.023
2.134PheGlu: 2.134 ± 0.019
1.862PhePhe: 1.862 ± 0.023
2.912PheGly: 2.912 ± 0.031
1.015PheHis: 1.015 ± 0.014
2.024PheIle: 2.024 ± 0.023
1.499PheLys: 1.499 ± 0.018
3.796PheLeu: 3.796 ± 0.028
0.847PheMet: 0.847 ± 0.013
1.55PheAsn: 1.55 ± 0.016
2.158PhePro: 2.158 ± 0.022
1.498PheGln: 1.498 ± 0.015
2.016PheArg: 2.016 ± 0.019
3.264PheSer: 3.264 ± 0.026
2.253PheThr: 2.253 ± 0.024
2.445PheVal: 2.445 ± 0.023
0.7PheTrp: 0.7 ± 0.012
1.223PheTyr: 1.223 ± 0.015
0.0PheXaa: 0.0 ± 0.0
Gly
4.988GlyAla: 4.988 ± 0.041
0.964GlyCys: 0.964 ± 0.015
3.661GlyAsp: 3.661 ± 0.028
3.657GlyGlu: 3.657 ± 0.03
2.965GlyPhe: 2.965 ± 0.027
5.392GlyGly: 5.392 ± 0.055
1.692GlyHis: 1.692 ± 0.02
3.754GlyIle: 3.754 ± 0.03
3.448GlyLys: 3.448 ± 0.03
6.183GlyLeu: 6.183 ± 0.038
1.637GlyMet: 1.637 ± 0.02
2.618GlyAsn: 2.618 ± 0.025
3.23GlyPro: 3.23 ± 0.028
2.574GlyGln: 2.574 ± 0.026
3.912GlyArg: 3.912 ± 0.029
5.731GlySer: 5.731 ± 0.046
3.766GlyThr: 3.766 ± 0.03
4.428GlyVal: 4.428 ± 0.031
1.175GlyTrp: 1.175 ± 0.016
2.244GlyTyr: 2.244 ± 0.022
0.0GlyXaa: 0.0 ± 0.0
His
1.812HisAla: 1.812 ± 0.02
0.357HisCys: 0.357 ± 0.008
1.345HisAsp: 1.345 ± 0.016
1.363HisGlu: 1.363 ± 0.015
0.979HisPhe: 0.979 ± 0.012
1.778HisGly: 1.778 ± 0.019
0.896HisHis: 0.896 ± 0.016
1.27HisIle: 1.27 ± 0.015
0.857HisLys: 0.857 ± 0.012
2.367HisLeu: 2.367 ± 0.024
0.492HisMet: 0.492 ± 0.01
0.88HisAsn: 0.88 ± 0.013
1.727HisPro: 1.727 ± 0.018
1.008HisGln: 1.008 ± 0.015
1.586HisArg: 1.586 ± 0.02
1.908HisSer: 1.908 ± 0.023
1.274HisThr: 1.274 ± 0.016
1.446HisVal: 1.446 ± 0.016
0.373HisTrp: 0.373 ± 0.008
0.726HisTyr: 0.726 ± 0.012
0.0HisXaa: 0.0 ± 0.0
Ile
4.23IleAla: 4.23 ± 0.029
0.851IleCys: 0.851 ± 0.015
2.964IleAsp: 2.964 ± 0.026
2.943IleGlu: 2.943 ± 0.027
2.244IlePhe: 2.244 ± 0.025
3.334IleGly: 3.334 ± 0.029
1.358IleHis: 1.358 ± 0.015
2.739IleIle: 2.739 ± 0.027
2.203IleLys: 2.203 ± 0.021
4.902IleLeu: 4.902 ± 0.033
1.082IleMet: 1.082 ± 0.017
1.923IleAsn: 1.923 ± 0.023
3.353IlePro: 3.353 ± 0.024
2.102IleGln: 2.102 ± 0.019
2.819IleArg: 2.819 ± 0.023
4.194IleSer: 4.194 ± 0.03
2.912IleThr: 2.912 ± 0.023
3.276IleVal: 3.276 ± 0.026
0.765IleTrp: 0.765 ± 0.013
1.551IleTyr: 1.551 ± 0.018
0.0IleXaa: 0.0 ± 0.0
Lys
3.922LysAla: 3.922 ± 0.03
0.537LysCys: 0.537 ± 0.011
2.805LysAsp: 2.805 ± 0.025
3.351LysGlu: 3.351 ± 0.033
1.426LysPhe: 1.426 ± 0.018
2.961LysGly: 2.961 ± 0.024
1.12LysHis: 1.12 ± 0.014
2.316LysIle: 2.316 ± 0.022
3.215LysLys: 3.215 ± 0.033
3.92LysLeu: 3.92 ± 0.029
1.025LysMet: 1.025 ± 0.015
1.903LysAsn: 1.903 ± 0.021
2.726LysPro: 2.726 ± 0.026
1.905LysGln: 1.905 ± 0.021
3.229LysArg: 3.229 ± 0.028
3.527LysSer: 3.527 ± 0.028
2.814LysThr: 2.814 ± 0.025
2.745LysVal: 2.745 ± 0.025
0.686LysTrp: 0.686 ± 0.012
1.397LysTyr: 1.397 ± 0.018
0.0LysXaa: 0.0 ± 0.0
Leu
7.474LeuAla: 7.474 ± 0.041
1.296LeuCys: 1.296 ± 0.015
5.246LeuAsp: 5.246 ± 0.037
5.39LeuGlu: 5.39 ± 0.035
3.619LeuPhe: 3.619 ± 0.033
6.005LeuGly: 6.005 ± 0.04
2.324LeuHis: 2.324 ± 0.019
4.172LeuIle: 4.172 ± 0.031
4.067LeuLys: 4.067 ± 0.031
8.492LeuLeu: 8.492 ± 0.052
1.848LeuMet: 1.848 ± 0.019
3.343LeuAsn: 3.343 ± 0.024
5.489LeuPro: 5.489 ± 0.03
3.899LeuGln: 3.899 ± 0.033
5.57LeuArg: 5.57 ± 0.033
7.687LeuSer: 7.687 ± 0.041
4.78LeuThr: 4.78 ± 0.031
5.485LeuVal: 5.485 ± 0.034
1.236LeuTrp: 1.236 ± 0.017
2.506LeuTyr: 2.506 ± 0.023
0.0LeuXaa: 0.0 ± 0.0
Met
2.199MetAla: 2.199 ± 0.019
0.263MetCys: 0.263 ± 0.007
1.315MetAsp: 1.315 ± 0.015
1.343MetGlu: 1.343 ± 0.016
0.777MetPhe: 0.777 ± 0.013
1.552MetGly: 1.552 ± 0.02
0.502MetHis: 0.502 ± 0.009
1.097MetIle: 1.097 ± 0.015
1.072MetLys: 1.072 ± 0.013
1.863MetLeu: 1.863 ± 0.019
0.61MetMet: 0.61 ± 0.011
0.891MetAsn: 0.891 ± 0.013
1.234MetPro: 1.234 ± 0.016
0.897MetGln: 0.897 ± 0.012
1.228MetArg: 1.228 ± 0.013
1.878MetSer: 1.878 ± 0.019
1.321MetThr: 1.321 ± 0.015
1.39MetVal: 1.39 ± 0.015
0.27MetTrp: 0.27 ± 0.007
0.547MetTyr: 0.547 ± 0.01
0.0MetXaa: 0.0 ± 0.0
Asn
3.176AsnAla: 3.176 ± 0.027
0.5AsnCys: 0.5 ± 0.009
2.075AsnAsp: 2.075 ± 0.02
2.135AsnGlu: 2.135 ± 0.019
1.411AsnPhe: 1.411 ± 0.015
3.023AsnGly: 3.023 ± 0.033
0.932AsnHis: 0.932 ± 0.012
2.256AsnIle: 2.256 ± 0.02
1.612AsnLys: 1.612 ± 0.019
3.367AsnLeu: 3.367 ± 0.026
0.895AsnMet: 0.895 ± 0.013
1.611AsnAsn: 1.611 ± 0.02
2.572AsnPro: 2.572 ± 0.022
1.468AsnGln: 1.468 ± 0.019
2.049AsnArg: 2.049 ± 0.025
2.889AsnSer: 2.889 ± 0.026
2.318AsnThr: 2.318 ± 0.021
2.411AsnVal: 2.411 ± 0.021
0.605AsnTrp: 0.605 ± 0.011
1.123AsnTyr: 1.123 ± 0.017
0.0AsnXaa: 0.0 ± 0.0
Pro
4.758ProAla: 4.758 ± 0.035
0.577ProCys: 0.577 ± 0.013
3.266ProAsp: 3.266 ± 0.021
3.911ProGlu: 3.911 ± 0.037
2.233ProPhe: 2.233 ± 0.023
3.824ProGly: 3.824 ± 0.035
1.337ProHis: 1.337 ± 0.016
2.616ProIle: 2.616 ± 0.02
2.55ProLys: 2.55 ± 0.026
4.721ProLeu: 4.721 ± 0.031
1.094ProMet: 1.094 ± 0.015
2.217ProAsn: 2.217 ± 0.022
4.587ProPro: 4.587 ± 0.06
2.392ProGln: 2.392 ± 0.032
3.273ProArg: 3.273 ± 0.029
6.16ProSer: 6.16 ± 0.052
3.742ProThr: 3.742 ± 0.026
3.615ProVal: 3.615 ± 0.023
0.799ProTrp: 0.799 ± 0.011
1.53ProTyr: 1.53 ± 0.018
0.0ProXaa: 0.0 ± 0.0
Gln
3.195GlnAla: 3.195 ± 0.025
0.473GlnCys: 0.473 ± 0.01
2.113GlnAsp: 2.113 ± 0.024
2.39GlnGlu: 2.39 ± 0.022
1.373GlnPhe: 1.373 ± 0.016
2.451GlnGly: 2.451 ± 0.024
1.065GlnHis: 1.065 ± 0.015
2.008GlnIle: 2.008 ± 0.02
2.087GlnLys: 2.087 ± 0.023
3.438GlnLeu: 3.438 ± 0.023
0.906GlnMet: 0.906 ± 0.012
1.741GlnAsn: 1.741 ± 0.018
2.531GlnPro: 2.531 ± 0.035
2.292GlnGln: 2.292 ± 0.045
2.566GlnArg: 2.566 ± 0.023
3.314GlnSer: 3.314 ± 0.029
2.406GlnThr: 2.406 ± 0.024
2.195GlnVal: 2.195 ± 0.021
0.6GlnTrp: 0.6 ± 0.01
1.198GlnTyr: 1.198 ± 0.016
0.0GlnXaa: 0.0 ± 0.0
Arg
4.224ArgAla: 4.224 ± 0.03
0.755ArgCys: 0.755 ± 0.013
3.29ArgAsp: 3.29 ± 0.03
3.628ArgGlu: 3.628 ± 0.031
2.274ArgPhe: 2.274 ± 0.02
3.615ArgGly: 3.615 ± 0.029
1.54ArgHis: 1.54 ± 0.018
2.963ArgIle: 2.963 ± 0.025
3.322ArgLys: 3.322 ± 0.03
5.483ArgLeu: 5.483 ± 0.037
1.324ArgMet: 1.324 ± 0.016
2.287ArgAsn: 2.287 ± 0.022
3.265ArgPro: 3.265 ± 0.031
2.543ArgGln: 2.543 ± 0.024
4.752ArgArg: 4.752 ± 0.042
4.671ArgSer: 4.671 ± 0.039
3.082ArgThr: 3.082 ± 0.024
3.415ArgVal: 3.415 ± 0.026
0.946ArgTrp: 0.946 ± 0.014
1.714ArgTyr: 1.714 ± 0.017
0.0ArgXaa: 0.0 ± 0.0
Ser
6.377SerAla: 6.377 ± 0.043
0.992SerCys: 0.992 ± 0.016
4.356SerAsp: 4.356 ± 0.031
4.254SerGlu: 4.254 ± 0.032
3.327SerPhe: 3.327 ± 0.027
5.558SerGly: 5.558 ± 0.042
2.083SerHis: 2.083 ± 0.022
4.332SerIle: 4.332 ± 0.027
3.813SerLys: 3.813 ± 0.031
7.557SerLeu: 7.557 ± 0.043
1.746SerMet: 1.746 ± 0.015
3.246SerAsn: 3.246 ± 0.029
5.474SerPro: 5.474 ± 0.046
3.424SerGln: 3.424 ± 0.029
4.982SerArg: 4.982 ± 0.048
8.985SerSer: 8.985 ± 0.067
5.542SerThr: 5.542 ± 0.036
4.723SerVal: 4.723 ± 0.033
1.213SerTrp: 1.213 ± 0.018
2.179SerTyr: 2.179 ± 0.023
0.0SerXaa: 0.0 ± 0.0
Thr
4.972ThrAla: 4.972 ± 0.029
0.763ThrCys: 0.763 ± 0.013
2.9ThrAsp: 2.9 ± 0.025
3.119ThrGlu: 3.119 ± 0.021
2.255ThrPhe: 2.255 ± 0.02
4.208ThrGly: 4.208 ± 0.034
1.278ThrHis: 1.278 ± 0.014
3.147ThrIle: 3.147 ± 0.027
2.551ThrLys: 2.551 ± 0.024
5.201ThrLeu: 5.201 ± 0.031
1.222ThrMet: 1.222 ± 0.015
2.132ThrAsn: 2.132 ± 0.023
4.205ThrPro: 4.205 ± 0.036
2.083ThrGln: 2.083 ± 0.02
2.994ThrArg: 2.994 ± 0.021
5.016ThrSer: 5.016 ± 0.033
3.942ThrThr: 3.942 ± 0.033
3.854ThrVal: 3.854 ± 0.026
0.856ThrTrp: 0.856 ± 0.014
1.596ThrTyr: 1.596 ± 0.016
0.0ThrXaa: 0.0 ± 0.0
Val
4.907ValAla: 4.907 ± 0.031
0.886ValCys: 0.886 ± 0.014
3.88ValAsp: 3.88 ± 0.028
3.783ValGlu: 3.783 ± 0.027
2.63ValPhe: 2.63 ± 0.025
4.037ValGly: 4.037 ± 0.029
1.456ValHis: 1.456 ± 0.018
3.142ValIle: 3.142 ± 0.028
2.856ValLys: 2.856 ± 0.024
5.596ValLeu: 5.596 ± 0.035
1.36ValMet: 1.36 ± 0.016
2.316ValAsn: 2.316 ± 0.021
3.483ValPro: 3.483 ± 0.024
2.389ValGln: 2.389 ± 0.02
3.334ValArg: 3.334 ± 0.027
4.862ValSer: 4.862 ± 0.029
3.443ValThr: 3.443 ± 0.025
4.221ValVal: 4.221 ± 0.035
0.891ValTrp: 0.891 ± 0.013
1.85ValTyr: 1.85 ± 0.019
0.0ValXaa: 0.0 ± 0.0
Trp
1.132TrpAla: 1.132 ± 0.014
0.208TrpCys: 0.208 ± 0.006
0.922TrpAsp: 0.922 ± 0.016
0.885TrpGlu: 0.885 ± 0.014
0.569TrpPhe: 0.569 ± 0.01
0.949TrpGly: 0.949 ± 0.015
0.356TrpHis: 0.356 ± 0.009
0.834TrpIle: 0.834 ± 0.013
0.864TrpLys: 0.864 ± 0.013
1.361TrpLeu: 1.361 ± 0.016
0.408TrpMet: 0.408 ± 0.008
0.674TrpAsn: 0.674 ± 0.012
0.605TrpPro: 0.605 ± 0.011
0.585TrpGln: 0.585 ± 0.011
0.975TrpArg: 0.975 ± 0.015
1.105TrpSer: 1.105 ± 0.013
0.941TrpThr: 0.941 ± 0.013
0.916TrpVal: 0.916 ± 0.014
0.282TrpTrp: 0.282 ± 0.007
0.449TrpTyr: 0.449 ± 0.01
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.182TyrAla: 2.182 ± 0.019
0.446TyrCys: 0.446 ± 0.009
1.679TyrAsp: 1.679 ± 0.021
1.597TyrGlu: 1.597 ± 0.015
1.276TyrPhe: 1.276 ± 0.018
2.152TyrGly: 2.152 ± 0.023
0.81TyrHis: 0.81 ± 0.011
1.541TyrIle: 1.541 ± 0.017
1.094TyrLys: 1.094 ± 0.015
2.8TyrLeu: 2.8 ± 0.026
0.656TyrMet: 0.656 ± 0.011
1.205TyrAsn: 1.205 ± 0.017
1.597TyrPro: 1.597 ± 0.019
1.148TyrGln: 1.148 ± 0.016
1.655TyrArg: 1.655 ± 0.016
2.187TyrSer: 2.187 ± 0.022
1.697TyrThr: 1.697 ± 0.018
1.651TyrVal: 1.651 ± 0.018
0.476TyrTrp: 0.476 ± 0.011
0.972TyrTyr: 0.972 ± 0.015
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 12389 proteins (5719169 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski