Amino acid dipeptide frequency for Aspergillus sydowii CBS 593.65

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
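As a minimal sketch of how such frequencies could be computed, the Python snippet below counts overlapping residue pairs pooled over a list of sequences and reports them in per mille. The function name dipeptide_permille, the pooling over all proteins, and the mapping of every non-standard letter to X (Xaa) are assumptions made for illustration; the page does not describe the exact procedure used by Proteome-pI.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all sequences.

    Hypothetical helper: pooling and the 'X' mapping are assumptions.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to 'X' (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs: positions (0,1), (1,2), ...
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input; 'B' is not a standard residue and is counted as X.
freqs = dipeptide_permille(["MKAAL", "AAB"])
```

Dipeptides never cross protein boundaries here, so a protein of length n contributes n - 1 pairs.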

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally likely, each dipeptide would have a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
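The uniform expectation can be checked with a few lines of Python. This is only a sketch: it assumes the baseline is the flat 1/400 value stated above, and the classify helper and its labels are hypothetical. The page does not say exactly which threshold drives the red and blue marking.

```python
N_STANDARD = 20

# Uniform expectation over 20 x 20 dipeptides, in per mille (0.25% == 2.5 per mille).
expected_permille = 1000.0 / (N_STANDARD ** 2)

# The same expectation if Xaa is counted as a 21st residue (441 dipeptides).
expected_with_xaa = 1000.0 / (21 ** 2)

def classify(observed_permille, expected=expected_permille):
    """Label a value relative to the uniform expectation (assumed baseline)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"

# Three values taken from the table on this page.
examples = {"AlaAla": 8.571, "CysCys": 0.295, "LeuLeu": 8.713}
labels = {pair: classify(value) for pair, value in examples.items()}
```

With these inputs, AlaAla and LeuLeu lie above the 2.5‰ baseline and CysCys lies below it.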

All values are given in per mille (‰), so they must be multiplied by 10^-3 to obtain fractions.
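For illustration, the unit conversion looks like this in Python. The helper names are hypothetical, and the AlaAla value is taken from the table on this page.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0

ala_ala_fraction = permille_to_fraction(8.571)
ala_ala_percent = permille_to_percent(8.571)
```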

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
8.571AlaAla: 8.571 ± 0.048
1.19AlaCys: 1.19 ± 0.018
4.221AlaAsp: 4.221 ± 0.027
4.953AlaGlu: 4.953 ± 0.032
3.243AlaPhe: 3.243 ± 0.025
5.937AlaGly: 5.937 ± 0.04
1.778AlaHis: 1.778 ± 0.015
4.383AlaIle: 4.383 ± 0.026
3.565AlaLys: 3.565 ± 0.026
7.867AlaLeu: 7.867 ± 0.045
1.958AlaMet: 1.958 ± 0.018
2.859AlaAsn: 2.859 ± 0.02
4.469AlaPro: 4.469 ± 0.035
3.26AlaGln: 3.26 ± 0.025
4.864AlaArg: 4.864 ± 0.029
6.957AlaSer: 6.957 ± 0.041
5.04AlaThr: 5.04 ± 0.032
5.579AlaVal: 5.579 ± 0.037
1.208AlaTrp: 1.208 ± 0.015
2.221AlaTyr: 2.221 ± 0.022
0.003AlaXaa: 0.003 ± 0.001
Cys
1.052CysAla: 1.052 ± 0.013
0.295CysCys: 0.295 ± 0.008
0.739CysAsp: 0.739 ± 0.011
0.651CysGlu: 0.651 ± 0.012
0.607CysPhe: 0.607 ± 0.009
0.987CysGly: 0.987 ± 0.015
0.37CysHis: 0.37 ± 0.008
0.791CysIle: 0.791 ± 0.013
0.496CysLys: 0.496 ± 0.008
1.414CysLeu: 1.414 ± 0.018
0.308CysMet: 0.308 ± 0.007
0.462CysAsn: 0.462 ± 0.01
0.705CysPro: 0.705 ± 0.013
0.493CysGln: 0.493 ± 0.009
0.885CysArg: 0.885 ± 0.012
1.043CysSer: 1.043 ± 0.013
0.782CysThr: 0.782 ± 0.012
0.862CysVal: 0.862 ± 0.012
0.225CysTrp: 0.225 ± 0.006
0.378CysTyr: 0.378 ± 0.008
0.001CysXaa: 0.001 ± 0.0
Asp
4.581AspAla: 4.581 ± 0.03
0.66AspCys: 0.66 ± 0.011
3.873AspAsp: 3.873 ± 0.04
4.166AspGlu: 4.166 ± 0.036
2.14AspPhe: 2.14 ± 0.019
4.1AspGly: 4.1 ± 0.026
1.245AspHis: 1.245 ± 0.015
3.252AspIle: 3.252 ± 0.024
2.225AspLys: 2.225 ± 0.02
5.023AspLeu: 5.023 ± 0.034
1.248AspMet: 1.248 ± 0.014
1.957AspAsn: 1.957 ± 0.019
3.332AspPro: 3.332 ± 0.024
1.879AspGln: 1.879 ± 0.018
3.119AspArg: 3.119 ± 0.028
4.238AspSer: 4.238 ± 0.027
3.073AspThr: 3.073 ± 0.024
3.627AspVal: 3.627 ± 0.025
0.934AspTrp: 0.934 ± 0.013
1.672AspTyr: 1.672 ± 0.017
0.002AspXaa: 0.002 ± 0.0
Glu
4.983GluAla: 4.983 ± 0.035
0.687GluCys: 0.687 ± 0.012
3.886GluAsp: 3.886 ± 0.034
4.973GluGlu: 4.973 ± 0.042
2.017GluPhe: 2.017 ± 0.019
3.719GluGly: 3.719 ± 0.029
1.351GluHis: 1.351 ± 0.015
3.085GluIle: 3.085 ± 0.023
3.361GluLys: 3.361 ± 0.029
5.12GluLeu: 5.12 ± 0.032
1.385GluMet: 1.385 ± 0.013
2.34GluAsn: 2.34 ± 0.023
2.797GluPro: 2.797 ± 0.025
2.433GluGln: 2.433 ± 0.019
3.793GluArg: 3.793 ± 0.028
4.352GluSer: 4.352 ± 0.028
3.58GluThr: 3.58 ± 0.025
3.391GluVal: 3.391 ± 0.025
0.895GluTrp: 0.895 ± 0.011
1.796GluTyr: 1.796 ± 0.017
0.003GluXaa: 0.003 ± 0.001
Phe
3.133PheAla: 3.133 ± 0.025
0.635PheCys: 0.635 ± 0.011
2.29PheAsp: 2.29 ± 0.019
2.107PheGlu: 2.107 ± 0.019
1.756PhePhe: 1.756 ± 0.019
2.972PheGly: 2.972 ± 0.028
0.955PheHis: 0.955 ± 0.012
1.943PheIle: 1.943 ± 0.02
1.394PheLys: 1.394 ± 0.016
3.753PheLeu: 3.753 ± 0.029
0.808PheMet: 0.808 ± 0.012
1.471PheAsn: 1.471 ± 0.016
2.134PhePro: 2.134 ± 0.019
1.437PheGln: 1.437 ± 0.013
2.036PheArg: 2.036 ± 0.018
3.098PheSer: 3.098 ± 0.023
2.231PheThr: 2.231 ± 0.019
2.464PheVal: 2.464 ± 0.022
0.715PheTrp: 0.715 ± 0.009
1.189PheTyr: 1.189 ± 0.016
0.001PheXaa: 0.001 ± 0.001
Gly
5.387GlyAla: 5.387 ± 0.035
1.005GlyCys: 1.005 ± 0.015
3.723GlyAsp: 3.723 ± 0.022
3.689GlyGlu: 3.689 ± 0.031
2.966GlyPhe: 2.966 ± 0.026
5.62GlyGly: 5.62 ± 0.051
1.69GlyHis: 1.69 ± 0.018
3.84GlyIle: 3.84 ± 0.026
3.245GlyLys: 3.245 ± 0.023
6.477GlyLeu: 6.477 ± 0.04
1.589GlyMet: 1.589 ± 0.018
2.615GlyAsn: 2.615 ± 0.023
3.453GlyPro: 3.453 ± 0.029
2.573GlyGln: 2.573 ± 0.023
4.142GlyArg: 4.142 ± 0.028
5.738GlySer: 5.738 ± 0.034
4.006GlyThr: 4.006 ± 0.027
4.706GlyVal: 4.706 ± 0.03
1.245GlyTrp: 1.245 ± 0.016
2.268GlyTyr: 2.268 ± 0.021
0.002GlyXaa: 0.002 ± 0.0
His
1.832HisAla: 1.832 ± 0.018
0.36HisCys: 0.36 ± 0.007
1.311HisAsp: 1.311 ± 0.014
1.304HisGlu: 1.304 ± 0.015
0.932HisPhe: 0.932 ± 0.013
1.764HisGly: 1.764 ± 0.015
0.817HisHis: 0.817 ± 0.013
1.261HisIle: 1.261 ± 0.013
0.828HisLys: 0.828 ± 0.013
2.326HisLeu: 2.326 ± 0.022
0.473HisMet: 0.473 ± 0.009
0.849HisAsn: 0.849 ± 0.012
1.678HisPro: 1.678 ± 0.016
0.959HisGln: 0.959 ± 0.012
1.607HisArg: 1.607 ± 0.018
1.909HisSer: 1.909 ± 0.018
1.272HisThr: 1.272 ± 0.015
1.403HisVal: 1.403 ± 0.014
0.396HisTrp: 0.396 ± 0.008
0.734HisTyr: 0.734 ± 0.011
0.001HisXaa: 0.001 ± 0.0
Ile
4.339IleAla: 4.339 ± 0.031
0.84IleCys: 0.84 ± 0.011
2.824IleAsp: 2.824 ± 0.023
2.799IleGlu: 2.799 ± 0.023
2.118IlePhe: 2.118 ± 0.02
3.357IleGly: 3.357 ± 0.031
1.293IleHis: 1.293 ± 0.015
2.632IleIle: 2.632 ± 0.024
1.976IleLys: 1.976 ± 0.021
4.858IleLeu: 4.858 ± 0.033
1.029IleMet: 1.029 ± 0.014
1.826IleAsn: 1.826 ± 0.019
3.281IlePro: 3.281 ± 0.023
2.041IleGln: 2.041 ± 0.018
2.866IleArg: 2.866 ± 0.023
4.034IleSer: 4.034 ± 0.025
2.917IleThr: 2.917 ± 0.023
3.277IleVal: 3.277 ± 0.021
0.762IleTrp: 0.762 ± 0.011
1.545IleTyr: 1.545 ± 0.015
0.002IleXaa: 0.002 ± 0.001
Lys
3.76LysAla: 3.76 ± 0.029
0.524LysCys: 0.524 ± 0.01
2.563LysAsp: 2.563 ± 0.021
3.05LysGlu: 3.05 ± 0.028
1.356LysPhe: 1.356 ± 0.015
2.812LysGly: 2.812 ± 0.023
1.057LysHis: 1.057 ± 0.015
2.107LysIle: 2.107 ± 0.019
2.762LysLys: 2.762 ± 0.031
3.802LysLeu: 3.802 ± 0.028
0.887LysMet: 0.887 ± 0.014
1.558LysAsn: 1.558 ± 0.018
2.545LysPro: 2.545 ± 0.026
1.739LysGln: 1.739 ± 0.018
3.168LysArg: 3.168 ± 0.028
3.241LysSer: 3.241 ± 0.028
2.579LysThr: 2.579 ± 0.023
2.562LysVal: 2.562 ± 0.021
0.652LysTrp: 0.652 ± 0.01
1.362LysTyr: 1.362 ± 0.017
0.001LysXaa: 0.001 ± 0.0
Leu
7.896LeuAla: 7.896 ± 0.039
1.301LeuCys: 1.301 ± 0.014
5.345LeuAsp: 5.345 ± 0.034
5.391LeuGlu: 5.391 ± 0.04
3.609LeuPhe: 3.609 ± 0.027
6.312LeuGly: 6.312 ± 0.035
2.342LeuHis: 2.342 ± 0.022
4.106LeuIle: 4.106 ± 0.03
3.869LeuLys: 3.869 ± 0.031
8.713LeuLeu: 8.713 ± 0.053
1.813LeuMet: 1.813 ± 0.015
3.273LeuAsn: 3.273 ± 0.026
5.548LeuPro: 5.548 ± 0.033
3.886LeuGln: 3.886 ± 0.029
5.802LeuArg: 5.802 ± 0.033
7.596LeuSer: 7.596 ± 0.043
4.887LeuThr: 4.887 ± 0.031
5.694LeuVal: 5.694 ± 0.03
1.299LeuTrp: 1.299 ± 0.015
2.605LeuTyr: 2.605 ± 0.022
0.005LeuXaa: 0.005 ± 0.001
Met
2.15MetAla: 2.15 ± 0.019
0.255MetCys: 0.255 ± 0.007
1.231MetAsp: 1.231 ± 0.014
1.242MetGlu: 1.242 ± 0.016
0.74MetPhe: 0.74 ± 0.011
1.514MetGly: 1.514 ± 0.016
0.491MetHis: 0.491 ± 0.01
1.0MetIle: 1.0 ± 0.013
0.941MetLys: 0.941 ± 0.013
1.833MetLeu: 1.833 ± 0.018
0.535MetMet: 0.535 ± 0.01
0.784MetAsn: 0.784 ± 0.011
1.199MetPro: 1.199 ± 0.014
0.845MetGln: 0.845 ± 0.011
1.232MetArg: 1.232 ± 0.014
1.799MetSer: 1.799 ± 0.016
1.268MetThr: 1.268 ± 0.014
1.375MetVal: 1.375 ± 0.014
0.273MetTrp: 0.273 ± 0.006
0.535MetTyr: 0.535 ± 0.01
0.001MetXaa: 0.001 ± 0.0
Asn
3.092AsnAla: 3.092 ± 0.024
0.498AsnCys: 0.498 ± 0.009
1.944AsnAsp: 1.944 ± 0.02
2.022AsnGlu: 2.022 ± 0.022
1.346AsnPhe: 1.346 ± 0.015
2.929AsnGly: 2.929 ± 0.023
0.856AsnHis: 0.856 ± 0.012
2.129AsnIle: 2.129 ± 0.019
1.477AsnLys: 1.477 ± 0.016
3.254AsnLeu: 3.254 ± 0.026
0.818AsnMet: 0.818 ± 0.012
1.514AsnAsn: 1.514 ± 0.02
2.542AsnPro: 2.542 ± 0.022
1.365AsnGln: 1.365 ± 0.017
2.041AsnArg: 2.041 ± 0.02
2.751AsnSer: 2.751 ± 0.025
2.249AsnThr: 2.249 ± 0.019
2.311AsnVal: 2.311 ± 0.02
0.591AsnTrp: 0.591 ± 0.011
1.102AsnTyr: 1.102 ± 0.013
0.001AsnXaa: 0.001 ± 0.0
Pro
4.994ProAla: 4.994 ± 0.039
0.629ProCys: 0.629 ± 0.011
3.347ProAsp: 3.347 ± 0.025
3.927ProGlu: 3.927 ± 0.03
2.153ProPhe: 2.153 ± 0.021
4.202ProGly: 4.202 ± 0.034
1.299ProHis: 1.299 ± 0.014
2.547ProIle: 2.547 ± 0.025
2.414ProLys: 2.414 ± 0.024
4.775ProLeu: 4.775 ± 0.028
1.033ProMet: 1.033 ± 0.014
2.226ProAsn: 2.226 ± 0.021
4.597ProPro: 4.597 ± 0.067
2.39ProGln: 2.39 ± 0.026
3.442ProArg: 3.442 ± 0.03
5.948ProSer: 5.948 ± 0.045
3.866ProThr: 3.866 ± 0.033
3.664ProVal: 3.664 ± 0.027
0.813ProTrp: 0.813 ± 0.011
1.562ProTyr: 1.562 ± 0.015
0.002ProXaa: 0.002 ± 0.0
Gln
3.277GlnAla: 3.277 ± 0.023
0.496GlnCys: 0.496 ± 0.008
2.054GlnAsp: 2.054 ± 0.02
2.326GlnGlu: 2.326 ± 0.02
1.32GlnPhe: 1.32 ± 0.014
2.485GlnGly: 2.485 ± 0.019
1.03GlnHis: 1.03 ± 0.013
1.899GlnIle: 1.899 ± 0.016
1.855GlnLys: 1.855 ± 0.018
3.491GlnLeu: 3.491 ± 0.027
0.856GlnMet: 0.856 ± 0.012
1.578GlnAsn: 1.578 ± 0.016
2.512GlnPro: 2.512 ± 0.032
2.212GlnGln: 2.212 ± 0.041
2.643GlnArg: 2.643 ± 0.02
3.185GlnSer: 3.185 ± 0.028
2.389GlnThr: 2.389 ± 0.02
2.208GlnVal: 2.208 ± 0.02
0.591GlnTrp: 0.591 ± 0.01
1.19GlnTyr: 1.19 ± 0.015
0.002GlnXaa: 0.002 ± 0.001
Arg
4.679ArgAla: 4.679 ± 0.029
0.814ArgCys: 0.814 ± 0.012
3.4ArgAsp: 3.4 ± 0.033
3.792ArgGlu: 3.792 ± 0.028
2.259ArgPhe: 2.259 ± 0.019
3.758ArgGly: 3.758 ± 0.029
1.572ArgHis: 1.572 ± 0.016
2.973ArgIle: 2.973 ± 0.021
3.252ArgLys: 3.252 ± 0.025
5.637ArgLeu: 5.637 ± 0.035
1.279ArgMet: 1.279 ± 0.012
2.304ArgAsn: 2.304 ± 0.021
3.469ArgPro: 3.469 ± 0.029
2.516ArgGln: 2.516 ± 0.023
5.091ArgArg: 5.091 ± 0.038
4.754ArgSer: 4.754 ± 0.036
3.327ArgThr: 3.327 ± 0.024
3.578ArgVal: 3.578 ± 0.024
1.009ArgTrp: 1.009 ± 0.013
1.782ArgTyr: 1.782 ± 0.015
0.003ArgXaa: 0.003 ± 0.001
Ser
6.546SerAla: 6.546 ± 0.037
0.991SerCys: 0.991 ± 0.013
4.278SerAsp: 4.278 ± 0.028
4.199SerGlu: 4.199 ± 0.029
3.164SerPhe: 3.164 ± 0.024
5.698SerGly: 5.698 ± 0.031
1.953SerHis: 1.953 ± 0.018
4.104SerIle: 4.104 ± 0.027
3.44SerLys: 3.44 ± 0.024
7.477SerLeu: 7.477 ± 0.043
1.657SerMet: 1.657 ± 0.016
3.014SerAsn: 3.014 ± 0.022
5.428SerPro: 5.428 ± 0.05
3.316SerGln: 3.316 ± 0.029
5.019SerArg: 5.019 ± 0.038
8.54SerSer: 8.54 ± 0.073
5.5SerThr: 5.5 ± 0.037
4.74SerVal: 4.74 ± 0.03
1.211SerTrp: 1.211 ± 0.012
2.153SerTyr: 2.153 ± 0.021
0.005SerXaa: 0.005 ± 0.001
Thr
5.215ThrAla: 5.215 ± 0.03
0.785ThrCys: 0.785 ± 0.012
2.971ThrAsp: 2.971 ± 0.024
3.167ThrGlu: 3.167 ± 0.025
2.254ThrPhe: 2.254 ± 0.02
4.393ThrGly: 4.393 ± 0.028
1.304ThrHis: 1.304 ± 0.016
3.053ThrIle: 3.053 ± 0.022
2.394ThrLys: 2.394 ± 0.02
5.29ThrLeu: 5.29 ± 0.029
1.184ThrMet: 1.184 ± 0.012
2.076ThrAsn: 2.076 ± 0.02
4.244ThrPro: 4.244 ± 0.038
2.103ThrGln: 2.103 ± 0.023
3.15ThrArg: 3.15 ± 0.022
5.06ThrSer: 5.06 ± 0.036
4.148ThrThr: 4.148 ± 0.037
3.952ThrVal: 3.952 ± 0.027
0.915ThrTrp: 0.915 ± 0.011
1.624ThrTyr: 1.624 ± 0.017
0.002ThrXaa: 0.002 ± 0.0
Val
5.201ValAla: 5.201 ± 0.033
0.901ValCys: 0.901 ± 0.013
3.76ValAsp: 3.76 ± 0.025
3.733ValGlu: 3.733 ± 0.025
2.655ValPhe: 2.655 ± 0.02
4.215ValGly: 4.215 ± 0.029
1.446ValHis: 1.446 ± 0.017
3.111ValIle: 3.111 ± 0.026
2.701ValLys: 2.701 ± 0.021
5.901ValLeu: 5.901 ± 0.035
1.326ValMet: 1.326 ± 0.015
2.293ValAsn: 2.293 ± 0.023
3.593ValPro: 3.593 ± 0.027
2.442ValGln: 2.442 ± 0.021
3.546ValArg: 3.546 ± 0.026
4.906ValSer: 4.906 ± 0.031
3.462ValThr: 3.462 ± 0.027
4.41ValVal: 4.41 ± 0.033
0.916ValTrp: 0.916 ± 0.012
1.891ValTyr: 1.891 ± 0.02
0.003ValXaa: 0.003 ± 0.001
Trp
1.195TrpAla: 1.195 ± 0.015
0.22TrpCys: 0.22 ± 0.006
0.932TrpAsp: 0.932 ± 0.015
0.884TrpGlu: 0.884 ± 0.014
0.588TrpPhe: 0.588 ± 0.01
0.992TrpGly: 0.992 ± 0.012
0.371TrpHis: 0.371 ± 0.008
0.822TrpIle: 0.822 ± 0.011
0.823TrpLys: 0.823 ± 0.012
1.453TrpLeu: 1.453 ± 0.017
0.399TrpMet: 0.399 ± 0.008
0.654TrpAsn: 0.654 ± 0.01
0.671TrpPro: 0.671 ± 0.01
0.572TrpGln: 0.572 ± 0.01
1.049TrpArg: 1.049 ± 0.012
1.116TrpSer: 1.116 ± 0.016
0.973TrpThr: 0.973 ± 0.014
0.935TrpVal: 0.935 ± 0.012
0.291TrpTrp: 0.291 ± 0.007
0.486TrpTyr: 0.486 ± 0.01
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.244TyrAla: 2.244 ± 0.019
0.458TyrCys: 0.458 ± 0.008
1.654TyrAsp: 1.654 ± 0.016
1.569TyrGlu: 1.569 ± 0.017
1.294TyrPhe: 1.294 ± 0.015
2.21TyrGly: 2.21 ± 0.021
0.797TyrHis: 0.797 ± 0.012
1.575TyrIle: 1.575 ± 0.017
1.07TyrLys: 1.07 ± 0.013
2.821TyrLeu: 2.821 ± 0.022
0.655TyrMet: 0.655 ± 0.012
1.179TyrAsn: 1.179 ± 0.014
1.63TyrPro: 1.63 ± 0.017
1.132TyrGln: 1.132 ± 0.017
1.751TyrArg: 1.751 ± 0.017
2.166TyrSer: 2.166 ± 0.02
1.734TyrThr: 1.734 ± 0.017
1.672TyrVal: 1.672 ± 0.016
0.486TyrTrp: 0.486 ± 0.01
1.007TyrTyr: 1.007 ± 0.014
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.0
0.001XaaCys: 0.001 ± 0.0
0.002XaaAsp: 0.002 ± 0.0
0.002XaaGlu: 0.002 ± 0.001
0.003XaaPhe: 0.003 ± 0.001
0.003XaaGly: 0.003 ± 0.001
0.001XaaHis: 0.001 ± 0.0
0.003XaaIle: 0.003 ± 0.001
0.001XaaLys: 0.001 ± 0.0
0.004XaaLeu: 0.004 ± 0.001
0.001XaaMet: 0.001 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.002XaaPro: 0.002 ± 0.001
0.002XaaGln: 0.002 ± 0.001
0.003XaaArg: 0.003 ± 0.001
0.003XaaSer: 0.003 ± 0.001
0.002XaaThr: 0.002 ± 0.001
0.003XaaVal: 0.003 ± 0.001
0.0XaaTrp: 0.0 ± 0.0
0.002XaaTyr: 0.002 ± 0.001
0.077XaaXaa: 0.077 ± 0.019
Statistics based on 13568 proteins (6311667 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
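A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the replicates is taken as the error. The function names, the fixed seed, and the use of the standard deviation of the replicates as the reported error are assumptions; the page states only that bootstrapping (x100) was done at the protein level.

```python
import random
from collections import Counter
from statistics import stdev

def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the error (per mille) of one dipeptide frequency.

    Hypothetical helper: resamples whole proteins with replacement and
    returns the standard deviation of the replicate frequencies.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)

# Toy proteome, for illustration only.
err = bootstrap_error(["MKAAL", "AAGAA", "MLLK"], "AA")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so between-protein variation is reflected in the estimate.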

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski