Amino acid dipepetide frequency for Exophiala sideris

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.704AlaAla: 8.704 ± 0.052
1.049AlaCys: 1.049 ± 0.017
4.444AlaAsp: 4.444 ± 0.031
5.182AlaGlu: 5.182 ± 0.04
3.118AlaPhe: 3.118 ± 0.027
5.855AlaGly: 5.855 ± 0.04
1.798AlaHis: 1.798 ± 0.02
4.204AlaIle: 4.204 ± 0.029
4.158AlaLys: 4.158 ± 0.033
7.628AlaLeu: 7.628 ± 0.044
2.046AlaMet: 2.046 ± 0.021
3.029AlaAsn: 3.029 ± 0.023
4.57AlaPro: 4.57 ± 0.044
3.672AlaGln: 3.672 ± 0.03
5.045AlaArg: 5.045 ± 0.033
7.165AlaSer: 7.165 ± 0.056
5.495AlaThr: 5.495 ± 0.034
5.327AlaVal: 5.327 ± 0.035
1.149AlaTrp: 1.149 ± 0.015
2.205AlaTyr: 2.205 ± 0.022
0.0AlaXaa: 0.0 ± 0.0
Cys
0.936CysAla: 0.936 ± 0.016
0.229CysCys: 0.229 ± 0.008
0.637CysAsp: 0.637 ± 0.012
0.574CysGlu: 0.574 ± 0.01
0.529CysPhe: 0.529 ± 0.009
0.872CysGly: 0.872 ± 0.016
0.323CysHis: 0.323 ± 0.009
0.682CysIle: 0.682 ± 0.012
0.496CysLys: 0.496 ± 0.01
1.24CysLeu: 1.24 ± 0.017
0.273CysMet: 0.273 ± 0.008
0.4CysAsn: 0.4 ± 0.009
0.608CysPro: 0.608 ± 0.011
0.453CysGln: 0.453 ± 0.009
0.758CysArg: 0.758 ± 0.015
0.858CysSer: 0.858 ± 0.014
0.662CysThr: 0.662 ± 0.01
0.767CysVal: 0.767 ± 0.013
0.197CysTrp: 0.197 ± 0.006
0.34CysTyr: 0.34 ± 0.008
0.0CysXaa: 0.0 ± 0.0
Asp
4.765AspAla: 4.765 ± 0.034
0.598AspCys: 0.598 ± 0.013
4.404AspAsp: 4.404 ± 0.046
4.629AspGlu: 4.629 ± 0.042
2.344AspPhe: 2.344 ± 0.023
3.949AspGly: 3.949 ± 0.034
1.33AspHis: 1.33 ± 0.017
3.03AspIle: 3.03 ± 0.028
2.499AspLys: 2.499 ± 0.024
5.225AspLeu: 5.225 ± 0.033
1.327AspMet: 1.327 ± 0.015
1.934AspAsn: 1.934 ± 0.019
3.349AspPro: 3.349 ± 0.028
2.047AspGln: 2.047 ± 0.021
3.101AspArg: 3.101 ± 0.034
4.062AspSer: 4.062 ± 0.031
3.139AspThr: 3.139 ± 0.023
3.854AspVal: 3.854 ± 0.028
0.898AspTrp: 0.898 ± 0.014
1.608AspTyr: 1.608 ± 0.018
0.0AspXaa: 0.0 ± 0.0
Glu
5.326GluAla: 5.326 ± 0.038
0.608GluCys: 0.608 ± 0.013
4.357GluAsp: 4.357 ± 0.046
5.263GluGlu: 5.263 ± 0.054
1.825GluPhe: 1.825 ± 0.021
3.62GluGly: 3.62 ± 0.028
1.508GluHis: 1.508 ± 0.017
2.989GluIle: 2.989 ± 0.026
3.673GluLys: 3.673 ± 0.036
5.006GluLeu: 5.006 ± 0.042
1.474GluMet: 1.474 ± 0.018
2.197GluAsn: 2.197 ± 0.02
2.773GluPro: 2.773 ± 0.029
2.697GluGln: 2.697 ± 0.023
3.897GluArg: 3.897 ± 0.034
4.266GluSer: 4.266 ± 0.034
3.527GluThr: 3.527 ± 0.031
3.706GluVal: 3.706 ± 0.029
0.855GluTrp: 0.855 ± 0.013
1.643GluTyr: 1.643 ± 0.018
0.0GluXaa: 0.0 ± 0.0
Phe
3.044PheAla: 3.044 ± 0.027
0.556PheCys: 0.556 ± 0.011
2.352PheAsp: 2.352 ± 0.018
2.211PheGlu: 2.211 ± 0.022
1.565PhePhe: 1.565 ± 0.018
2.842PheGly: 2.842 ± 0.03
0.884PheHis: 0.884 ± 0.013
1.685PheIle: 1.685 ± 0.023
1.448PheLys: 1.448 ± 0.019
3.386PheLeu: 3.386 ± 0.034
0.793PheMet: 0.793 ± 0.011
1.392PheAsn: 1.392 ± 0.017
1.878PhePro: 1.878 ± 0.018
1.42PheGln: 1.42 ± 0.016
1.932PheArg: 1.932 ± 0.019
2.897PheSer: 2.897 ± 0.028
2.098PheThr: 2.098 ± 0.019
2.356PheVal: 2.356 ± 0.022
0.619PheTrp: 0.619 ± 0.012
1.063PheTyr: 1.063 ± 0.015
0.0PheXaa: 0.0 ± 0.0
Gly
5.168GlyAla: 5.168 ± 0.037
0.778GlyCys: 0.778 ± 0.014
3.53GlyAsp: 3.53 ± 0.03
3.494GlyGlu: 3.494 ± 0.027
2.747GlyPhe: 2.747 ± 0.024
5.518GlyGly: 5.518 ± 0.053
1.747GlyHis: 1.747 ± 0.02
3.432GlyIle: 3.432 ± 0.03
3.43GlyLys: 3.43 ± 0.029
6.063GlyLeu: 6.063 ± 0.041
1.673GlyMet: 1.673 ± 0.019
2.42GlyAsn: 2.42 ± 0.021
3.307GlyPro: 3.307 ± 0.03
2.796GlyGln: 2.796 ± 0.031
4.108GlyArg: 4.108 ± 0.031
5.47GlySer: 5.47 ± 0.035
4.07GlyThr: 4.07 ± 0.031
4.288GlyVal: 4.288 ± 0.035
1.104GlyTrp: 1.104 ± 0.015
2.142GlyTyr: 2.142 ± 0.023
0.0GlyXaa: 0.0 ± 0.0
His
1.917HisAla: 1.917 ± 0.021
0.322HisCys: 0.322 ± 0.008
1.483HisAsp: 1.483 ± 0.017
1.44HisGlu: 1.44 ± 0.019
0.951HisPhe: 0.951 ± 0.015
1.765HisGly: 1.765 ± 0.02
0.866HisHis: 0.866 ± 0.014
1.223HisIle: 1.223 ± 0.016
0.988HisLys: 0.988 ± 0.014
2.256HisLeu: 2.256 ± 0.021
0.524HisMet: 0.524 ± 0.01
0.895HisAsn: 0.895 ± 0.012
1.632HisPro: 1.632 ± 0.018
1.037HisGln: 1.037 ± 0.015
1.487HisArg: 1.487 ± 0.018
1.811HisSer: 1.811 ± 0.019
1.318HisThr: 1.318 ± 0.017
1.504HisVal: 1.504 ± 0.017
0.348HisTrp: 0.348 ± 0.008
0.702HisTyr: 0.702 ± 0.012
0.0HisXaa: 0.0 ± 0.0
Ile
4.183IleAla: 4.183 ± 0.031
0.726IleCys: 0.726 ± 0.012
2.887IleAsp: 2.887 ± 0.024
2.863IleGlu: 2.863 ± 0.025
1.914IlePhe: 1.914 ± 0.023
3.166IleGly: 3.166 ± 0.029
1.188IleHis: 1.188 ± 0.016
2.389IleIle: 2.389 ± 0.027
2.143IleLys: 2.143 ± 0.023
4.531IleLeu: 4.531 ± 0.036
1.009IleMet: 1.009 ± 0.014
1.784IleAsn: 1.784 ± 0.018
3.01IlePro: 3.01 ± 0.025
1.876IleGln: 1.876 ± 0.019
2.705IleArg: 2.705 ± 0.018
3.832IleSer: 3.832 ± 0.027
2.832IleThr: 2.832 ± 0.027
3.151IleVal: 3.151 ± 0.025
0.71IleTrp: 0.71 ± 0.012
1.386IleTyr: 1.386 ± 0.017
0.0IleXaa: 0.0 ± 0.0
Lys
4.378LysAla: 4.378 ± 0.035
0.514LysCys: 0.514 ± 0.01
2.964LysAsp: 2.964 ± 0.027
3.377LysGlu: 3.377 ± 0.038
1.45LysPhe: 1.45 ± 0.017
2.934LysGly: 2.934 ± 0.026
1.221LysHis: 1.221 ± 0.015
2.263LysIle: 2.263 ± 0.019
3.267LysLys: 3.267 ± 0.053
3.995LysLeu: 3.995 ± 0.029
1.058LysMet: 1.058 ± 0.016
1.696LysAsn: 1.696 ± 0.018
2.779LysPro: 2.779 ± 0.029
1.988LysGln: 1.988 ± 0.021
3.415LysArg: 3.415 ± 0.032
3.706LysSer: 3.706 ± 0.035
2.91LysThr: 2.91 ± 0.026
2.962LysVal: 2.962 ± 0.026
0.674LysTrp: 0.674 ± 0.012
1.4LysTyr: 1.4 ± 0.019
0.0LysXaa: 0.0 ± 0.0
Leu
7.699LeuAla: 7.699 ± 0.044
1.204LeuCys: 1.204 ± 0.015
5.257LeuAsp: 5.257 ± 0.033
5.413LeuGlu: 5.413 ± 0.045
3.231LeuPhe: 3.231 ± 0.027
5.761LeuGly: 5.761 ± 0.036
2.221LeuHis: 2.221 ± 0.021
3.875LeuIle: 3.875 ± 0.034
4.264LeuLys: 4.264 ± 0.035
8.203LeuLeu: 8.203 ± 0.056
1.794LeuMet: 1.794 ± 0.018
3.183LeuAsn: 3.183 ± 0.028
5.427LeuPro: 5.427 ± 0.035
3.945LeuGln: 3.945 ± 0.032
5.643LeuArg: 5.643 ± 0.038
7.221LeuSer: 7.221 ± 0.045
4.934LeuThr: 4.934 ± 0.033
5.281LeuVal: 5.281 ± 0.038
1.175LeuTrp: 1.175 ± 0.017
2.312LeuTyr: 2.312 ± 0.022
0.0LeuXaa: 0.0 ± 0.0
Met
2.279MetAla: 2.279 ± 0.021
0.237MetCys: 0.237 ± 0.008
1.273MetAsp: 1.273 ± 0.014
1.206MetGlu: 1.206 ± 0.016
0.769MetPhe: 0.769 ± 0.013
1.479MetGly: 1.479 ± 0.019
0.53MetHis: 0.53 ± 0.011
1.021MetIle: 1.021 ± 0.011
0.999MetLys: 0.999 ± 0.014
1.944MetLeu: 1.944 ± 0.019
0.566MetMet: 0.566 ± 0.011
0.827MetAsn: 0.827 ± 0.014
1.384MetPro: 1.384 ± 0.017
0.944MetGln: 0.944 ± 0.015
1.227MetArg: 1.227 ± 0.015
2.051MetSer: 2.051 ± 0.017
1.405MetThr: 1.405 ± 0.017
1.318MetVal: 1.318 ± 0.017
0.255MetTrp: 0.255 ± 0.008
0.549MetTyr: 0.549 ± 0.01
0.0MetXaa: 0.0 ± 0.0
Asn
3.215AsnAla: 3.215 ± 0.027
0.401AsnCys: 0.401 ± 0.008
2.071AsnAsp: 2.071 ± 0.019
2.046AsnGlu: 2.046 ± 0.022
1.347AsnPhe: 1.347 ± 0.018
3.035AsnGly: 3.035 ± 0.031
0.887AsnHis: 0.887 ± 0.013
2.014AsnIle: 2.014 ± 0.022
1.594AsnLys: 1.594 ± 0.019
3.234AsnLeu: 3.234 ± 0.024
0.857AsnMet: 0.857 ± 0.013
1.475AsnAsn: 1.475 ± 0.018
2.367AsnPro: 2.367 ± 0.023
1.377AsnGln: 1.377 ± 0.018
1.81AsnArg: 1.81 ± 0.018
2.648AsnSer: 2.648 ± 0.027
2.247AsnThr: 2.247 ± 0.022
2.47AsnVal: 2.47 ± 0.021
0.513AsnTrp: 0.513 ± 0.011
1.046AsnTyr: 1.046 ± 0.016
0.0AsnXaa: 0.0 ± 0.0
Pro
5.177ProAla: 5.177 ± 0.038
0.485ProCys: 0.485 ± 0.009
3.257ProAsp: 3.257 ± 0.027
3.802ProGlu: 3.802 ± 0.029
1.983ProPhe: 1.983 ± 0.022
3.869ProGly: 3.869 ± 0.03
1.323ProHis: 1.323 ± 0.015
2.488ProIle: 2.488 ± 0.021
2.669ProLys: 2.669 ± 0.027
4.627ProLeu: 4.627 ± 0.032
1.131ProMet: 1.131 ± 0.016
2.208ProAsn: 2.208 ± 0.022
4.981ProPro: 4.981 ± 0.063
2.681ProGln: 2.681 ± 0.03
3.374ProArg: 3.374 ± 0.032
5.912ProSer: 5.912 ± 0.048
4.115ProThr: 4.115 ± 0.033
3.574ProVal: 3.574 ± 0.03
0.738ProTrp: 0.738 ± 0.012
1.519ProTyr: 1.519 ± 0.02
0.0ProXaa: 0.0 ± 0.0
Gln
3.731GlnAla: 3.731 ± 0.028
0.454GlnCys: 0.454 ± 0.011
2.28GlnAsp: 2.28 ± 0.02
2.582GlnGlu: 2.582 ± 0.025
1.313GlnPhe: 1.313 ± 0.017
2.557GlnGly: 2.557 ± 0.027
1.183GlnHis: 1.183 ± 0.017
2.008GlnIle: 2.008 ± 0.02
2.118GlnLys: 2.118 ± 0.022
3.429GlnLeu: 3.429 ± 0.023
0.932GlnMet: 0.932 ± 0.017
1.668GlnAsn: 1.668 ± 0.021
2.716GlnPro: 2.716 ± 0.034
2.601GlnGln: 2.601 ± 0.039
2.747GlnArg: 2.747 ± 0.022
3.455GlnSer: 3.455 ± 0.031
2.516GlnThr: 2.516 ± 0.022
2.239GlnVal: 2.239 ± 0.022
0.569GlnTrp: 0.569 ± 0.011
1.25GlnTyr: 1.25 ± 0.016
0.0GlnXaa: 0.0 ± 0.0
Arg
4.627ArgAla: 4.627 ± 0.031
0.7ArgCys: 0.7 ± 0.013
3.351ArgAsp: 3.351 ± 0.034
3.675ArgGlu: 3.675 ± 0.036
2.009ArgPhe: 2.009 ± 0.016
3.574ArgGly: 3.574 ± 0.03
1.605ArgHis: 1.605 ± 0.017
2.793ArgIle: 2.793 ± 0.025
3.634ArgLys: 3.634 ± 0.029
5.429ArgLeu: 5.429 ± 0.035
1.31ArgMet: 1.31 ± 0.016
2.238ArgAsn: 2.238 ± 0.023
3.691ArgPro: 3.691 ± 0.029
2.745ArgGln: 2.745 ± 0.023
5.113ArgArg: 5.113 ± 0.044
4.899ArgSer: 4.899 ± 0.038
3.462ArgThr: 3.462 ± 0.028
3.21ArgVal: 3.21 ± 0.027
0.871ArgTrp: 0.871 ± 0.015
1.674ArgTyr: 1.674 ± 0.018
0.0ArgXaa: 0.0 ± 0.0
Ser
6.747SerAla: 6.747 ± 0.048
0.842SerCys: 0.842 ± 0.015
4.227SerAsp: 4.227 ± 0.03
4.134SerGlu: 4.134 ± 0.034
2.939SerPhe: 2.939 ± 0.023
5.445SerGly: 5.445 ± 0.037
1.95SerHis: 1.95 ± 0.021
3.988SerIle: 3.988 ± 0.031
3.849SerLys: 3.849 ± 0.034
7.056SerLeu: 7.056 ± 0.044
1.827SerMet: 1.827 ± 0.023
3.011SerAsn: 3.011 ± 0.023
5.402SerPro: 5.402 ± 0.051
3.47SerGln: 3.47 ± 0.03
5.112SerArg: 5.112 ± 0.04
8.771SerSer: 8.771 ± 0.086
6.175SerThr: 6.175 ± 0.067
4.535SerVal: 4.535 ± 0.031
1.101SerTrp: 1.101 ± 0.015
2.047SerTyr: 2.047 ± 0.021
0.0SerXaa: 0.0 ± 0.0
Thr
5.377ThrAla: 5.377 ± 0.04
0.744ThrCys: 0.744 ± 0.012
2.958ThrAsp: 2.958 ± 0.025
3.152ThrGlu: 3.152 ± 0.022
2.33ThrPhe: 2.33 ± 0.02
4.204ThrGly: 4.204 ± 0.035
1.31ThrHis: 1.31 ± 0.017
3.206ThrIle: 3.206 ± 0.028
2.82ThrLys: 2.82 ± 0.028
5.245ThrLeu: 5.245 ± 0.034
1.321ThrMet: 1.321 ± 0.015
2.328ThrAsn: 2.328 ± 0.02
4.357ThrPro: 4.357 ± 0.034
2.324ThrGln: 2.324 ± 0.025
3.188ThrArg: 3.188 ± 0.027
5.955ThrSer: 5.955 ± 0.059
5.197ThrThr: 5.197 ± 0.103
3.785ThrVal: 3.785 ± 0.03
0.859ThrTrp: 0.859 ± 0.013
1.624ThrTyr: 1.624 ± 0.017
0.0ThrXaa: 0.0 ± 0.0
Val
5.252ValAla: 5.252 ± 0.033
0.799ValCys: 0.799 ± 0.013
3.804ValAsp: 3.804 ± 0.029
3.857ValGlu: 3.857 ± 0.028
2.348ValPhe: 2.348 ± 0.019
3.901ValGly: 3.901 ± 0.035
1.45ValHis: 1.45 ± 0.016
2.902ValIle: 2.902 ± 0.032
2.994ValLys: 2.994 ± 0.026
5.588ValLeu: 5.588 ± 0.037
1.34ValMet: 1.34 ± 0.017
2.256ValAsn: 2.256 ± 0.023
3.609ValPro: 3.609 ± 0.024
2.507ValGln: 2.507 ± 0.022
3.457ValArg: 3.457 ± 0.026
4.584ValSer: 4.584 ± 0.029
3.592ValThr: 3.592 ± 0.033
4.319ValVal: 4.319 ± 0.043
0.865ValTrp: 0.865 ± 0.014
1.692ValTyr: 1.692 ± 0.021
0.0ValXaa: 0.0 ± 0.0
Trp
1.089TrpAla: 1.089 ± 0.018
0.179TrpCys: 0.179 ± 0.006
0.852TrpAsp: 0.852 ± 0.015
0.774TrpGlu: 0.774 ± 0.012
0.531TrpPhe: 0.531 ± 0.01
0.855TrpGly: 0.855 ± 0.016
0.389TrpHis: 0.389 ± 0.009
0.744TrpIle: 0.744 ± 0.011
0.773TrpLys: 0.773 ± 0.012
1.34TrpLeu: 1.34 ± 0.016
0.375TrpMet: 0.375 ± 0.009
0.593TrpAsn: 0.593 ± 0.009
0.599TrpPro: 0.599 ± 0.011
0.614TrpGln: 0.614 ± 0.011
0.933TrpArg: 0.933 ± 0.014
1.078TrpSer: 1.078 ± 0.016
0.951TrpThr: 0.951 ± 0.014
0.82TrpVal: 0.82 ± 0.014
0.275TrpTrp: 0.275 ± 0.009
0.441TrpTyr: 0.441 ± 0.008
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.227TyrAla: 2.227 ± 0.021
0.41TyrCys: 0.41 ± 0.01
1.704TyrAsp: 1.704 ± 0.017
1.538TyrGlu: 1.538 ± 0.016
1.177TyrPhe: 1.177 ± 0.017
2.081TyrGly: 2.081 ± 0.026
0.756TyrHis: 0.756 ± 0.011
1.348TyrIle: 1.348 ± 0.017
1.133TyrLys: 1.133 ± 0.015
2.617TyrLeu: 2.617 ± 0.023
0.635TyrMet: 0.635 ± 0.011
1.119TyrAsn: 1.119 ± 0.016
1.473TyrPro: 1.473 ± 0.019
1.156TyrGln: 1.156 ± 0.015
1.558TyrArg: 1.558 ± 0.016
1.989TyrSer: 1.989 ± 0.02
1.642TyrThr: 1.642 ± 0.017
1.649TyrVal: 1.649 ± 0.02
0.431TyrTrp: 0.431 ± 0.011
0.909TyrTyr: 0.909 ± 0.015
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10841 proteins (5408673 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski