Amino acid dipeptide frequency for Leucoagaricus sp. SymC.cos

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins (and all amino acids were equally common), each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and underrepresented ones are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
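The calculation described above can be sketched in Python. This is only an illustration under stated assumptions, not the code used by Proteome-pI: it assumes dipeptides are counted as overlapping adjacent pairs within each protein, that any letter outside the 20 standard one-letter codes is mapped to X (Xaa), and that the reference value is the uniform expectation of 1/400. The function names are hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
UNIFORM_PER_MILLE = 1000.0 / 400   # 0.25% expressed in per mille (2.5)


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs; pairs never span two proteins."""
    counts = Counter()
    for seq in proteins:
        # Assumption: every non-standard or unknown residue becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    return counts


def dipeptide_per_mille(proteins):
    """Return observed dipeptide frequencies in per mille."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


freqs = dipeptide_per_mille(["AAAC", "AC"])
fraction_aa = freqs["AA"] * 1e-3  # per mille back to a plain fraction
```

With this convention, a table value such as AlaAla (6.93‰) would be classified as overrepresented and AlaCys (0.998‰) as underrepresented.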

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.93 ± 0.055
AlaCys: 0.998 ± 0.015
AlaAsp: 3.576 ± 0.03
AlaGlu: 4.36 ± 0.035
AlaPhe: 3.157 ± 0.027
AlaGly: 4.859 ± 0.037
AlaHis: 1.877 ± 0.02
AlaIle: 4.151 ± 0.028
AlaLys: 3.715 ± 0.031
AlaLeu: 7.494 ± 0.044
AlaMet: 1.715 ± 0.019
AlaAsn: 2.825 ± 0.027
AlaPro: 4.381 ± 0.041
AlaGln: 3.152 ± 0.026
AlaArg: 4.319 ± 0.031
AlaSer: 6.854 ± 0.047
AlaThr: 4.822 ± 0.035
AlaVal: 4.801 ± 0.029
AlaTrp: 1.041 ± 0.013
AlaTyr: 2.056 ± 0.02
AlaXaa: 0.003 ± 0.001
Cys
CysAla: 0.87 ± 0.014
CysCys: 0.271 ± 0.008
CysAsp: 0.624 ± 0.012
CysGlu: 0.576 ± 0.011
CysPhe: 0.551 ± 0.011
CysGly: 0.871 ± 0.015
CysHis: 0.414 ± 0.009
CysIle: 0.767 ± 0.013
CysLys: 0.568 ± 0.011
CysLeu: 1.365 ± 0.017
CysMet: 0.275 ± 0.007
CysAsn: 0.462 ± 0.009
CysPro: 0.764 ± 0.015
CysGln: 0.474 ± 0.011
CysArg: 0.704 ± 0.014
CysSer: 0.993 ± 0.014
CysThr: 0.701 ± 0.013
CysVal: 0.807 ± 0.013
CysTrp: 0.257 ± 0.008
CysTyr: 0.366 ± 0.008
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.111 ± 0.031
AspCys: 0.588 ± 0.012
AspAsp: 3.938 ± 0.042
AspGlu: 4.156 ± 0.041
AspPhe: 2.242 ± 0.022
AspGly: 3.642 ± 0.029
AspHis: 1.269 ± 0.016
AspIle: 3.194 ± 0.027
AspLys: 2.32 ± 0.02
AspLeu: 5.058 ± 0.036
AspMet: 1.116 ± 0.013
AspAsn: 1.948 ± 0.02
AspPro: 3.394 ± 0.026
AspGln: 1.733 ± 0.021
AspArg: 2.708 ± 0.027
AspSer: 4.034 ± 0.036
AspThr: 2.83 ± 0.022
AspVal: 3.836 ± 0.032
AspTrp: 0.879 ± 0.014
AspTyr: 1.548 ± 0.019
AspXaa: 0.001 ± 0.001
Glu
GluAla: 4.648 ± 0.037
GluCys: 0.649 ± 0.011
GluAsp: 3.855 ± 0.036
GluGlu: 5.351 ± 0.055
GluPhe: 1.994 ± 0.022
GluGly: 3.647 ± 0.029
GluHis: 1.359 ± 0.016
GluIle: 3.173 ± 0.028
GluLys: 3.524 ± 0.035
GluLeu: 5.361 ± 0.04
GluMet: 1.363 ± 0.016
GluAsn: 2.211 ± 0.021
GluPro: 2.627 ± 0.027
GluGln: 2.132 ± 0.023
GluArg: 3.893 ± 0.034
GluSer: 3.886 ± 0.026
GluThr: 3.049 ± 0.025
GluVal: 3.722 ± 0.033
GluTrp: 0.946 ± 0.014
GluTyr: 1.653 ± 0.019
GluXaa: 0.001 ± 0.001
Phe
PheAla: 2.905 ± 0.022
PheCys: 0.567 ± 0.011
PheAsp: 2.34 ± 0.022
PheGlu: 2.116 ± 0.023
PhePhe: 1.785 ± 0.023
PheGly: 2.86 ± 0.027
PheHis: 1.029 ± 0.013
PheIle: 2.131 ± 0.022
PheLys: 1.681 ± 0.019
PheLeu: 3.765 ± 0.033
PheMet: 0.801 ± 0.012
PheAsn: 1.578 ± 0.019
PhePro: 2.128 ± 0.022
PheGln: 1.359 ± 0.017
PheArg: 1.981 ± 0.021
PheSer: 3.381 ± 0.03
PheThr: 2.29 ± 0.022
PheVal: 2.607 ± 0.025
PheTrp: 0.63 ± 0.013
PheTyr: 1.161 ± 0.015
PheXaa: 0.001 ± 0.001
Gly
GlyAla: 4.569 ± 0.031
GlyCys: 0.867 ± 0.013
GlyAsp: 3.272 ± 0.027
GlyGlu: 3.369 ± 0.033
GlyPhe: 2.657 ± 0.027
GlyGly: 5.409 ± 0.062
GlyHis: 1.675 ± 0.02
GlyIle: 3.432 ± 0.031
GlyLys: 3.42 ± 0.026
GlyLeu: 5.744 ± 0.039
GlyMet: 1.422 ± 0.019
GlyAsn: 2.42 ± 0.027
GlyPro: 3.111 ± 0.027
GlyGln: 2.275 ± 0.019
GlyArg: 3.726 ± 0.031
GlySer: 5.439 ± 0.04
GlyThr: 3.895 ± 0.03
GlyVal: 4.276 ± 0.032
GlyTrp: 1.043 ± 0.017
GlyTyr: 1.934 ± 0.021
GlyXaa: 0.003 ± 0.001
His
HisAla: 1.845 ± 0.018
HisCys: 0.379 ± 0.009
HisAsp: 1.351 ± 0.014
HisGlu: 1.299 ± 0.014
HisPhe: 1.027 ± 0.014
HisGly: 1.624 ± 0.02
HisHis: 0.924 ± 0.016
HisIle: 1.52 ± 0.016
HisLys: 1.008 ± 0.013
HisLeu: 2.599 ± 0.023
HisMet: 0.474 ± 0.011
HisAsn: 0.943 ± 0.015
HisPro: 1.959 ± 0.019
HisGln: 1.036 ± 0.015
HisArg: 1.487 ± 0.018
HisSer: 2.197 ± 0.022
HisThr: 1.484 ± 0.018
HisVal: 1.594 ± 0.018
HisTrp: 0.367 ± 0.008
HisTyr: 0.741 ± 0.013
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.077 ± 0.031
IleCys: 0.844 ± 0.012
IleAsp: 3.008 ± 0.022
IleGlu: 2.766 ± 0.025
IlePhe: 2.169 ± 0.023
IleGly: 3.162 ± 0.027
IleHis: 1.459 ± 0.015
IleIle: 2.887 ± 0.026
IleLys: 2.316 ± 0.024
IleLeu: 5.16 ± 0.038
IleMet: 1.043 ± 0.013
IleAsn: 1.996 ± 0.019
IlePro: 3.533 ± 0.028
IleGln: 2.055 ± 0.02
IleArg: 2.934 ± 0.028
IleSer: 4.269 ± 0.034
IleThr: 3.132 ± 0.026
IleVal: 3.493 ± 0.027
IleTrp: 0.824 ± 0.015
IleTyr: 1.496 ± 0.018
IleXaa: 0.001 ± 0.0
Lys
LysAla: 4.064 ± 0.031
LysCys: 0.574 ± 0.01
LysAsp: 2.696 ± 0.023
LysGlu: 3.464 ± 0.036
LysPhe: 1.56 ± 0.017
LysGly: 2.961 ± 0.027
LysHis: 1.204 ± 0.016
LysIle: 2.459 ± 0.024
LysLys: 3.254 ± 0.038
LysLeu: 4.314 ± 0.029
LysMet: 0.956 ± 0.013
LysAsn: 1.798 ± 0.017
LysPro: 2.739 ± 0.027
LysGln: 1.748 ± 0.017
LysArg: 3.365 ± 0.031
LysSer: 3.572 ± 0.031
LysThr: 2.739 ± 0.025
LysVal: 2.893 ± 0.025
LysTrp: 0.743 ± 0.013
LysTyr: 1.405 ± 0.016
LysXaa: 0.001 ± 0.0
Leu
LeuAla: 7.493 ± 0.044
LeuCys: 1.268 ± 0.016
LeuAsp: 5.245 ± 0.033
LeuGlu: 5.582 ± 0.042
LeuPhe: 3.588 ± 0.029
LeuGly: 5.508 ± 0.036
LeuHis: 2.55 ± 0.025
LeuIle: 4.517 ± 0.032
LeuLys: 4.647 ± 0.034
LeuLeu: 8.905 ± 0.053
LeuMet: 1.767 ± 0.017
LeuAsn: 3.566 ± 0.026
LeuPro: 6.165 ± 0.035
LeuGln: 3.776 ± 0.029
LeuArg: 5.779 ± 0.035
LeuSer: 8.057 ± 0.044
LeuThr: 5.396 ± 0.032
LeuVal: 5.974 ± 0.042
LeuTrp: 1.25 ± 0.017
LeuTyr: 2.428 ± 0.023
LeuXaa: 0.002 ± 0.001
Met
MetAla: 1.711 ± 0.018
MetCys: 0.256 ± 0.007
MetAsp: 1.175 ± 0.016
MetGlu: 1.221 ± 0.016
MetPhe: 0.761 ± 0.014
MetGly: 1.255 ± 0.018
MetHis: 0.506 ± 0.008
MetIle: 1.064 ± 0.014
MetLys: 1.082 ± 0.013
MetLeu: 1.829 ± 0.018
MetMet: 0.525 ± 0.01
MetAsn: 0.809 ± 0.013
MetPro: 1.191 ± 0.016
MetGln: 0.764 ± 0.012
MetArg: 1.194 ± 0.016
MetSer: 1.758 ± 0.017
MetThr: 1.311 ± 0.017
MetVal: 1.213 ± 0.016
MetTrp: 0.286 ± 0.008
MetTyr: 0.511 ± 0.01
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.079 ± 0.025
AsnCys: 0.461 ± 0.01
AsnAsp: 2.002 ± 0.021
AsnGlu: 1.941 ± 0.018
AsnPhe: 1.469 ± 0.018
AsnGly: 2.802 ± 0.032
AsnHis: 0.939 ± 0.014
AsnIle: 2.191 ± 0.023
AsnLys: 1.639 ± 0.018
AsnLeu: 3.55 ± 0.028
AsnMet: 0.817 ± 0.012
AsnAsn: 1.55 ± 0.022
AsnPro: 2.672 ± 0.024
AsnGln: 1.362 ± 0.018
AsnArg: 1.886 ± 0.02
AsnSer: 2.976 ± 0.03
AsnThr: 2.341 ± 0.021
AsnVal: 2.645 ± 0.026
AsnTrp: 0.558 ± 0.01
AsnTyr: 1.026 ± 0.015
AsnXaa: 0.001 ± 0.001
Pro
ProAla: 4.488 ± 0.043
ProCys: 0.553 ± 0.011
ProAsp: 3.168 ± 0.028
ProGlu: 3.69 ± 0.028
ProPhe: 2.398 ± 0.022
ProGly: 3.719 ± 0.033
ProHis: 1.63 ± 0.017
ProIle: 2.894 ± 0.026
ProLys: 2.679 ± 0.025
ProLeu: 5.339 ± 0.041
ProMet: 1.029 ± 0.015
ProAsn: 2.49 ± 0.023
ProPro: 6.247 ± 0.081
ProGln: 2.587 ± 0.032
ProArg: 3.346 ± 0.034
ProSer: 6.97 ± 0.056
ProThr: 4.481 ± 0.034
ProVal: 3.672 ± 0.032
ProTrp: 0.737 ± 0.011
ProTyr: 1.636 ± 0.02
ProXaa: 0.002 ± 0.0
Gln
GlnAla: 3.161 ± 0.03
GlnCys: 0.5 ± 0.011
GlnAsp: 1.94 ± 0.02
GlnGlu: 2.292 ± 0.023
GlnPhe: 1.387 ± 0.018
GlnGly: 2.163 ± 0.02
GlnHis: 1.054 ± 0.017
GlnIle: 1.899 ± 0.02
GlnLys: 1.855 ± 0.02
GlnLeu: 3.573 ± 0.028
GlnMet: 0.801 ± 0.013
GlnAsn: 1.548 ± 0.019
GlnPro: 2.429 ± 0.031
GlnGln: 2.115 ± 0.037
GlnArg: 2.401 ± 0.026
GlnSer: 2.955 ± 0.027
GlnThr: 2.327 ± 0.023
GlnVal: 2.318 ± 0.023
GlnTrp: 0.55 ± 0.011
GlnTyr: 1.115 ± 0.016
GlnXaa: 0.001 ± 0.0
Arg
ArgAla: 4.272 ± 0.031
ArgCys: 0.733 ± 0.013
ArgAsp: 3.055 ± 0.026
ArgGlu: 3.673 ± 0.033
ArgPhe: 2.159 ± 0.017
ArgGly: 3.371 ± 0.028
ArgHis: 1.473 ± 0.021
ArgIle: 3.088 ± 0.024
ArgLys: 3.371 ± 0.026
ArgLeu: 5.45 ± 0.036
ArgMet: 1.222 ± 0.014
ArgAsn: 2.231 ± 0.025
ArgPro: 3.441 ± 0.03
ArgGln: 2.252 ± 0.022
ArgArg: 4.593 ± 0.037
ArgSer: 4.751 ± 0.037
ArgThr: 3.246 ± 0.026
ArgVal: 3.543 ± 0.027
ArgTrp: 0.93 ± 0.014
ArgTyr: 1.638 ± 0.019
ArgXaa: 0.003 ± 0.001
Ser
SerAla: 6.341 ± 0.036
SerCys: 0.934 ± 0.016
SerAsp: 4.266 ± 0.031
SerGlu: 4.043 ± 0.033
SerPhe: 3.279 ± 0.024
SerGly: 5.533 ± 0.037
SerHis: 2.219 ± 0.022
SerIle: 4.239 ± 0.03
SerLys: 3.605 ± 0.028
SerLeu: 7.932 ± 0.043
SerMet: 1.664 ± 0.017
SerAsn: 3.171 ± 0.028
SerPro: 6.161 ± 0.053
SerGln: 3.392 ± 0.03
SerArg: 5.043 ± 0.039
SerSer: 9.857 ± 0.086
SerThr: 6.216 ± 0.043
SerVal: 4.913 ± 0.032
SerTrp: 1.089 ± 0.016
SerTyr: 2.063 ± 0.023
SerXaa: 0.003 ± 0.001
Thr
ThrAla: 4.624 ± 0.032
ThrCys: 0.777 ± 0.012
ThrAsp: 2.661 ± 0.023
ThrGlu: 2.87 ± 0.025
ThrPhe: 2.483 ± 0.025
ThrGly: 3.949 ± 0.026
ThrHis: 1.477 ± 0.016
ThrIle: 3.331 ± 0.029
ThrLys: 2.662 ± 0.024
ThrLeu: 5.785 ± 0.039
ThrMet: 1.155 ± 0.016
ThrAsn: 2.22 ± 0.02
ThrPro: 4.689 ± 0.04
ThrGln: 2.172 ± 0.021
ThrArg: 3.301 ± 0.029
ThrSer: 6.0 ± 0.046
ThrThr: 4.504 ± 0.041
ThrVal: 3.767 ± 0.027
ThrTrp: 0.849 ± 0.012
ThrTyr: 1.607 ± 0.017
ThrXaa: 0.002 ± 0.001
Val
ValAla: 4.85 ± 0.034
ValCys: 0.867 ± 0.013
ValAsp: 3.782 ± 0.031
ValGlu: 3.863 ± 0.031
ValPhe: 2.605 ± 0.023
ValGly: 3.888 ± 0.029
ValHis: 1.629 ± 0.019
ValIle: 3.387 ± 0.026
ValLys: 3.14 ± 0.029
ValLeu: 6.089 ± 0.035
ValMet: 1.332 ± 0.017
ValAsn: 2.403 ± 0.024
ValPro: 3.897 ± 0.032
ValGln: 2.423 ± 0.019
ValArg: 3.342 ± 0.025
ValSer: 4.823 ± 0.037
ValThr: 3.577 ± 0.029
ValVal: 4.544 ± 0.035
ValTrp: 0.941 ± 0.016
ValTyr: 1.778 ± 0.02
ValXaa: 0.001 ± 0.001
Trp
TrpAla: 1.036 ± 0.015
TrpCys: 0.211 ± 0.007
TrpAsp: 0.932 ± 0.018
TrpGlu: 0.898 ± 0.015
TrpPhe: 0.617 ± 0.014
TrpGly: 0.865 ± 0.014
TrpHis: 0.358 ± 0.008
TrpIle: 0.801 ± 0.013
TrpLys: 0.848 ± 0.013
TrpLeu: 1.353 ± 0.017
TrpMet: 0.395 ± 0.01
TrpAsn: 0.656 ± 0.012
TrpPro: 0.603 ± 0.011
TrpGln: 0.503 ± 0.009
TrpArg: 0.936 ± 0.014
TrpSer: 1.153 ± 0.017
TrpThr: 0.9 ± 0.013
TrpVal: 0.885 ± 0.014
TrpTrp: 0.255 ± 0.008
TrpTyr: 0.412 ± 0.011
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.009 ± 0.021
TyrCys: 0.385 ± 0.009
TyrAsp: 1.657 ± 0.018
TyrGlu: 1.511 ± 0.019
TyrPhe: 1.209 ± 0.017
TyrGly: 1.868 ± 0.021
TyrHis: 0.81 ± 0.013
TyrIle: 1.492 ± 0.017
TyrLys: 1.177 ± 0.015
TyrLeu: 2.795 ± 0.026
TyrMet: 0.552 ± 0.01
TyrAsn: 1.08 ± 0.016
TyrPro: 1.623 ± 0.021
TyrGln: 1.074 ± 0.014
TyrArg: 1.553 ± 0.019
TyrSer: 2.036 ± 0.021
TyrThr: 1.644 ± 0.018
TyrVal: 1.658 ± 0.019
TyrTrp: 0.441 ± 0.009
TyrTyr: 0.879 ± 0.015
TyrXaa: 0.001 ± 0.0
Xaa
XaaAla: 0.002 ± 0.001
XaaCys: 0.0 ± 0.0
XaaAsp: 0.001 ± 0.0
XaaGlu: 0.002 ± 0.001
XaaPhe: 0.001 ± 0.0
XaaGly: 0.002 ± 0.001
XaaHis: 0.001 ± 0.0
XaaIle: 0.001 ± 0.0
XaaLys: 0.001 ± 0.0
XaaLeu: 0.002 ± 0.001
XaaMet: 0.001 ± 0.0
XaaAsn: 0.001 ± 0.0
XaaPro: 0.003 ± 0.001
XaaGln: 0.001 ± 0.0
XaaArg: 0.003 ± 0.001
XaaSer: 0.003 ± 0.001
XaaThr: 0.001 ± 0.0
XaaVal: 0.002 ± 0.001
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 2.064 ± 0.299
Statistics based on 12747 proteins (5393887 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
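A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the Proteome-pI implementation: it assumes whole proteins are resampled with replacement, that the frequency is recomputed for each replicate, and that the reported error is the standard deviation across replicates (the source does not say which statistic is used). The function names and the `seed` parameter are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Overlapping adjacent residue pairs within one protein."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each replicate draws len(proteins) proteins with replacement.
    """
    rng = random.Random(seed)
    # Count each protein once and reuse the counts in every replicate.
    per_protein = [pair_counts(s) for s in proteins]
    totals = [sum(c.values()) for c in per_protein]
    indices = range(len(proteins))
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(indices, k=len(proteins))
        total = sum(totals[i] for i in sample)
        hits = sum(per_protein[i][pair] for i in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


mean_aa, sd_aa = bootstrap_error(["AAAA", "AACC", "CCAA", "ACAC"], "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects variation between proteins.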

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski