Amino acid dipeptide frequency for Colletotrichum chlorophyti

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
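The comparison against the random expectation can be sketched as follows. This is a minimal illustration, assuming the uniform 1/400 baseline stated above is the reference for the red/blue marking; the function name `classify` and the choice of example dipeptides are hypothetical, and the values are copied from the table on this page.

```python
# Uniform expectation: each of the 400 standard dipeptides equally likely.
EXPECTED_PER_MILLE = 1000 / 400  # 2.5 per mille, i.e. 0.25%

# A few observed values (per mille) copied from the table on this page.
observed = {
    "AlaAla": 10.46,
    "LeuLeu": 7.979,
    "GluVal": 3.668,
    "TrpTrp": 0.297,
    "CysCys": 0.224,
}


def classify(value, expected=EXPECTED_PER_MILLE):
    """Return 'over' (red), 'under' (blue) or 'expected' for a per mille value."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


for dipeptide, value in observed.items():
    ratio = value / EXPECTED_PER_MILLE  # fold change versus the uniform baseline
    print(f"{dipeptide}: {value} per mille, {classify(value)}, ratio {ratio:.2f}")
```

Note that a uniform baseline ignores the fact that single amino acids are themselves unequally frequent, so a dipeptide of two common residues will look overrepresented under this assumption.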

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
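As a rough sketch of how such per mille values could be obtained from a set of protein sequences: the code below counts overlapping residue pairs within each protein and normalises by the total number of pairs. The one-letter input, the overlapping window, the mapping of non-standard letters to X (Xaa), and the function name `dipeptide_per_mille` are all assumptions made for illustration; the exact procedure used by the database is not described on this page.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for one-letter sequences."""
    counts = Counter()
    for seq in sequences:
        # Assumption: any non-standard or unknown letter is treated as X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs; pairs never span two different proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000 * n / total for dp, n in counts.items()}


# Toy input, not real proteome data.
toy = ["MKAAL", "AAXG"]
freqs = dipeptide_per_mille(toy)
fraction_aa = freqs["AA"] * 1e-3  # per mille back to a plain fraction
print(freqs["AA"], fraction_aa)
```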

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.46 ± 0.061
AlaCys: 1.064 ± 0.016
AlaAsp: 4.678 ± 0.035
AlaGlu: 5.441 ± 0.048
AlaPhe: 3.29 ± 0.029
AlaGly: 6.404 ± 0.043
AlaHis: 1.791 ± 0.018
AlaIle: 4.329 ± 0.03
AlaLys: 4.315 ± 0.035
AlaLeu: 7.943 ± 0.048
AlaMet: 2.062 ± 0.021
AlaAsn: 3.145 ± 0.024
AlaPro: 5.17 ± 0.045
AlaGln: 3.456 ± 0.034
AlaArg: 5.01 ± 0.036
AlaSer: 7.525 ± 0.047
AlaThr: 5.586 ± 0.038
AlaVal: 6.029 ± 0.042
AlaTrp: 1.282 ± 0.017
AlaTyr: 2.245 ± 0.022
AlaXaa: 0.001 ± 0.0
Cys
CysAla: 0.925 ± 0.015
CysCys: 0.224 ± 0.008
CysAsp: 0.622 ± 0.011
CysGlu: 0.553 ± 0.012
CysPhe: 0.506 ± 0.01
CysGly: 0.954 ± 0.019
CysHis: 0.289 ± 0.007
CysIle: 0.662 ± 0.012
CysLys: 0.479 ± 0.011
CysLeu: 1.144 ± 0.017
CysMet: 0.247 ± 0.007
CysAsn: 0.417 ± 0.01
CysPro: 0.615 ± 0.013
CysGln: 0.399 ± 0.01
CysArg: 0.688 ± 0.012
CysSer: 0.834 ± 0.015
CysThr: 0.659 ± 0.013
CysVal: 0.777 ± 0.015
CysTrp: 0.204 ± 0.008
CysTyr: 0.329 ± 0.008
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.073 ± 0.038
AspCys: 0.591 ± 0.011
AspAsp: 4.475 ± 0.052
AspGlu: 4.516 ± 0.038
AspPhe: 2.331 ± 0.024
AspGly: 4.325 ± 0.03
AspHis: 1.182 ± 0.016
AspIle: 2.945 ± 0.027
AspLys: 2.487 ± 0.027
AspLeu: 4.962 ± 0.033
AspMet: 1.265 ± 0.017
AspAsn: 1.89 ± 0.021
AspPro: 3.29 ± 0.025
AspGln: 1.78 ± 0.023
AspArg: 3.036 ± 0.033
AspSer: 3.908 ± 0.033
AspThr: 2.832 ± 0.022
AspVal: 3.942 ± 0.032
AspTrp: 0.902 ± 0.014
AspTyr: 1.569 ± 0.019
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.659 ± 0.044
GluCys: 0.547 ± 0.011
GluAsp: 4.169 ± 0.041
GluGlu: 5.295 ± 0.051
GluPhe: 1.941 ± 0.021
GluGly: 3.787 ± 0.031
GluHis: 1.346 ± 0.016
GluIle: 2.881 ± 0.026
GluLys: 3.783 ± 0.037
GluLeu: 5.032 ± 0.041
GluMet: 1.415 ± 0.016
GluAsn: 2.215 ± 0.022
GluPro: 2.864 ± 0.037
GluGln: 2.321 ± 0.026
GluArg: 3.827 ± 0.036
GluSer: 4.051 ± 0.032
GluThr: 3.457 ± 0.033
GluVal: 3.668 ± 0.032
GluTrp: 0.876 ± 0.011
GluTyr: 1.601 ± 0.019
GluXaa: 0.001 ± 0.0
Phe
PheAla: 3.255 ± 0.029
PheCys: 0.56 ± 0.011
PheAsp: 2.356 ± 0.023
PheGlu: 2.157 ± 0.019
PhePhe: 1.715 ± 0.022
PheGly: 3.049 ± 0.03
PheHis: 0.851 ± 0.013
PheIle: 1.814 ± 0.023
PheLys: 1.573 ± 0.017
PheLeu: 3.449 ± 0.03
PheMet: 0.796 ± 0.014
PheAsn: 1.508 ± 0.02
PhePro: 1.938 ± 0.02
PheGln: 1.356 ± 0.017
PheArg: 1.97 ± 0.021
PheSer: 2.963 ± 0.023
PheThr: 2.234 ± 0.025
PheVal: 2.573 ± 0.024
PheTrp: 0.691 ± 0.014
PheTyr: 1.148 ± 0.016
PheXaa: 0.001 ± 0.0
Gly
GlyAla: 5.903 ± 0.044
GlyCys: 0.864 ± 0.015
GlyAsp: 3.759 ± 0.03
GlyGlu: 3.642 ± 0.029
GlyPhe: 2.993 ± 0.027
GlyGly: 6.35 ± 0.062
GlyHis: 1.69 ± 0.019
GlyIle: 3.593 ± 0.03
GlyLys: 3.534 ± 0.032
GlyLeu: 6.303 ± 0.035
GlyMet: 1.614 ± 0.02
GlyAsn: 2.685 ± 0.024
GlyPro: 3.489 ± 0.032
GlyGln: 2.594 ± 0.023
GlyArg: 4.273 ± 0.031
GlySer: 5.79 ± 0.047
GlyThr: 4.205 ± 0.048
GlyVal: 4.725 ± 0.032
GlyTrp: 1.225 ± 0.018
GlyTyr: 2.19 ± 0.026
GlyXaa: 0.001 ± 0.0
His
HisAla: 1.833 ± 0.019
HisCys: 0.307 ± 0.009
HisAsp: 1.265 ± 0.018
HisGlu: 1.269 ± 0.017
HisPhe: 0.909 ± 0.015
HisGly: 1.686 ± 0.019
HisHis: 0.795 ± 0.017
HisIle: 1.104 ± 0.014
HisLys: 0.891 ± 0.014
HisLeu: 2.092 ± 0.024
HisMet: 0.5 ± 0.01
HisAsn: 0.836 ± 0.015
HisPro: 1.547 ± 0.018
HisGln: 0.953 ± 0.017
HisArg: 1.421 ± 0.019
HisSer: 1.667 ± 0.018
HisThr: 1.149 ± 0.014
HisVal: 1.459 ± 0.018
HisTrp: 0.321 ± 0.008
HisTyr: 0.664 ± 0.013
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.241 ± 0.035
IleCys: 0.679 ± 0.015
IleAsp: 2.748 ± 0.025
IleGlu: 2.729 ± 0.024
IlePhe: 1.977 ± 0.023
IleGly: 3.255 ± 0.029
IleHis: 1.093 ± 0.015
IleIle: 2.487 ± 0.032
IleLys: 2.149 ± 0.024
IleLeu: 4.362 ± 0.032
IleMet: 1.017 ± 0.015
IleAsn: 1.798 ± 0.02
IlePro: 2.933 ± 0.025
IleGln: 1.792 ± 0.019
IleArg: 2.688 ± 0.024
IleSer: 3.569 ± 0.029
IleThr: 2.814 ± 0.026
IleVal: 3.26 ± 0.031
IleTrp: 0.732 ± 0.011
IleTyr: 1.33 ± 0.016
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.537 ± 0.032
LysCys: 0.463 ± 0.011
LysAsp: 2.831 ± 0.031
LysGlu: 3.37 ± 0.035
LysPhe: 1.481 ± 0.017
LysGly: 3.075 ± 0.027
LysHis: 1.104 ± 0.016
LysIle: 2.168 ± 0.023
LysLys: 3.42 ± 0.052
LysLeu: 4.046 ± 0.031
LysMet: 1.021 ± 0.015
LysAsn: 1.8 ± 0.024
LysPro: 2.833 ± 0.028
LysGln: 1.808 ± 0.018
LysArg: 3.287 ± 0.032
LysSer: 3.431 ± 0.029
LysThr: 2.969 ± 0.028
LysVal: 2.921 ± 0.026
LysTrp: 0.713 ± 0.011
LysTyr: 1.365 ± 0.016
LysXaa: 0.001 ± 0.0
Leu
LeuAla: 8.179 ± 0.05
LeuCys: 1.107 ± 0.015
LeuAsp: 5.021 ± 0.035
LeuGlu: 5.26 ± 0.046
LeuPhe: 3.324 ± 0.03
LeuGly: 6.114 ± 0.038
LeuHis: 2.042 ± 0.021
LeuIle: 3.805 ± 0.037
LeuLys: 4.122 ± 0.036
LeuLeu: 7.979 ± 0.051
LeuMet: 1.794 ± 0.022
LeuAsn: 3.138 ± 0.026
LeuPro: 5.399 ± 0.041
LeuGln: 3.55 ± 0.03
LeuArg: 5.589 ± 0.04
LeuSer: 6.911 ± 0.037
LeuThr: 4.919 ± 0.037
LeuVal: 5.653 ± 0.041
LeuTrp: 1.199 ± 0.015
LeuTyr: 2.268 ± 0.027
LeuXaa: 0.001 ± 0.0
Met
MetAla: 2.371 ± 0.023
MetCys: 0.234 ± 0.007
MetAsp: 1.201 ± 0.016
MetGlu: 1.213 ± 0.016
MetPhe: 0.78 ± 0.013
MetGly: 1.478 ± 0.018
MetHis: 0.448 ± 0.011
MetIle: 0.96 ± 0.013
MetLys: 1.002 ± 0.015
MetLeu: 1.836 ± 0.022
MetMet: 0.577 ± 0.013
MetAsn: 0.787 ± 0.014
MetPro: 1.283 ± 0.017
MetGln: 0.787 ± 0.014
MetArg: 1.276 ± 0.019
MetSer: 1.872 ± 0.019
MetThr: 1.323 ± 0.017
MetVal: 1.322 ± 0.019
MetTrp: 0.27 ± 0.008
MetTyr: 0.532 ± 0.011
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.206 ± 0.029
AsnCys: 0.438 ± 0.01
AsnAsp: 1.98 ± 0.022
AsnGlu: 1.98 ± 0.018
AsnPhe: 1.45 ± 0.017
AsnGly: 3.163 ± 0.03
AsnHis: 0.825 ± 0.014
AsnIle: 2.004 ± 0.02
AsnLys: 1.624 ± 0.017
AsnLeu: 3.257 ± 0.024
AsnMet: 0.832 ± 0.015
AsnAsn: 1.535 ± 0.022
AsnPro: 2.475 ± 0.025
AsnGln: 1.291 ± 0.02
AsnArg: 1.916 ± 0.021
AsnSer: 2.654 ± 0.021
AsnThr: 2.181 ± 0.023
AsnVal: 2.41 ± 0.021
AsnTrp: 0.583 ± 0.009
AsnTyr: 1.075 ± 0.017
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.689 ± 0.049
ProCys: 0.466 ± 0.011
ProAsp: 3.247 ± 0.027
ProGlu: 3.921 ± 0.036
ProPhe: 2.094 ± 0.019
ProGly: 4.139 ± 0.037
ProHis: 1.218 ± 0.017
ProIle: 2.449 ± 0.022
ProLys: 2.744 ± 0.027
ProLeu: 4.637 ± 0.032
ProMet: 1.056 ± 0.018
ProAsn: 2.242 ± 0.024
ProPro: 5.153 ± 0.07
ProGln: 2.445 ± 0.033
ProArg: 3.392 ± 0.034
ProSer: 5.721 ± 0.048
ProThr: 4.008 ± 0.039
ProVal: 3.762 ± 0.032
ProTrp: 0.775 ± 0.015
ProTyr: 1.469 ± 0.019
ProXaa: 0.002 ± 0.001
Gln
GlnAla: 3.492 ± 0.031
GlnCys: 0.399 ± 0.011
GlnAsp: 1.987 ± 0.022
GlnGlu: 2.249 ± 0.026
GlnPhe: 1.243 ± 0.017
GlnGly: 2.405 ± 0.024
GlnHis: 1.013 ± 0.016
GlnIle: 1.786 ± 0.021
GlnLys: 1.883 ± 0.022
GlnLeu: 3.26 ± 0.028
GlnMet: 0.875 ± 0.013
GlnAsn: 1.481 ± 0.019
GlnPro: 2.531 ± 0.036
GlnGln: 2.496 ± 0.056
GlnArg: 2.543 ± 0.023
GlnSer: 2.863 ± 0.026
GlnThr: 2.305 ± 0.031
GlnVal: 2.153 ± 0.022
GlnTrp: 0.566 ± 0.011
GlnTyr: 1.123 ± 0.016
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.76 ± 0.037
ArgCys: 0.652 ± 0.014
ArgAsp: 3.427 ± 0.034
ArgGlu: 3.747 ± 0.035
ArgPhe: 2.15 ± 0.021
ArgGly: 3.805 ± 0.036
ArgHis: 1.505 ± 0.019
ArgIle: 2.769 ± 0.025
ArgLys: 3.411 ± 0.031
ArgLeu: 5.289 ± 0.032
ArgMet: 1.311 ± 0.017
ArgAsn: 2.237 ± 0.022
ArgPro: 3.566 ± 0.032
ArgGln: 2.546 ± 0.027
ArgArg: 5.059 ± 0.049
ArgSer: 4.595 ± 0.039
ArgThr: 3.225 ± 0.024
ArgVal: 3.446 ± 0.028
ArgTrp: 0.939 ± 0.014
ArgTyr: 1.592 ± 0.016
ArgXaa: 0.001 ± 0.0
Ser
SerAla: 6.871 ± 0.038
SerCys: 0.797 ± 0.014
SerAsp: 4.118 ± 0.031
SerGlu: 3.926 ± 0.031
SerPhe: 3.021 ± 0.027
SerGly: 5.606 ± 0.043
SerHis: 1.799 ± 0.021
SerIle: 3.716 ± 0.028
SerLys: 3.628 ± 0.032
SerLeu: 6.832 ± 0.043
SerMet: 1.665 ± 0.02
SerAsn: 2.901 ± 0.025
SerPro: 5.317 ± 0.046
SerGln: 3.07 ± 0.028
SerArg: 4.812 ± 0.037
SerSer: 8.073 ± 0.061
SerThr: 5.299 ± 0.046
SerVal: 4.722 ± 0.034
SerTrp: 1.106 ± 0.016
SerTyr: 2.009 ± 0.025
SerXaa: 0.001 ± 0.001
Thr
ThrAla: 5.561 ± 0.039
ThrCys: 0.724 ± 0.014
ThrAsp: 2.89 ± 0.022
ThrGlu: 3.145 ± 0.03
ThrPhe: 2.276 ± 0.026
ThrGly: 4.311 ± 0.044
ThrHis: 1.223 ± 0.015
ThrIle: 3.028 ± 0.029
ThrLys: 2.673 ± 0.025
ThrLeu: 5.138 ± 0.046
ThrMet: 1.168 ± 0.015
ThrAsn: 2.15 ± 0.024
ThrPro: 4.388 ± 0.04
ThrGln: 2.045 ± 0.021
ThrArg: 3.103 ± 0.026
ThrSer: 5.14 ± 0.043
ThrThr: 4.385 ± 0.067
ThrVal: 3.981 ± 0.048
ThrTrp: 0.911 ± 0.014
ThrTyr: 1.674 ± 0.02
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.819 ± 0.038
ValCys: 0.819 ± 0.016
ValAsp: 3.893 ± 0.027
ValGlu: 3.96 ± 0.032
ValPhe: 2.675 ± 0.028
ValGly: 4.385 ± 0.037
ValHis: 1.347 ± 0.018
ValIle: 3.054 ± 0.026
ValLys: 3.026 ± 0.026
ValLeu: 5.842 ± 0.041
ValMet: 1.346 ± 0.017
ValAsn: 2.327 ± 0.022
ValPro: 3.776 ± 0.032
ValGln: 2.332 ± 0.022
ValArg: 3.583 ± 0.031
ValSer: 4.717 ± 0.032
ValThr: 3.828 ± 0.039
ValVal: 4.721 ± 0.043
ValTrp: 0.951 ± 0.013
ValTyr: 1.779 ± 0.02
ValXaa: 0.001 ± 0.0
Trp
TrpAla: 1.192 ± 0.014
TrpCys: 0.202 ± 0.006
TrpAsp: 0.938 ± 0.016
TrpGlu: 0.843 ± 0.015
TrpPhe: 0.586 ± 0.013
TrpGly: 0.998 ± 0.018
TrpHis: 0.386 ± 0.008
TrpIle: 0.759 ± 0.014
TrpLys: 0.803 ± 0.012
TrpLeu: 1.394 ± 0.02
TrpMet: 0.378 ± 0.008
TrpAsn: 0.657 ± 0.012
TrpPro: 0.633 ± 0.011
TrpGln: 0.577 ± 0.011
TrpArg: 0.95 ± 0.015
TrpSer: 1.05 ± 0.016
TrpThr: 0.969 ± 0.014
TrpVal: 0.927 ± 0.015
TrpTrp: 0.297 ± 0.007
TrpTyr: 0.467 ± 0.011
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.199 ± 0.024
TyrCys: 0.39 ± 0.009
TyrAsp: 1.694 ± 0.02
TyrGlu: 1.521 ± 0.019
TyrPhe: 1.213 ± 0.019
TyrGly: 2.135 ± 0.023
TyrHis: 0.721 ± 0.011
TyrIle: 1.342 ± 0.016
TyrLys: 1.097 ± 0.017
TyrLeu: 2.576 ± 0.024
TyrMet: 0.611 ± 0.011
TyrAsn: 1.131 ± 0.014
TyrPro: 1.427 ± 0.018
TyrGln: 1.055 ± 0.015
TyrArg: 1.606 ± 0.018
TyrSer: 1.951 ± 0.021
TyrThr: 1.566 ± 0.023
TyrVal: 1.728 ± 0.02
TyrTrp: 0.463 ± 0.01
TyrTyr: 0.914 ± 0.017
TyrXaa: 0.001 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.001 ± 0.0
XaaPhe: 0.001 ± 0.0
XaaGly: 0.001 ± 0.001
XaaHis: 0.0 ± 0.0
XaaIle: 0.001 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.001 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.001 ± 0.0
XaaPro: 0.003 ± 0.001
XaaGln: 0.0 ± 0.0
XaaArg: 0.001 ± 0.0
XaaSer: 0.001 ± 0.001
XaaThr: 0.0 ± 0.0
XaaVal: 0.001 ± 0.0
XaaTrp: 0.001 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.01 ± 0.003
Statistics based on 10302 proteins (5134087 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
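One way such an error estimate could be obtained is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the standard deviation over 100 resamples is reported. Only "bootstrapping, ×100, at the protein level" comes from this page; the use of the standard deviation as the error measure, the one-letter input, and the function names `pair_counts` and `bootstrap_error` are assumptions for illustration.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping dipeptides in a single one-letter sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Standard deviation (per mille) of a dipeptide frequency over resamples."""
    rng = random.Random(seed)
    # Precompute (hits, total pairs) once per protein.
    per_protein = []
    for seq in sequences:
        counts = pair_counts(seq)
        per_protein.append((counts[dipeptide], sum(counts.values())))
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy input, not real proteome data.
toy = ["MKAAL", "AAXG", "GGAAK", "LLLL"]
print(bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual residue pairs keeps the pairs within a protein together, so the estimate reflects variation between proteins.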

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski