Amino acid dipepetide frequency for Rachicladosporium antarcticum

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.824AlaAla: 10.824 ± 0.052
1.028AlaCys: 1.028 ± 0.013
4.878AlaAsp: 4.878 ± 0.027
5.909AlaGlu: 5.909 ± 0.037
3.242AlaPhe: 3.242 ± 0.02
6.779AlaGly: 6.779 ± 0.034
2.01AlaHis: 2.01 ± 0.017
4.261AlaIle: 4.261 ± 0.025
4.771AlaLys: 4.771 ± 0.029
8.528AlaLeu: 8.528 ± 0.038
2.213AlaMet: 2.213 ± 0.017
3.121AlaAsn: 3.121 ± 0.021
5.632AlaPro: 5.632 ± 0.037
4.003AlaGln: 4.003 ± 0.024
5.571AlaArg: 5.571 ± 0.031
8.102AlaSer: 8.102 ± 0.039
6.304AlaThr: 6.304 ± 0.033
6.123AlaVal: 6.123 ± 0.028
1.277AlaTrp: 1.277 ± 0.013
2.368AlaTyr: 2.368 ± 0.018
0.0AlaXaa: 0.0 ± 0.0
Cys
0.928CysAla: 0.928 ± 0.012
0.193CysCys: 0.193 ± 0.005
0.581CysAsp: 0.581 ± 0.009
0.553CysGlu: 0.553 ± 0.007
0.455CysPhe: 0.455 ± 0.008
0.878CysGly: 0.878 ± 0.011
0.285CysHis: 0.285 ± 0.005
0.597CysIle: 0.597 ± 0.009
0.492CysLys: 0.492 ± 0.008
1.058CysLeu: 1.058 ± 0.012
0.234CysMet: 0.234 ± 0.005
0.371CysAsn: 0.371 ± 0.007
0.561CysPro: 0.561 ± 0.009
0.369CysGln: 0.369 ± 0.006
0.639CysArg: 0.639 ± 0.009
0.751CysSer: 0.751 ± 0.009
0.661CysThr: 0.661 ± 0.009
0.713CysVal: 0.713 ± 0.01
0.178CysTrp: 0.178 ± 0.005
0.333CysTyr: 0.333 ± 0.006
0.0CysXaa: 0.0 ± 0.0
Asp
5.528AspAla: 5.528 ± 0.03
0.575AspCys: 0.575 ± 0.009
4.463AspAsp: 4.463 ± 0.031
4.657AspGlu: 4.657 ± 0.032
2.26AspPhe: 2.26 ± 0.014
4.315AspGly: 4.315 ± 0.026
1.291AspHis: 1.291 ± 0.013
2.645AspIle: 2.645 ± 0.017
2.189AspLys: 2.189 ± 0.016
5.25AspLeu: 5.25 ± 0.026
1.281AspMet: 1.281 ± 0.011
1.638AspAsn: 1.638 ± 0.013
3.201AspPro: 3.201 ± 0.023
1.763AspGln: 1.763 ± 0.014
3.165AspArg: 3.165 ± 0.024
4.057AspSer: 4.057 ± 0.021
3.023AspThr: 3.023 ± 0.018
3.952AspVal: 3.952 ± 0.022
0.889AspTrp: 0.889 ± 0.011
1.542AspTyr: 1.542 ± 0.013
0.0AspXaa: 0.0 ± 0.0
Glu
5.919GluAla: 5.919 ± 0.033
0.562GluCys: 0.562 ± 0.008
4.262GluAsp: 4.262 ± 0.033
5.087GluGlu: 5.087 ± 0.037
1.614GluPhe: 1.614 ± 0.014
4.282GluGly: 4.282 ± 0.027
1.492GluHis: 1.492 ± 0.014
2.778GluIle: 2.778 ± 0.017
3.423GluLys: 3.423 ± 0.027
5.162GluLeu: 5.162 ± 0.03
1.489GluMet: 1.489 ± 0.013
1.795GluAsn: 1.795 ± 0.014
2.609GluPro: 2.609 ± 0.022
2.557GluGln: 2.557 ± 0.02
4.289GluArg: 4.289 ± 0.03
4.024GluSer: 4.024 ± 0.028
3.359GluThr: 3.359 ± 0.02
3.994GluVal: 3.994 ± 0.025
0.837GluTrp: 0.837 ± 0.011
1.55GluTyr: 1.55 ± 0.015
0.0GluXaa: 0.0 ± 0.0
Phe
3.378PheAla: 3.378 ± 0.018
0.462PheCys: 0.462 ± 0.007
2.215PheAsp: 2.215 ± 0.016
2.044PheGlu: 2.044 ± 0.015
1.315PhePhe: 1.315 ± 0.016
2.819PheGly: 2.819 ± 0.025
0.769PheHis: 0.769 ± 0.009
1.472PheIle: 1.472 ± 0.018
1.313PheLys: 1.313 ± 0.013
2.919PheLeu: 2.919 ± 0.022
0.709PheMet: 0.709 ± 0.01
1.244PheAsn: 1.244 ± 0.012
1.686PhePro: 1.686 ± 0.017
1.117PheGln: 1.117 ± 0.01
1.755PheArg: 1.755 ± 0.014
2.541PheSer: 2.541 ± 0.017
2.163PheThr: 2.163 ± 0.019
2.246PheVal: 2.246 ± 0.018
0.56PheTrp: 0.56 ± 0.008
0.94PheTyr: 0.94 ± 0.011
0.0PheXaa: 0.0 ± 0.0
Gly
6.119GlyAla: 6.119 ± 0.031
0.781GlyCys: 0.781 ± 0.01
3.842GlyAsp: 3.842 ± 0.02
4.151GlyGlu: 4.151 ± 0.025
2.733GlyPhe: 2.733 ± 0.022
6.6GlyGly: 6.6 ± 0.046
1.683GlyHis: 1.683 ± 0.015
3.314GlyIle: 3.314 ± 0.026
3.816GlyLys: 3.816 ± 0.025
6.173GlyLeu: 6.173 ± 0.031
1.891GlyMet: 1.891 ± 0.017
2.53GlyAsn: 2.53 ± 0.022
3.196GlyPro: 3.196 ± 0.023
2.67GlyGln: 2.67 ± 0.02
4.36GlyArg: 4.36 ± 0.025
6.04GlySer: 6.04 ± 0.037
4.316GlyThr: 4.316 ± 0.028
4.737GlyVal: 4.737 ± 0.026
1.202GlyTrp: 1.202 ± 0.013
2.159GlyTyr: 2.159 ± 0.018
0.0GlyXaa: 0.0 ± 0.0
His
2.218HisAla: 2.218 ± 0.018
0.31HisCys: 0.31 ± 0.006
1.433HisAsp: 1.433 ± 0.014
1.383HisGlu: 1.383 ± 0.014
0.893HisPhe: 0.893 ± 0.011
1.754HisGly: 1.754 ± 0.016
0.785HisHis: 0.785 ± 0.011
1.075HisIle: 1.075 ± 0.013
0.871HisLys: 0.871 ± 0.011
2.172HisLeu: 2.172 ± 0.017
0.504HisMet: 0.504 ± 0.008
0.79HisAsn: 0.79 ± 0.011
1.537HisPro: 1.537 ± 0.015
0.895HisGln: 0.895 ± 0.01
1.491HisArg: 1.491 ± 0.015
1.789HisSer: 1.789 ± 0.016
1.383HisThr: 1.383 ± 0.013
1.527HisVal: 1.527 ± 0.013
0.351HisTrp: 0.351 ± 0.006
0.676HisTyr: 0.676 ± 0.009
0.0HisXaa: 0.0 ± 0.0
Ile
4.513IleAla: 4.513 ± 0.024
0.636IleCys: 0.636 ± 0.009
2.821IleAsp: 2.821 ± 0.017
2.716IleGlu: 2.716 ± 0.02
1.648IlePhe: 1.648 ± 0.018
3.067IleGly: 3.067 ± 0.026
0.989IleHis: 0.989 ± 0.011
2.069IleIle: 2.069 ± 0.017
1.911IleLys: 1.911 ± 0.014
3.88IleLeu: 3.88 ± 0.025
0.937IleMet: 0.937 ± 0.01
1.562IleAsn: 1.562 ± 0.014
2.593IlePro: 2.593 ± 0.017
1.46IleGln: 1.46 ± 0.012
2.48IleArg: 2.48 ± 0.017
3.409IleSer: 3.409 ± 0.02
2.884IleThr: 2.884 ± 0.024
2.97IleVal: 2.97 ± 0.022
0.637IleTrp: 0.637 ± 0.008
1.219IleTyr: 1.219 ± 0.011
0.0IleXaa: 0.0 ± 0.0
Lys
4.718LysAla: 4.718 ± 0.033
0.439LysCys: 0.439 ± 0.007
2.648LysAsp: 2.648 ± 0.02
3.022LysGlu: 3.022 ± 0.025
1.24LysPhe: 1.24 ± 0.012
3.186LysGly: 3.186 ± 0.024
1.152LysHis: 1.152 ± 0.012
1.983LysIle: 1.983 ± 0.016
3.028LysLys: 3.028 ± 0.035
4.02LysLeu: 4.02 ± 0.022
0.983LysMet: 0.983 ± 0.011
1.358LysAsn: 1.358 ± 0.012
2.785LysPro: 2.785 ± 0.022
1.913LysGln: 1.913 ± 0.016
3.568LysArg: 3.568 ± 0.025
3.443LysSer: 3.443 ± 0.02
2.764LysThr: 2.764 ± 0.02
2.878LysVal: 2.878 ± 0.02
0.634LysTrp: 0.634 ± 0.009
1.247LysTyr: 1.247 ± 0.011
0.0LysXaa: 0.0 ± 0.0
Leu
8.447LeuAla: 8.447 ± 0.034
1.084LeuCys: 1.084 ± 0.011
5.198LeuAsp: 5.198 ± 0.025
5.343LeuGlu: 5.343 ± 0.032
2.913LeuPhe: 2.913 ± 0.02
5.897LeuGly: 5.897 ± 0.03
2.28LeuHis: 2.28 ± 0.017
3.585LeuIle: 3.585 ± 0.025
3.919LeuLys: 3.919 ± 0.025
8.094LeuLeu: 8.094 ± 0.049
1.698LeuMet: 1.698 ± 0.013
2.865LeuAsn: 2.865 ± 0.018
5.666LeuPro: 5.666 ± 0.026
3.921LeuGln: 3.921 ± 0.025
5.864LeuArg: 5.864 ± 0.028
6.862LeuSer: 6.862 ± 0.031
5.124LeuThr: 5.124 ± 0.025
5.125LeuVal: 5.125 ± 0.029
1.098LeuTrp: 1.098 ± 0.012
2.202LeuTyr: 2.202 ± 0.017
0.0LeuXaa: 0.0 ± 0.0
Met
2.37MetAla: 2.37 ± 0.015
0.225MetCys: 0.225 ± 0.005
1.218MetAsp: 1.218 ± 0.011
1.237MetGlu: 1.237 ± 0.012
0.703MetPhe: 0.703 ± 0.01
1.506MetGly: 1.506 ± 0.014
0.549MetHis: 0.549 ± 0.009
0.914MetIle: 0.914 ± 0.011
0.973MetLys: 0.973 ± 0.012
2.003MetLeu: 2.003 ± 0.02
0.559MetMet: 0.559 ± 0.009
0.725MetAsn: 0.725 ± 0.01
1.453MetPro: 1.453 ± 0.014
1.007MetGln: 1.007 ± 0.011
1.378MetArg: 1.378 ± 0.013
1.916MetSer: 1.916 ± 0.014
1.272MetThr: 1.272 ± 0.013
1.198MetVal: 1.198 ± 0.011
0.262MetTrp: 0.262 ± 0.006
0.552MetTyr: 0.552 ± 0.008
0.0MetXaa: 0.0 ± 0.0
Asn
3.457AsnAla: 3.457 ± 0.021
0.352AsnCys: 0.352 ± 0.006
1.9AsnAsp: 1.9 ± 0.015
1.832AsnGlu: 1.832 ± 0.013
1.185AsnPhe: 1.185 ± 0.011
3.296AsnGly: 3.296 ± 0.023
0.717AsnHis: 0.717 ± 0.009
1.68AsnIle: 1.68 ± 0.013
1.275AsnLys: 1.275 ± 0.011
2.794AsnLeu: 2.794 ± 0.019
0.734AsnMet: 0.734 ± 0.01
1.206AsnAsn: 1.206 ± 0.014
2.086AsnPro: 2.086 ± 0.016
1.016AsnGln: 1.016 ± 0.014
1.649AsnArg: 1.649 ± 0.013
2.331AsnSer: 2.331 ± 0.016
2.152AsnThr: 2.152 ± 0.018
2.258AsnVal: 2.258 ± 0.018
0.469AsnTrp: 0.469 ± 0.007
0.909AsnTyr: 0.909 ± 0.009
0.0AsnXaa: 0.0 ± 0.0
Pro
6.135ProAla: 6.135 ± 0.038
0.403ProCys: 0.403 ± 0.008
3.112ProAsp: 3.112 ± 0.021
3.472ProGlu: 3.472 ± 0.021
1.835ProPhe: 1.835 ± 0.015
4.03ProGly: 4.03 ± 0.022
1.334ProHis: 1.334 ± 0.014
2.4ProIle: 2.4 ± 0.017
2.613ProLys: 2.613 ± 0.019
4.634ProLeu: 4.634 ± 0.024
1.15ProMet: 1.15 ± 0.013
2.002ProAsn: 2.002 ± 0.017
5.388ProPro: 5.388 ± 0.056
2.536ProGln: 2.536 ± 0.022
3.363ProArg: 3.363 ± 0.023
6.045ProSer: 6.045 ± 0.037
4.302ProThr: 4.302 ± 0.027
3.56ProVal: 3.56 ± 0.023
0.667ProTrp: 0.667 ± 0.009
1.491ProTyr: 1.491 ± 0.013
0.0ProXaa: 0.0 ± 0.0
Gln
3.848GlnAla: 3.848 ± 0.024
0.422GlnCys: 0.422 ± 0.007
2.116GlnAsp: 2.116 ± 0.016
2.113GlnGlu: 2.113 ± 0.018
1.078GlnPhe: 1.078 ± 0.01
2.515GlnGly: 2.515 ± 0.017
1.226GlnHis: 1.226 ± 0.013
1.783GlnIle: 1.783 ± 0.014
1.716GlnLys: 1.716 ± 0.017
3.333GlnLeu: 3.333 ± 0.024
0.921GlnMet: 0.921 ± 0.011
1.333GlnAsn: 1.333 ± 0.013
2.636GlnPro: 2.636 ± 0.022
2.78GlnGln: 2.78 ± 0.039
2.851GlnArg: 2.851 ± 0.021
3.269GlnSer: 3.269 ± 0.023
2.441GlnThr: 2.441 ± 0.02
2.201GlnVal: 2.201 ± 0.016
0.584GlnTrp: 0.584 ± 0.008
1.233GlnTyr: 1.233 ± 0.014
0.0GlnXaa: 0.0 ± 0.0
Arg
5.256ArgAla: 5.256 ± 0.027
0.643ArgCys: 0.643 ± 0.009
3.493ArgAsp: 3.493 ± 0.025
4.066ArgGlu: 4.066 ± 0.033
1.938ArgPhe: 1.938 ± 0.014
3.911ArgGly: 3.911 ± 0.028
1.537ArgHis: 1.537 ± 0.015
2.661ArgIle: 2.661 ± 0.018
3.646ArgLys: 3.646 ± 0.026
5.378ArgLeu: 5.378 ± 0.027
1.436ArgMet: 1.436 ± 0.013
2.076ArgAsn: 2.076 ± 0.016
3.59ArgPro: 3.59 ± 0.028
2.737ArgGln: 2.737 ± 0.021
5.171ArgArg: 5.171 ± 0.04
4.934ArgSer: 4.934 ± 0.033
3.561ArgThr: 3.561 ± 0.02
3.494ArgVal: 3.494 ± 0.021
0.88ArgTrp: 0.88 ± 0.011
1.57ArgTyr: 1.57 ± 0.017
0.0ArgXaa: 0.0 ± 0.0
Ser
7.662SerAla: 7.662 ± 0.039
0.71SerCys: 0.71 ± 0.01
4.304SerAsp: 4.304 ± 0.022
4.031SerGlu: 4.031 ± 0.024
2.63SerPhe: 2.63 ± 0.017
5.86SerGly: 5.86 ± 0.03
1.873SerHis: 1.873 ± 0.016
3.648SerIle: 3.648 ± 0.019
3.578SerLys: 3.578 ± 0.025
6.815SerLeu: 6.815 ± 0.031
1.759SerMet: 1.759 ± 0.015
2.812SerAsn: 2.812 ± 0.018
5.478SerPro: 5.478 ± 0.043
3.144SerGln: 3.144 ± 0.023
4.902SerArg: 4.902 ± 0.035
8.382SerSer: 8.382 ± 0.059
6.089SerThr: 6.089 ± 0.04
4.555SerVal: 4.555 ± 0.025
1.018SerTrp: 1.018 ± 0.012
2.017SerTyr: 2.017 ± 0.016
0.0SerXaa: 0.0 ± 0.0
Thr
6.31ThrAla: 6.31 ± 0.033
0.698ThrCys: 0.698 ± 0.01
2.94ThrAsp: 2.94 ± 0.018
3.051ThrGlu: 3.051 ± 0.018
2.199ThrPhe: 2.199 ± 0.016
4.405ThrGly: 4.405 ± 0.026
1.364ThrHis: 1.364 ± 0.013
2.959ThrIle: 2.959 ± 0.022
2.638ThrLys: 2.638 ± 0.019
5.606ThrLeu: 5.606 ± 0.036
1.24ThrMet: 1.24 ± 0.012
2.1ThrAsn: 2.1 ± 0.016
4.728ThrPro: 4.728 ± 0.031
2.294ThrGln: 2.294 ± 0.019
3.303ThrArg: 3.303 ± 0.021
5.943ThrSer: 5.943 ± 0.034
4.815ThrThr: 4.815 ± 0.042
3.796ThrVal: 3.796 ± 0.029
0.86ThrTrp: 0.86 ± 0.009
1.676ThrTyr: 1.676 ± 0.012
0.0ThrXaa: 0.0 ± 0.0
Val
5.825ValAla: 5.825 ± 0.029
0.737ValCys: 0.737 ± 0.008
3.745ValAsp: 3.745 ± 0.022
4.096ValGlu: 4.096 ± 0.025
2.21ValPhe: 2.21 ± 0.019
4.176ValGly: 4.176 ± 0.024
1.452ValHis: 1.452 ± 0.013
2.685ValIle: 2.685 ± 0.02
3.12ValLys: 3.12 ± 0.025
5.582ValLeu: 5.582 ± 0.032
1.357ValMet: 1.357 ± 0.013
2.191ValAsn: 2.191 ± 0.016
3.652ValPro: 3.652 ± 0.02
2.532ValGln: 2.532 ± 0.017
3.697ValArg: 3.697 ± 0.02
4.481ValSer: 4.481 ± 0.021
3.657ValThr: 3.657 ± 0.027
4.347ValVal: 4.347 ± 0.029
0.852ValTrp: 0.852 ± 0.01
1.699ValTyr: 1.699 ± 0.016
0.0ValXaa: 0.0 ± 0.0
Trp
1.08TrpAla: 1.08 ± 0.011
0.191TrpCys: 0.191 ± 0.005
0.849TrpAsp: 0.849 ± 0.01
0.811TrpGlu: 0.811 ± 0.009
0.487TrpPhe: 0.487 ± 0.008
0.868TrpGly: 0.868 ± 0.012
0.366TrpHis: 0.366 ± 0.007
0.66TrpIle: 0.66 ± 0.01
0.701TrpLys: 0.701 ± 0.009
1.348TrpLeu: 1.348 ± 0.015
0.341TrpMet: 0.341 ± 0.007
0.531TrpAsn: 0.531 ± 0.007
0.598TrpPro: 0.598 ± 0.009
0.645TrpGln: 0.645 ± 0.009
0.985TrpArg: 0.985 ± 0.01
1.037TrpSer: 1.037 ± 0.012
0.923TrpThr: 0.923 ± 0.012
0.828TrpVal: 0.828 ± 0.009
0.257TrpTrp: 0.257 ± 0.005
0.414TrpTyr: 0.414 ± 0.007
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.409TyrAla: 2.409 ± 0.017
0.38TyrCys: 0.38 ± 0.006
1.666TyrAsp: 1.666 ± 0.015
1.507TyrGlu: 1.507 ± 0.014
1.088TyrPhe: 1.088 ± 0.012
2.168TyrGly: 2.168 ± 0.018
0.681TyrHis: 0.681 ± 0.009
1.231TyrIle: 1.231 ± 0.011
1.016TyrLys: 1.016 ± 0.012
2.437TyrLeu: 2.437 ± 0.018
0.584TyrMet: 0.584 ± 0.01
1.046TyrAsn: 1.046 ± 0.012
1.396TyrPro: 1.396 ± 0.013
1.04TyrGln: 1.04 ± 0.013
1.497TyrArg: 1.497 ± 0.013
1.914TyrSer: 1.914 ± 0.017
1.734TyrThr: 1.734 ± 0.014
1.592TyrVal: 1.592 ± 0.013
0.408TyrTrp: 0.408 ± 0.008
0.883TyrTyr: 0.883 ± 0.01
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 18598 proteins (9390076 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski