Amino acid dipepetide frequency for Coleophoma cylindrospora

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.152AlaAla: 9.152 ± 0.052
1.133AlaCys: 1.133 ± 0.013
4.069AlaAsp: 4.069 ± 0.026
5.111AlaGlu: 5.111 ± 0.033
3.28AlaPhe: 3.28 ± 0.023
5.969AlaGly: 5.969 ± 0.032
1.666AlaHis: 1.666 ± 0.017
4.685AlaIle: 4.685 ± 0.027
4.157AlaLys: 4.157 ± 0.028
7.859AlaLeu: 7.859 ± 0.044
2.053AlaMet: 2.053 ± 0.018
3.051AlaAsn: 3.051 ± 0.022
4.519AlaPro: 4.519 ± 0.038
3.348AlaGln: 3.348 ± 0.023
4.44AlaArg: 4.44 ± 0.027
7.474AlaSer: 7.474 ± 0.049
5.62AlaThr: 5.62 ± 0.036
5.497AlaVal: 5.497 ± 0.033
1.241AlaTrp: 1.241 ± 0.016
2.276AlaTyr: 2.276 ± 0.022
0.0AlaXaa: 0.0 ± 0.0
Cys
0.992CysAla: 0.992 ± 0.013
0.243CysCys: 0.243 ± 0.007
0.65CysAsp: 0.65 ± 0.011
0.597CysGlu: 0.597 ± 0.009
0.579CysPhe: 0.579 ± 0.009
0.973CysGly: 0.973 ± 0.014
0.298CysHis: 0.298 ± 0.007
0.805CysIle: 0.805 ± 0.011
0.514CysLys: 0.514 ± 0.009
1.3CysLeu: 1.3 ± 0.015
0.277CysMet: 0.277 ± 0.006
0.457CysAsn: 0.457 ± 0.01
0.629CysPro: 0.629 ± 0.012
0.456CysGln: 0.456 ± 0.008
0.669CysArg: 0.669 ± 0.009
0.999CysSer: 0.999 ± 0.014
0.767CysThr: 0.767 ± 0.014
0.821CysVal: 0.821 ± 0.014
0.219CysTrp: 0.219 ± 0.006
0.377CysTyr: 0.377 ± 0.008
0.0CysXaa: 0.0 ± 0.0
Asp
4.457AspAla: 4.457 ± 0.029
0.642AspCys: 0.642 ± 0.011
3.767AspAsp: 3.767 ± 0.037
4.14AspGlu: 4.14 ± 0.031
2.294AspPhe: 2.294 ± 0.021
3.979AspGly: 3.979 ± 0.026
1.171AspHis: 1.171 ± 0.014
3.198AspIle: 3.198 ± 0.024
2.222AspLys: 2.222 ± 0.019
4.993AspLeu: 4.993 ± 0.035
1.263AspMet: 1.263 ± 0.013
1.841AspAsn: 1.841 ± 0.013
3.048AspPro: 3.048 ± 0.023
1.783AspGln: 1.783 ± 0.019
2.656AspArg: 2.656 ± 0.02
4.012AspSer: 4.012 ± 0.031
2.93AspThr: 2.93 ± 0.019
3.565AspVal: 3.565 ± 0.024
0.886AspTrp: 0.886 ± 0.011
1.627AspTyr: 1.627 ± 0.017
0.0AspXaa: 0.0 ± 0.0
Glu
5.332GluAla: 5.332 ± 0.032
0.62GluCys: 0.62 ± 0.01
3.994GluAsp: 3.994 ± 0.031
5.082GluGlu: 5.082 ± 0.043
1.942GluPhe: 1.942 ± 0.016
3.625GluGly: 3.625 ± 0.026
1.353GluHis: 1.353 ± 0.015
3.284GluIle: 3.284 ± 0.026
3.759GluLys: 3.759 ± 0.028
5.045GluLeu: 5.045 ± 0.031
1.497GluMet: 1.497 ± 0.016
2.421GluAsn: 2.421 ± 0.018
2.577GluPro: 2.577 ± 0.029
2.324GluGln: 2.324 ± 0.023
3.536GluArg: 3.536 ± 0.027
4.292GluSer: 4.292 ± 0.028
3.36GluThr: 3.36 ± 0.025
3.626GluVal: 3.626 ± 0.024
0.865GluTrp: 0.865 ± 0.011
1.685GluTyr: 1.685 ± 0.017
0.0GluXaa: 0.0 ± 0.0
Phe
3.226PheAla: 3.226 ± 0.021
0.602PheCys: 0.602 ± 0.011
2.255PheAsp: 2.255 ± 0.019
2.227PheGlu: 2.227 ± 0.019
1.689PhePhe: 1.689 ± 0.02
3.038PheGly: 3.038 ± 0.025
0.896PheHis: 0.896 ± 0.014
1.97PheIle: 1.97 ± 0.019
1.549PheLys: 1.549 ± 0.013
3.592PheLeu: 3.592 ± 0.028
0.85PheMet: 0.85 ± 0.011
1.497PheAsn: 1.497 ± 0.015
1.961PhePro: 1.961 ± 0.017
1.494PheGln: 1.494 ± 0.013
1.827PheArg: 1.827 ± 0.019
3.137PheSer: 3.137 ± 0.02
2.319PheThr: 2.319 ± 0.02
2.465PheVal: 2.465 ± 0.02
0.695PheTrp: 0.695 ± 0.009
1.174PheTyr: 1.174 ± 0.015
0.0PheXaa: 0.0 ± 0.0
Gly
5.427GlyAla: 5.427 ± 0.038
0.934GlyCys: 0.934 ± 0.013
3.451GlyAsp: 3.451 ± 0.022
3.474GlyGlu: 3.474 ± 0.026
2.994GlyPhe: 2.994 ± 0.025
5.72GlyGly: 5.72 ± 0.05
1.615GlyHis: 1.615 ± 0.018
3.941GlyIle: 3.941 ± 0.031
3.545GlyLys: 3.545 ± 0.027
6.29GlyLeu: 6.29 ± 0.039
1.679GlyMet: 1.679 ± 0.017
2.73GlyAsn: 2.73 ± 0.021
3.263GlyPro: 3.263 ± 0.042
2.464GlyGln: 2.464 ± 0.021
3.799GlyArg: 3.799 ± 0.026
5.916GlySer: 5.916 ± 0.037
4.344GlyThr: 4.344 ± 0.039
4.412GlyVal: 4.412 ± 0.028
1.215GlyTrp: 1.215 ± 0.018
2.232GlyTyr: 2.232 ± 0.02
0.0GlyXaa: 0.0 ± 0.0
His
1.75HisAla: 1.75 ± 0.018
0.318HisCys: 0.318 ± 0.008
1.249HisAsp: 1.249 ± 0.014
1.285HisGlu: 1.285 ± 0.013
0.907HisPhe: 0.907 ± 0.013
1.662HisGly: 1.662 ± 0.015
0.746HisHis: 0.746 ± 0.015
1.187HisIle: 1.187 ± 0.012
0.902HisLys: 0.902 ± 0.013
2.101HisLeu: 2.101 ± 0.02
0.471HisMet: 0.471 ± 0.008
0.839HisAsn: 0.839 ± 0.012
1.489HisPro: 1.489 ± 0.017
0.927HisGln: 0.927 ± 0.014
1.349HisArg: 1.349 ± 0.015
1.753HisSer: 1.753 ± 0.019
1.233HisThr: 1.233 ± 0.014
1.342HisVal: 1.342 ± 0.014
0.346HisTrp: 0.346 ± 0.007
0.659HisTyr: 0.659 ± 0.011
0.0HisXaa: 0.0 ± 0.0
Ile
4.567IleAla: 4.567 ± 0.027
0.849IleCys: 0.849 ± 0.011
2.997IleAsp: 2.997 ± 0.021
3.029IleGlu: 3.029 ± 0.023
2.255IlePhe: 2.255 ± 0.019
3.544IleGly: 3.544 ± 0.031
1.211IleHis: 1.211 ± 0.013
2.865IleIle: 2.865 ± 0.026
2.327IleLys: 2.327 ± 0.021
4.991IleLeu: 4.991 ± 0.03
1.144IleMet: 1.144 ± 0.013
2.019IleAsn: 2.019 ± 0.017
3.258IlePro: 3.258 ± 0.029
2.036IleGln: 2.036 ± 0.02
2.651IleArg: 2.651 ± 0.021
4.438IleSer: 4.438 ± 0.034
3.264IleThr: 3.264 ± 0.026
3.401IleVal: 3.401 ± 0.026
0.839IleTrp: 0.839 ± 0.011
1.574IleTyr: 1.574 ± 0.015
0.0IleXaa: 0.0 ± 0.0
Lys
4.422LysAla: 4.422 ± 0.032
0.507LysCys: 0.507 ± 0.01
2.857LysAsp: 2.857 ± 0.023
3.4LysGlu: 3.4 ± 0.029
1.551LysPhe: 1.551 ± 0.016
2.987LysGly: 2.987 ± 0.021
1.122LysHis: 1.122 ± 0.013
2.482LysIle: 2.482 ± 0.021
3.244LysLys: 3.244 ± 0.04
4.146LysLeu: 4.146 ± 0.023
1.055LysMet: 1.055 ± 0.012
1.793LysAsn: 1.793 ± 0.018
2.561LysPro: 2.561 ± 0.022
1.778LysGln: 1.778 ± 0.016
3.086LysArg: 3.086 ± 0.029
3.733LysSer: 3.733 ± 0.027
2.962LysThr: 2.962 ± 0.025
2.887LysVal: 2.887 ± 0.024
0.697LysTrp: 0.697 ± 0.01
1.478LysTyr: 1.478 ± 0.015
0.0LysXaa: 0.0 ± 0.0
Leu
7.921LeuAla: 7.921 ± 0.032
1.246LeuCys: 1.246 ± 0.015
5.034LeuAsp: 5.034 ± 0.032
5.553LeuGlu: 5.553 ± 0.033
3.366LeuPhe: 3.366 ± 0.027
6.169LeuGly: 6.169 ± 0.033
2.157LeuHis: 2.157 ± 0.019
4.183LeuIle: 4.183 ± 0.029
4.293LeuLys: 4.293 ± 0.03
8.479LeuLeu: 8.479 ± 0.051
1.824LeuMet: 1.824 ± 0.019
3.276LeuAsn: 3.276 ± 0.023
5.406LeuPro: 5.406 ± 0.034
3.955LeuGln: 3.955 ± 0.031
5.117LeuArg: 5.117 ± 0.033
7.448LeuSer: 7.448 ± 0.042
4.969LeuThr: 4.969 ± 0.035
5.482LeuVal: 5.482 ± 0.032
1.269LeuTrp: 1.269 ± 0.017
2.459LeuTyr: 2.459 ± 0.021
0.0LeuXaa: 0.0 ± 0.0
Met
2.328MetAla: 2.328 ± 0.02
0.251MetCys: 0.251 ± 0.006
1.259MetAsp: 1.259 ± 0.015
1.311MetGlu: 1.311 ± 0.016
0.797MetPhe: 0.797 ± 0.013
1.522MetGly: 1.522 ± 0.014
0.479MetHis: 0.479 ± 0.009
1.12MetIle: 1.12 ± 0.015
1.105MetLys: 1.105 ± 0.013
1.906MetLeu: 1.906 ± 0.019
0.584MetMet: 0.584 ± 0.011
0.855MetAsn: 0.855 ± 0.01
1.27MetPro: 1.27 ± 0.014
0.886MetGln: 0.886 ± 0.012
1.171MetArg: 1.171 ± 0.014
1.92MetSer: 1.92 ± 0.017
1.373MetThr: 1.373 ± 0.014
1.38MetVal: 1.38 ± 0.013
0.281MetTrp: 0.281 ± 0.006
0.552MetTyr: 0.552 ± 0.008
0.0MetXaa: 0.0 ± 0.0
Asn
3.22AsnAla: 3.22 ± 0.024
0.485AsnCys: 0.485 ± 0.01
1.942AsnAsp: 1.942 ± 0.018
2.043AsnGlu: 2.043 ± 0.016
1.502AsnPhe: 1.502 ± 0.017
3.192AsnGly: 3.192 ± 0.025
0.866AsnHis: 0.866 ± 0.013
2.274AsnIle: 2.274 ± 0.02
1.54AsnLys: 1.54 ± 0.014
3.435AsnLeu: 3.435 ± 0.021
0.899AsnMet: 0.899 ± 0.012
1.57AsnAsn: 1.57 ± 0.02
2.386AsnPro: 2.386 ± 0.022
1.401AsnGln: 1.401 ± 0.015
1.754AsnArg: 1.754 ± 0.017
3.117AsnSer: 3.117 ± 0.024
2.45AsnThr: 2.45 ± 0.021
2.352AsnVal: 2.352 ± 0.019
0.594AsnTrp: 0.594 ± 0.009
1.157AsnTyr: 1.157 ± 0.012
0.0AsnXaa: 0.0 ± 0.0
Pro
4.999ProAla: 4.999 ± 0.04
0.478ProCys: 0.478 ± 0.008
2.888ProAsp: 2.888 ± 0.024
3.602ProGlu: 3.602 ± 0.031
2.014ProPhe: 2.014 ± 0.017
3.754ProGly: 3.754 ± 0.038
1.137ProHis: 1.137 ± 0.014
2.705ProIle: 2.705 ± 0.021
2.647ProLys: 2.647 ± 0.027
4.598ProLeu: 4.598 ± 0.029
1.083ProMet: 1.083 ± 0.014
2.187ProAsn: 2.187 ± 0.018
4.207ProPro: 4.207 ± 0.053
2.364ProGln: 2.364 ± 0.025
2.994ProArg: 2.994 ± 0.026
5.601ProSer: 5.601 ± 0.042
3.945ProThr: 3.945 ± 0.034
3.425ProVal: 3.425 ± 0.031
0.734ProTrp: 0.734 ± 0.01
1.543ProTyr: 1.543 ± 0.017
0.0ProXaa: 0.0 ± 0.0
Gln
3.425GlnAla: 3.425 ± 0.023
0.451GlnCys: 0.451 ± 0.009
2.072GlnAsp: 2.072 ± 0.019
2.365GlnGlu: 2.365 ± 0.02
1.301GlnPhe: 1.301 ± 0.015
2.382GlnGly: 2.382 ± 0.021
1.023GlnHis: 1.023 ± 0.015
2.016GlnIle: 2.016 ± 0.019
2.055GlnLys: 2.055 ± 0.02
3.329GlnLeu: 3.329 ± 0.025
0.873GlnMet: 0.873 ± 0.012
1.664GlnAsn: 1.664 ± 0.015
2.27GlnPro: 2.27 ± 0.027
2.162GlnGln: 2.162 ± 0.032
2.411GlnArg: 2.411 ± 0.022
3.135GlnSer: 3.135 ± 0.026
2.324GlnThr: 2.324 ± 0.021
2.186GlnVal: 2.186 ± 0.02
0.583GlnTrp: 0.583 ± 0.011
1.248GlnTyr: 1.248 ± 0.015
0.0GlnXaa: 0.0 ± 0.0
Arg
4.233ArgAla: 4.233 ± 0.027
0.644ArgCys: 0.644 ± 0.01
2.94ArgAsp: 2.94 ± 0.021
3.411ArgGlu: 3.411 ± 0.03
1.97ArgPhe: 1.97 ± 0.017
3.38ArgGly: 3.38 ± 0.028
1.318ArgHis: 1.318 ± 0.015
2.791ArgIle: 2.791 ± 0.022
3.216ArgLys: 3.216 ± 0.027
4.896ArgLeu: 4.896 ± 0.032
1.213ArgMet: 1.213 ± 0.013
2.15ArgAsn: 2.15 ± 0.021
3.02ArgPro: 3.02 ± 0.025
2.311ArgGln: 2.311 ± 0.021
4.265ArgArg: 4.265 ± 0.035
4.35ArgSer: 4.35 ± 0.032
3.057ArgThr: 3.057 ± 0.025
2.987ArgVal: 2.987 ± 0.021
0.869ArgTrp: 0.869 ± 0.01
1.556ArgTyr: 1.556 ± 0.015
0.0ArgXaa: 0.0 ± 0.0
Ser
6.811SerAla: 6.811 ± 0.046
0.944SerCys: 0.944 ± 0.013
4.068SerAsp: 4.068 ± 0.028
4.147SerGlu: 4.147 ± 0.029
3.144SerPhe: 3.144 ± 0.023
5.719SerGly: 5.719 ± 0.047
1.821SerHis: 1.821 ± 0.018
4.575SerIle: 4.575 ± 0.033
3.969SerLys: 3.969 ± 0.032
7.308SerLeu: 7.308 ± 0.042
1.871SerMet: 1.871 ± 0.016
3.302SerAsn: 3.302 ± 0.025
5.243SerPro: 5.243 ± 0.041
3.312SerGln: 3.312 ± 0.027
4.55SerArg: 4.55 ± 0.034
9.275SerSer: 9.275 ± 0.099
6.285SerThr: 6.285 ± 0.066
4.746SerVal: 4.746 ± 0.029
1.199SerTrp: 1.199 ± 0.012
2.296SerTyr: 2.296 ± 0.019
0.0SerXaa: 0.0 ± 0.0
Thr
5.511ThrAla: 5.511 ± 0.034
0.802ThrCys: 0.802 ± 0.013
2.842ThrAsp: 2.842 ± 0.022
3.143ThrGlu: 3.143 ± 0.021
2.462ThrPhe: 2.462 ± 0.02
4.42ThrGly: 4.42 ± 0.044
1.211ThrHis: 1.211 ± 0.013
3.567ThrIle: 3.567 ± 0.038
2.784ThrLys: 2.784 ± 0.021
5.478ThrLeu: 5.478 ± 0.041
1.292ThrMet: 1.292 ± 0.014
2.343ThrAsn: 2.343 ± 0.02
4.214ThrPro: 4.214 ± 0.04
2.139ThrGln: 2.139 ± 0.02
2.887ThrArg: 2.887 ± 0.023
5.995ThrSer: 5.995 ± 0.056
4.866ThrThr: 4.866 ± 0.056
3.892ThrVal: 3.892 ± 0.036
0.928ThrTrp: 0.928 ± 0.011
1.809ThrTyr: 1.809 ± 0.018
0.0ThrXaa: 0.0 ± 0.0
Val
5.395ValAla: 5.395 ± 0.03
0.82ValCys: 0.82 ± 0.012
3.576ValAsp: 3.576 ± 0.024
3.866ValGlu: 3.866 ± 0.024
2.533ValPhe: 2.533 ± 0.022
4.143ValGly: 4.143 ± 0.033
1.326ValHis: 1.326 ± 0.013
3.19ValIle: 3.19 ± 0.024
2.914ValLys: 2.914 ± 0.023
5.727ValLeu: 5.727 ± 0.034
1.366ValMet: 1.366 ± 0.015
2.278ValAsn: 2.278 ± 0.019
3.446ValPro: 3.446 ± 0.023
2.374ValGln: 2.374 ± 0.018
3.07ValArg: 3.07 ± 0.022
4.66ValSer: 4.66 ± 0.03
3.72ValThr: 3.72 ± 0.033
4.442ValVal: 4.442 ± 0.032
0.894ValTrp: 0.894 ± 0.012
1.774ValTyr: 1.774 ± 0.018
0.0ValXaa: 0.0 ± 0.0
Trp
1.188TrpAla: 1.188 ± 0.014
0.21TrpCys: 0.21 ± 0.005
0.881TrpAsp: 0.881 ± 0.013
0.844TrpGlu: 0.844 ± 0.012
0.576TrpPhe: 0.576 ± 0.01
1.005TrpGly: 1.005 ± 0.013
0.348TrpHis: 0.348 ± 0.007
0.856TrpIle: 0.856 ± 0.012
0.841TrpLys: 0.841 ± 0.012
1.393TrpLeu: 1.393 ± 0.015
0.39TrpMet: 0.39 ± 0.007
0.685TrpAsn: 0.685 ± 0.01
0.637TrpPro: 0.637 ± 0.01
0.587TrpGln: 0.587 ± 0.01
0.879TrpArg: 0.879 ± 0.01
1.117TrpSer: 1.117 ± 0.012
0.988TrpThr: 0.988 ± 0.013
0.93TrpVal: 0.93 ± 0.011
0.292TrpTrp: 0.292 ± 0.007
0.481TrpTyr: 0.481 ± 0.01
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.246TyrAla: 2.246 ± 0.02
0.444TyrCys: 0.444 ± 0.008
1.684TyrAsp: 1.684 ± 0.016
1.588TyrGlu: 1.588 ± 0.015
1.307TyrPhe: 1.307 ± 0.015
2.264TyrGly: 2.264 ± 0.021
0.753TyrHis: 0.753 ± 0.011
1.564TyrIle: 1.564 ± 0.017
1.165TyrLys: 1.165 ± 0.013
2.741TyrLeu: 2.741 ± 0.025
0.663TyrMet: 0.663 ± 0.011
1.221TyrAsn: 1.221 ± 0.012
1.501TyrPro: 1.501 ± 0.017
1.175TyrGln: 1.175 ± 0.017
1.466TyrArg: 1.466 ± 0.014
2.213TyrSer: 2.213 ± 0.018
1.807TyrThr: 1.807 ± 0.02
1.675TyrVal: 1.675 ± 0.016
0.483TyrTrp: 0.483 ± 0.009
0.997TyrTyr: 0.997 ± 0.013
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 14176 proteins (7204061 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski