Amino acid dipepetide frequency for Paramecium tetraurelia

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.569AlaAla: 1.569 ± 0.015
0.706AlaCys: 0.706 ± 0.017
1.618AlaAsp: 1.618 ± 0.011
2.118AlaGlu: 2.118 ± 0.013
1.845AlaPhe: 1.845 ± 0.011
1.275AlaGly: 1.275 ± 0.011
0.581AlaHis: 0.581 ± 0.007
3.183AlaIle: 3.183 ± 0.02
2.97AlaLys: 2.97 ± 0.015
3.67AlaLeu: 3.67 ± 0.018
0.708AlaMet: 0.708 ± 0.007
2.137AlaAsn: 2.137 ± 0.012
1.055AlaPro: 1.055 ± 0.01
2.795AlaGln: 2.795 ± 0.015
1.171AlaArg: 1.171 ± 0.009
2.328AlaSer: 2.328 ± 0.015
1.683AlaThr: 1.683 ± 0.015
1.632AlaVal: 1.632 ± 0.013
0.259AlaTrp: 0.259 ± 0.004
1.479AlaTyr: 1.479 ± 0.011
0.0AlaXaa: 0.0 ± 0.0
Cys
0.633CysAla: 0.633 ± 0.013
0.317CysCys: 0.317 ± 0.004
0.948CysAsp: 0.948 ± 0.014
1.008CysGlu: 1.008 ± 0.012
0.953CysPhe: 0.953 ± 0.009
0.79CysGly: 0.79 ± 0.01
0.329CysHis: 0.329 ± 0.006
1.462CysIle: 1.462 ± 0.017
1.461CysLys: 1.461 ± 0.017
1.895CysLeu: 1.895 ± 0.018
0.332CysMet: 0.332 ± 0.006
1.078CysAsn: 1.078 ± 0.013
0.586CysPro: 0.586 ± 0.008
1.611CysGln: 1.611 ± 0.022
0.583CysArg: 0.583 ± 0.008
1.392CysSer: 1.392 ± 0.023
0.927CysThr: 0.927 ± 0.027
0.941CysVal: 0.941 ± 0.019
0.129CysTrp: 0.129 ± 0.003
0.856CysTyr: 0.856 ± 0.012
0.0CysXaa: 0.0 ± 0.0
Asp
1.675AspAla: 1.675 ± 0.012
0.939AspCys: 0.939 ± 0.013
2.962AspAsp: 2.962 ± 0.018
3.757AspGlu: 3.757 ± 0.017
2.737AspPhe: 2.737 ± 0.014
2.163AspGly: 2.163 ± 0.02
0.892AspHis: 0.892 ± 0.007
3.974AspIle: 3.974 ± 0.017
3.984AspLys: 3.984 ± 0.019
5.119AspLeu: 5.119 ± 0.018
0.904AspMet: 0.904 ± 0.007
3.061AspAsn: 3.061 ± 0.015
1.596AspPro: 1.596 ± 0.01
4.411AspGln: 4.411 ± 0.02
1.59AspArg: 1.59 ± 0.012
3.311AspSer: 3.311 ± 0.015
1.885AspThr: 1.885 ± 0.011
2.253AspVal: 2.253 ± 0.011
0.439AspTrp: 0.439 ± 0.006
2.361AspTyr: 2.361 ± 0.012
0.001AspXaa: 0.001 ± 0.0
Glu
2.323GluAla: 2.323 ± 0.017
1.146GluCys: 1.146 ± 0.014
3.489GluAsp: 3.489 ± 0.018
5.628GluGlu: 5.628 ± 0.038
3.237GluPhe: 3.237 ± 0.015
2.15GluGly: 2.15 ± 0.014
0.969GluHis: 0.969 ± 0.008
6.098GluIle: 6.098 ± 0.022
6.234GluLys: 6.234 ± 0.03
6.457GluLeu: 6.457 ± 0.028
1.568GluMet: 1.568 ± 0.009
4.798GluAsn: 4.798 ± 0.019
1.297GluPro: 1.297 ± 0.011
6.201GluGln: 6.201 ± 0.03
2.431GluArg: 2.431 ± 0.016
3.896GluSer: 3.896 ± 0.015
2.744GluThr: 2.744 ± 0.012
2.916GluVal: 2.916 ± 0.015
0.503GluTrp: 0.503 ± 0.006
2.709GluTyr: 2.709 ± 0.013
0.0GluXaa: 0.0 ± 0.0
Phe
1.754PheAla: 1.754 ± 0.012
0.903PheCys: 0.903 ± 0.007
2.86PheAsp: 2.86 ± 0.015
3.33PheGlu: 3.33 ± 0.014
2.228PhePhe: 2.228 ± 0.014
2.273PheGly: 2.273 ± 0.015
0.914PheHis: 0.914 ± 0.007
4.01PheIle: 4.01 ± 0.019
4.126PheLys: 4.126 ± 0.017
4.624PheLeu: 4.624 ± 0.022
1.048PheMet: 1.048 ± 0.008
3.404PheAsn: 3.404 ± 0.016
1.27PhePro: 1.27 ± 0.008
4.098PheGln: 4.098 ± 0.017
1.678PheArg: 1.678 ± 0.011
3.564PheSer: 3.564 ± 0.019
2.321PheThr: 2.321 ± 0.013
2.275PheVal: 2.275 ± 0.011
0.462PheTrp: 0.462 ± 0.005
2.359PheTyr: 2.359 ± 0.013
0.001PheXaa: 0.001 ± 0.0
Gly
1.346GlyAla: 1.346 ± 0.012
0.895GlyCys: 0.895 ± 0.019
1.878GlyAsp: 1.878 ± 0.012
2.071GlyGlu: 2.071 ± 0.014
1.977GlyPhe: 1.977 ± 0.014
1.753GlyGly: 1.753 ± 0.016
0.674GlyHis: 0.674 ± 0.008
2.881GlyIle: 2.881 ± 0.016
2.845GlyLys: 2.845 ± 0.016
3.268GlyLeu: 3.268 ± 0.017
0.764GlyMet: 0.764 ± 0.008
2.179GlyAsn: 2.179 ± 0.015
0.793GlyPro: 0.793 ± 0.009
2.676GlyGln: 2.676 ± 0.017
1.296GlyArg: 1.296 ± 0.011
2.484GlySer: 2.484 ± 0.017
1.796GlyThr: 1.796 ± 0.017
1.867GlyVal: 1.867 ± 0.011
0.383GlyTrp: 0.383 ± 0.006
1.769GlyTyr: 1.769 ± 0.015
0.001GlyXaa: 0.001 ± 0.0
His
0.51HisAla: 0.51 ± 0.006
0.308HisCys: 0.308 ± 0.005
0.699HisAsp: 0.699 ± 0.007
0.957HisGlu: 0.957 ± 0.008
1.046HisPhe: 1.046 ± 0.007
0.615HisGly: 0.615 ± 0.008
0.496HisHis: 0.496 ± 0.007
1.32HisIle: 1.32 ± 0.009
1.461HisLys: 1.461 ± 0.009
1.902HisLeu: 1.902 ± 0.013
0.31HisMet: 0.31 ± 0.004
1.101HisAsn: 1.101 ± 0.009
0.75HisPro: 0.75 ± 0.007
1.695HisGln: 1.695 ± 0.011
0.691HisArg: 0.691 ± 0.007
1.369HisSer: 1.369 ± 0.01
0.74HisThr: 0.74 ± 0.007
0.652HisVal: 0.652 ± 0.006
0.127HisTrp: 0.127 ± 0.003
0.89HisTyr: 0.89 ± 0.008
0.0HisXaa: 0.0 ± 0.0
Ile
2.933IleAla: 2.933 ± 0.013
1.511IleCys: 1.511 ± 0.012
4.475IleAsp: 4.475 ± 0.019
5.7IleGlu: 5.7 ± 0.02
3.939IlePhe: 3.939 ± 0.02
2.854IleGly: 2.854 ± 0.015
1.544IleHis: 1.544 ± 0.01
6.977IleIle: 6.977 ± 0.031
7.772IleLys: 7.772 ± 0.024
8.24IleLeu: 8.24 ± 0.034
1.745IleMet: 1.745 ± 0.011
5.927IleAsn: 5.927 ± 0.024
2.551IlePro: 2.551 ± 0.012
7.357IleGln: 7.357 ± 0.025
2.88IleArg: 2.88 ± 0.015
5.893IleSer: 5.893 ± 0.021
3.925IleThr: 3.925 ± 0.016
3.907IleVal: 3.907 ± 0.017
0.685IleTrp: 0.685 ± 0.007
3.5IleTyr: 3.5 ± 0.018
0.001IleXaa: 0.001 ± 0.0
Lys
3.004LysAla: 3.004 ± 0.015
1.625LysCys: 1.625 ± 0.021
4.258LysAsp: 4.258 ± 0.016
6.405LysGlu: 6.405 ± 0.027
3.886LysPhe: 3.886 ± 0.016
2.849LysGly: 2.849 ± 0.016
1.51LysHis: 1.51 ± 0.01
7.308LysIle: 7.308 ± 0.024
8.196LysLys: 8.196 ± 0.031
8.714LysLeu: 8.714 ± 0.03
2.104LysMet: 2.104 ± 0.012
5.723LysAsn: 5.723 ± 0.018
2.535LysPro: 2.535 ± 0.015
8.564LysGln: 8.564 ± 0.033
3.239LysArg: 3.239 ± 0.016
5.964LysSer: 5.964 ± 0.019
3.972LysThr: 3.972 ± 0.016
4.038LysVal: 4.038 ± 0.016
0.654LysTrp: 0.654 ± 0.007
3.728LysTyr: 3.728 ± 0.016
0.0LysXaa: 0.0 ± 0.0
Leu
3.579LeuAla: 3.579 ± 0.02
1.553LeuCys: 1.553 ± 0.012
4.85LeuAsp: 4.85 ± 0.018
6.655LeuGlu: 6.655 ± 0.032
4.739LeuPhe: 4.739 ± 0.023
3.294LeuGly: 3.294 ± 0.015
1.685LeuHis: 1.685 ± 0.01
8.544LeuIle: 8.544 ± 0.035
9.395LeuLys: 9.395 ± 0.031
9.326LeuLeu: 9.326 ± 0.032
2.156LeuMet: 2.156 ± 0.014
7.312LeuAsn: 7.312 ± 0.028
2.624LeuPro: 2.624 ± 0.012
8.305LeuGln: 8.305 ± 0.033
3.546LeuArg: 3.546 ± 0.016
6.839LeuSer: 6.839 ± 0.022
4.479LeuThr: 4.479 ± 0.017
4.14LeuVal: 4.14 ± 0.015
0.713LeuTrp: 0.713 ± 0.008
3.674LeuTyr: 3.674 ± 0.016
0.002LeuXaa: 0.002 ± 0.0
Met
0.88MetAla: 0.88 ± 0.009
0.273MetCys: 0.273 ± 0.004
1.081MetAsp: 1.081 ± 0.009
1.356MetGlu: 1.356 ± 0.01
0.882MetPhe: 0.882 ± 0.007
0.779MetGly: 0.779 ± 0.008
0.402MetHis: 0.402 ± 0.005
1.977MetIle: 1.977 ± 0.011
2.18MetLys: 2.18 ± 0.012
1.918MetLeu: 1.918 ± 0.012
0.578MetMet: 0.578 ± 0.006
1.761MetAsn: 1.761 ± 0.013
0.569MetPro: 0.569 ± 0.006
1.597MetGln: 1.597 ± 0.01
0.791MetArg: 0.791 ± 0.007
1.42MetSer: 1.42 ± 0.009
0.924MetThr: 0.924 ± 0.007
0.861MetVal: 0.861 ± 0.008
0.133MetTrp: 0.133 ± 0.003
0.657MetTyr: 0.657 ± 0.005
0.0MetXaa: 0.0 ± 0.0
Asn
2.073AsnAla: 2.073 ± 0.013
1.36AsnCys: 1.36 ± 0.02
3.336AsnAsp: 3.336 ± 0.014
4.612AsnGlu: 4.612 ± 0.02
3.248AsnPhe: 3.248 ± 0.016
2.413AsnGly: 2.413 ± 0.018
1.345AsnHis: 1.345 ± 0.009
5.292AsnIle: 5.292 ± 0.022
5.993AsnLys: 5.993 ± 0.021
6.759AsnLeu: 6.759 ± 0.025
1.235AsnMet: 1.235 ± 0.01
5.116AsnAsn: 5.116 ± 0.022
2.251AsnPro: 2.251 ± 0.012
7.164AsnGln: 7.164 ± 0.028
2.273AsnArg: 2.273 ± 0.013
5.098AsnSer: 5.098 ± 0.02
2.996AsnThr: 2.996 ± 0.015
2.783AsnVal: 2.783 ± 0.013
0.477AsnTrp: 0.477 ± 0.006
3.253AsnTyr: 3.253 ± 0.016
0.001AsnXaa: 0.001 ± 0.0
Pro
0.915ProAla: 0.915 ± 0.009
0.43ProCys: 0.43 ± 0.008
1.334ProAsp: 1.334 ± 0.01
1.98ProGlu: 1.98 ± 0.015
1.509ProPhe: 1.509 ± 0.011
0.932ProGly: 0.932 ± 0.01
0.489ProHis: 0.489 ± 0.005
2.502ProIle: 2.502 ± 0.014
2.717ProLys: 2.717 ± 0.017
2.599ProLeu: 2.599 ± 0.014
0.46ProMet: 0.46 ± 0.005
2.106ProAsn: 2.106 ± 0.013
1.247ProPro: 1.247 ± 0.017
2.776ProGln: 2.776 ± 0.017
0.865ProArg: 0.865 ± 0.008
2.062ProSer: 2.062 ± 0.013
1.49ProThr: 1.49 ± 0.011
1.25ProVal: 1.25 ± 0.01
0.233ProTrp: 0.233 ± 0.004
1.255ProTyr: 1.255 ± 0.009
0.0ProXaa: 0.0 ± 0.0
Gln
2.831GlnAla: 2.831 ± 0.014
1.473GlnCys: 1.473 ± 0.024
3.831GlnAsp: 3.831 ± 0.019
6.112GlnGlu: 6.112 ± 0.031
4.418GlnPhe: 4.418 ± 0.016
2.516GlnGly: 2.516 ± 0.016
1.435GlnHis: 1.435 ± 0.009
8.186GlnIle: 8.186 ± 0.031
8.022GlnLys: 8.022 ± 0.029
8.942GlnLeu: 8.942 ± 0.037
2.137GlnMet: 2.137 ± 0.01
6.698GlnAsn: 6.698 ± 0.028
2.438GlnPro: 2.438 ± 0.019
11.055GlnGln: 11.055 ± 0.078
2.985GlnArg: 2.985 ± 0.016
6.132GlnSer: 6.132 ± 0.024
4.07GlnThr: 4.07 ± 0.02
3.733GlnVal: 3.733 ± 0.017
0.537GlnTrp: 0.537 ± 0.006
3.565GlnTyr: 3.565 ± 0.016
0.001GlnXaa: 0.001 ± 0.0
Arg
1.22ArgAla: 1.22 ± 0.009
0.555ArgCys: 0.555 ± 0.007
1.752ArgAsp: 1.752 ± 0.013
2.285ArgGlu: 2.285 ± 0.013
1.676ArgPhe: 1.676 ± 0.011
1.178ArgGly: 1.178 ± 0.011
0.564ArgHis: 0.564 ± 0.006
3.059ArgIle: 3.059 ± 0.014
3.232ArgLys: 3.232 ± 0.018
3.276ArgLeu: 3.276 ± 0.017
0.854ArgMet: 0.854 ± 0.008
2.338ArgAsn: 2.338 ± 0.013
0.932ArgPro: 0.932 ± 0.009
2.799ArgGln: 2.799 ± 0.015
1.6ArgArg: 1.6 ± 0.014
2.302ArgSer: 2.302 ± 0.013
1.527ArgThr: 1.527 ± 0.01
1.74ArgVal: 1.74 ± 0.012
0.287ArgTrp: 0.287 ± 0.005
1.339ArgTyr: 1.339 ± 0.01
0.0ArgXaa: 0.0 ± 0.0
Ser
2.294SerAla: 2.294 ± 0.014
1.342SerCys: 1.342 ± 0.025
3.337SerAsp: 3.337 ± 0.017
4.02SerGlu: 4.02 ± 0.015
3.585SerPhe: 3.585 ± 0.018
2.409SerGly: 2.409 ± 0.017
1.196SerHis: 1.196 ± 0.009
6.053SerIle: 6.053 ± 0.022
6.059SerLys: 6.059 ± 0.019
6.668SerLeu: 6.668 ± 0.022
1.298SerMet: 1.298 ± 0.009
5.016SerAsn: 5.016 ± 0.02
2.264SerPro: 2.264 ± 0.015
6.299SerGln: 6.299 ± 0.026
2.284SerArg: 2.284 ± 0.014
5.332SerSer: 5.332 ± 0.025
3.336SerThr: 3.336 ± 0.019
3.051SerVal: 3.051 ± 0.015
0.451SerTrp: 0.451 ± 0.006
2.762SerTyr: 2.762 ± 0.015
0.001SerXaa: 0.001 ± 0.0
Thr
1.657ThrAla: 1.657 ± 0.017
1.022ThrCys: 1.022 ± 0.029
2.047ThrAsp: 2.047 ± 0.015
2.56ThrGlu: 2.56 ± 0.012
2.358ThrPhe: 2.358 ± 0.012
1.637ThrGly: 1.637 ± 0.017
0.832ThrHis: 0.832 ± 0.008
4.02ThrIle: 4.02 ± 0.016
3.868ThrLys: 3.868 ± 0.016
4.635ThrLeu: 4.635 ± 0.015
0.828ThrMet: 0.828 ± 0.007
3.138ThrAsn: 3.138 ± 0.017
1.673ThrPro: 1.673 ± 0.011
3.953ThrGln: 3.953 ± 0.017
1.375ThrArg: 1.375 ± 0.009
3.152ThrSer: 3.152 ± 0.019
2.44ThrThr: 2.44 ± 0.021
1.956ThrVal: 1.956 ± 0.013
0.314ThrTrp: 0.314 ± 0.004
1.926ThrTyr: 1.926 ± 0.011
0.0ThrXaa: 0.0 ± 0.0
Val
1.819ValAla: 1.819 ± 0.011
0.864ValCys: 0.864 ± 0.011
2.468ValAsp: 2.468 ± 0.012
3.027ValGlu: 3.027 ± 0.016
2.247ValPhe: 2.247 ± 0.014
1.817ValGly: 1.817 ± 0.012
0.803ValHis: 0.803 ± 0.007
3.646ValIle: 3.646 ± 0.016
3.788ValLys: 3.788 ± 0.014
4.204ValLeu: 4.204 ± 0.016
0.92ValMet: 0.92 ± 0.007
2.876ValAsn: 2.876 ± 0.015
1.366ValPro: 1.366 ± 0.01
3.573ValGln: 3.573 ± 0.016
1.562ValArg: 1.562 ± 0.01
2.989ValSer: 2.989 ± 0.015
1.997ValThr: 1.997 ± 0.013
2.279ValVal: 2.279 ± 0.013
0.406ValTrp: 0.406 ± 0.005
1.893ValTyr: 1.893 ± 0.01
0.001ValXaa: 0.001 ± 0.0
Trp
0.318TrpAla: 0.318 ± 0.005
0.115TrpCys: 0.115 ± 0.002
0.494TrpAsp: 0.494 ± 0.008
0.44TrpGlu: 0.44 ± 0.005
0.356TrpPhe: 0.356 ± 0.005
0.303TrpGly: 0.303 ± 0.006
0.142TrpHis: 0.142 ± 0.003
0.691TrpIle: 0.691 ± 0.007
0.732TrpLys: 0.732 ± 0.006
0.659TrpLeu: 0.659 ± 0.007
0.193TrpMet: 0.193 ± 0.003
0.611TrpAsn: 0.611 ± 0.008
0.178TrpPro: 0.178 ± 0.003
0.437TrpGln: 0.437 ± 0.005
0.29TrpArg: 0.29 ± 0.004
0.544TrpSer: 0.544 ± 0.006
0.381TrpThr: 0.381 ± 0.006
0.411TrpVal: 0.411 ± 0.006
0.072TrpTrp: 0.072 ± 0.002
0.263TrpTyr: 0.263 ± 0.004
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.449TyrAla: 1.449 ± 0.01
0.897TyrCys: 0.897 ± 0.01
2.292TyrAsp: 2.292 ± 0.013
2.777TyrGlu: 2.777 ± 0.015
2.638TyrPhe: 2.638 ± 0.015
1.594TyrGly: 1.594 ± 0.012
0.834TyrHis: 0.834 ± 0.006
3.133TyrIle: 3.133 ± 0.015
3.243TyrLys: 3.243 ± 0.015
4.533TyrLeu: 4.533 ± 0.02
0.875TyrMet: 0.875 ± 0.008
2.696TyrAsn: 2.696 ± 0.015
1.175TyrPro: 1.175 ± 0.01
3.748TyrGln: 3.748 ± 0.017
1.385TyrArg: 1.385 ± 0.008
2.986TyrSer: 2.986 ± 0.014
1.762TyrThr: 1.762 ± 0.011
1.86TyrVal: 1.86 ± 0.011
0.363TyrTrp: 0.363 ± 0.005
2.28TyrTyr: 2.28 ± 0.015
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.001XaaIle: 0.001 ± 0.0
0.002XaaLys: 0.002 ± 0.0
0.001XaaLeu: 0.001 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.001XaaGln: 0.001 ± 0.0
0.001XaaArg: 0.001 ± 0.0
0.001XaaSer: 0.001 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.001XaaVal: 0.001 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
3.409XaaXaa: 3.409 ± 0.498
Statistics based on 39461 proteins (18009409 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski