Amino acid dipeptide frequency for Streptomyces sp. 846.5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
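As a rough illustration of what such a calculation could look like, the sketch below counts overlapping residue pairs in a list of protein sequences. It assumes one-letter codes, maps any non-standard letter to X (Xaa), and uses a hypothetical function name, dipeptide_permille; it is not necessarily how Proteome-pI computes its values.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs: residues (0,1), (1,2), (2,3), ...
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input: "MAAL" gives MA, AA, AL and "AAG" gives AA, AG.
freqs = dipeptide_permille(["MAAL", "AAG"])
```

Pairs are counted within each protein only, so the last residue of one sequence is never paired with the first residue of the next.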

There are 400 possible dipeptides (20 × 20; 441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred with equal probability in proteins, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
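The comparison against the uniform expectation can be sketched as follows. The four observed values are copied from the table; the function name classify and the simple above/below rule are assumptions for illustration, since the page does not state the exact threshold used for the colouring.

```python
# Uniform expectation: one out of 400 dipeptides, i.e. 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400

# A few observed values taken from the table (per mille).
observed = {"AlaAla": 22.896, "LeuLeu": 12.261, "CysCys": 0.11, "TrpTrp": 0.319}

def classify(value, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"

labels = {pair: classify(value) for pair, value in observed.items()}
# Fold change relative to the uniform expectation.
ratios = {pair: value / EXPECTED_PERMILLE for pair, value in observed.items()}

for pair in observed:
    print(f"{pair}: {labels[pair]} ({ratios[pair]:.2f}x expected)")
```

Note that this baseline ignores the very different single amino acid frequencies; a stricter null model would multiply the two individual frequencies instead.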

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
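A minimal sketch of the unit conversion, using the AlaAla value from the table; the helper names are hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert per mille to percent (divide by 10)."""
    return value_permille / 10.0

ala_ala = 22.896                               # per mille, from the table
fraction = permille_to_fraction(ala_ala)       # about 0.0229
percent = permille_to_percent(ala_ala)         # about 2.29 %
```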

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 22.896 ± 0.159
AlaCys: 1.099 ± 0.022
AlaAsp: 8.144 ± 0.067
AlaGlu: 8.618 ± 0.093
AlaPhe: 3.602 ± 0.043
AlaGly: 13.037 ± 0.09
AlaHis: 2.671 ± 0.031
AlaIle: 3.587 ± 0.042
AlaLys: 2.65 ± 0.041
AlaLeu: 14.964 ± 0.113
AlaMet: 2.564 ± 0.031
AlaAsn: 2.262 ± 0.034
AlaPro: 7.132 ± 0.068
AlaGln: 4.299 ± 0.045
AlaArg: 9.311 ± 0.079
AlaSer: 6.56 ± 0.066
AlaThr: 7.594 ± 0.068
AlaVal: 12.872 ± 0.097
AlaTrp: 1.894 ± 0.028
AlaTyr: 2.592 ± 0.034
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.062 ± 0.019
CysCys: 0.11 ± 0.006
CysAsp: 0.463 ± 0.015
CysGlu: 0.359 ± 0.013
CysPhe: 0.233 ± 0.01
CysGly: 0.935 ± 0.02
CysHis: 0.198 ± 0.01
CysIle: 0.19 ± 0.008
CysLys: 0.107 ± 0.007
CysLeu: 0.782 ± 0.017
CysMet: 0.113 ± 0.007
CysAsn: 0.154 ± 0.009
CysPro: 0.486 ± 0.015
CysGln: 0.177 ± 0.008
CysArg: 0.599 ± 0.018
CysSer: 0.536 ± 0.013
CysThr: 0.549 ± 0.018
CysVal: 0.602 ± 0.015
CysTrp: 0.131 ± 0.007
CysTyr: 0.185 ± 0.008
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.168 ± 0.059
AspCys: 0.422 ± 0.014
AspAsp: 3.035 ± 0.041
AspGlu: 3.245 ± 0.039
AspPhe: 1.589 ± 0.029
AspGly: 5.803 ± 0.053
AspHis: 1.344 ± 0.022
AspIle: 1.761 ± 0.031
AspLys: 0.845 ± 0.021
AspLeu: 6.229 ± 0.061
AspMet: 0.713 ± 0.017
AspAsn: 0.976 ± 0.023
AspPro: 4.41 ± 0.045
AspGln: 1.816 ± 0.026
AspArg: 4.506 ± 0.057
AspSer: 2.788 ± 0.034
AspThr: 2.989 ± 0.033
AspVal: 4.073 ± 0.043
AspTrp: 1.011 ± 0.023
AspTyr: 1.224 ± 0.019
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.776 ± 0.085
GluCys: 0.328 ± 0.012
GluAsp: 2.458 ± 0.034
GluGlu: 2.894 ± 0.046
GluPhe: 1.339 ± 0.024
GluGly: 3.707 ± 0.041
GluHis: 1.417 ± 0.025
GluIle: 2.037 ± 0.033
GluLys: 0.984 ± 0.023
GluLeu: 6.84 ± 0.066
GluMet: 0.779 ± 0.019
GluAsn: 0.916 ± 0.02
GluPro: 3.041 ± 0.044
GluGln: 2.547 ± 0.034
GluArg: 4.765 ± 0.058
GluSer: 2.424 ± 0.03
GluThr: 2.574 ± 0.035
GluVal: 4.097 ± 0.044
GluTrp: 0.701 ± 0.019
GluTyr: 1.04 ± 0.019
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.745 ± 0.042
PheCys: 0.271 ± 0.01
PheAsp: 1.914 ± 0.03
PheGlu: 1.323 ± 0.025
PhePhe: 0.843 ± 0.021
PheGly: 3.059 ± 0.039
PheHis: 0.635 ± 0.015
PheIle: 0.738 ± 0.02
PheLys: 0.458 ± 0.015
PheLeu: 2.577 ± 0.034
PheMet: 0.381 ± 0.012
PheAsn: 0.633 ± 0.019
PhePro: 1.376 ± 0.022
PheGln: 0.812 ± 0.016
PheArg: 1.755 ± 0.026
PheSer: 1.611 ± 0.025
PheThr: 2.04 ± 0.031
PheVal: 2.139 ± 0.03
PheTrp: 0.44 ± 0.012
PheTyr: 0.61 ± 0.016
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 11.188 ± 0.08
GlyCys: 0.855 ± 0.018
GlyAsp: 4.563 ± 0.039
GlyGlu: 4.423 ± 0.048
GlyPhe: 2.972 ± 0.042
GlyGly: 8.606 ± 0.088
GlyHis: 2.166 ± 0.031
GlyIle: 3.602 ± 0.038
GlyLys: 2.063 ± 0.034
GlyLeu: 9.838 ± 0.075
GlyMet: 2.067 ± 0.029
GlyAsn: 1.88 ± 0.03
GlyPro: 4.909 ± 0.05
GlyGln: 2.959 ± 0.038
GlyArg: 7.287 ± 0.064
GlySer: 6.359 ± 0.065
GlyThr: 6.595 ± 0.062
GlyVal: 7.54 ± 0.078
GlyTrp: 1.747 ± 0.028
GlyTyr: 2.413 ± 0.038
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.613 ± 0.032
HisCys: 0.219 ± 0.011
HisAsp: 1.205 ± 0.025
HisGlu: 1.009 ± 0.018
HisPhe: 0.62 ± 0.015
HisGly: 2.328 ± 0.035
HisHis: 0.676 ± 0.017
HisIle: 0.663 ± 0.016
HisLys: 0.299 ± 0.011
HisLeu: 2.427 ± 0.031
HisMet: 0.314 ± 0.012
HisAsn: 0.41 ± 0.013
HisPro: 1.832 ± 0.03
HisGln: 0.759 ± 0.017
HisArg: 1.984 ± 0.028
HisSer: 1.124 ± 0.023
HisThr: 1.291 ± 0.023
HisVal: 1.504 ± 0.026
HisTrp: 0.39 ± 0.013
HisTyr: 0.471 ± 0.014
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.947 ± 0.043
IleCys: 0.316 ± 0.01
IleAsp: 2.263 ± 0.031
IleGlu: 1.785 ± 0.028
IlePhe: 0.737 ± 0.018
IleGly: 3.62 ± 0.04
IleHis: 0.635 ± 0.015
IleIle: 0.948 ± 0.025
IleLys: 0.676 ± 0.02
IleLeu: 2.522 ± 0.031
IleMet: 0.442 ± 0.014
IleAsn: 0.784 ± 0.019
IlePro: 1.889 ± 0.028
IleGln: 0.858 ± 0.02
IleArg: 2.234 ± 0.033
IleSer: 1.935 ± 0.029
IleThr: 2.361 ± 0.03
IleVal: 2.705 ± 0.04
IleTrp: 0.399 ± 0.014
IleTyr: 0.572 ± 0.015
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.616 ± 0.042
LysCys: 0.082 ± 0.006
LysAsp: 1.022 ± 0.022
LysGlu: 0.888 ± 0.022
LysPhe: 0.383 ± 0.015
LysGly: 1.52 ± 0.029
LysHis: 0.42 ± 0.013
LysIle: 0.749 ± 0.019
LysLys: 0.521 ± 0.022
LysLeu: 1.803 ± 0.027
LysMet: 0.311 ± 0.012
LysAsn: 0.436 ± 0.017
LysPro: 1.176 ± 0.024
LysGln: 0.684 ± 0.015
LysArg: 1.247 ± 0.026
LysSer: 1.054 ± 0.025
LysThr: 1.096 ± 0.027
LysVal: 1.673 ± 0.029
LysTrp: 0.214 ± 0.009
LysTyr: 0.4 ± 0.014
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 15.908 ± 0.105
LeuCys: 0.879 ± 0.019
LeuAsp: 6.668 ± 0.057
LeuGlu: 4.634 ± 0.054
LeuPhe: 2.644 ± 0.037
LeuGly: 9.608 ± 0.064
LeuHis: 2.371 ± 0.028
LeuIle: 3.42 ± 0.038
LeuLys: 1.775 ± 0.03
LeuLeu: 12.261 ± 0.113
LeuMet: 1.642 ± 0.027
LeuAsn: 1.857 ± 0.028
LeuPro: 6.677 ± 0.063
LeuGln: 2.614 ± 0.031
LeuArg: 8.591 ± 0.075
LeuSer: 5.786 ± 0.051
LeuThr: 7.039 ± 0.056
LeuVal: 9.203 ± 0.076
LeuTrp: 1.311 ± 0.021
LeuTyr: 1.867 ± 0.029
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.282 ± 0.028
MetCys: 0.123 ± 0.006
MetAsp: 0.89 ± 0.019
MetGlu: 0.699 ± 0.019
MetPhe: 0.444 ± 0.015
MetGly: 1.298 ± 0.024
MetHis: 0.352 ± 0.012
MetIle: 0.6 ± 0.016
MetLys: 0.369 ± 0.013
MetLeu: 1.762 ± 0.028
MetMet: 0.298 ± 0.012
MetAsn: 0.46 ± 0.012
MetPro: 1.073 ± 0.021
MetGln: 0.477 ± 0.014
MetArg: 1.327 ± 0.027
MetSer: 1.344 ± 0.022
MetThr: 1.527 ± 0.022
MetVal: 1.359 ± 0.023
MetTrp: 0.197 ± 0.01
MetTyr: 0.301 ± 0.009
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.43 ± 0.03
AsnCys: 0.176 ± 0.009
AsnAsp: 0.95 ± 0.02
AsnGlu: 0.785 ± 0.019
AsnPhe: 0.548 ± 0.014
AsnGly: 2.275 ± 0.041
AsnHis: 0.417 ± 0.013
AsnIle: 0.712 ± 0.016
AsnLys: 0.372 ± 0.014
AsnLeu: 1.874 ± 0.031
AsnMet: 0.308 ± 0.012
AsnAsn: 0.524 ± 0.02
AsnPro: 1.501 ± 0.027
AsnGln: 0.63 ± 0.02
AsnArg: 1.287 ± 0.023
AsnSer: 1.178 ± 0.024
AsnThr: 1.297 ± 0.032
AsnVal: 1.434 ± 0.027
AsnTrp: 0.347 ± 0.013
AsnTyr: 0.45 ± 0.012
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.607 ± 0.066
ProCys: 0.333 ± 0.011
ProAsp: 4.06 ± 0.045
ProGlu: 3.971 ± 0.053
ProPhe: 1.527 ± 0.025
ProGly: 6.486 ± 0.05
ProHis: 1.218 ± 0.023
ProIle: 1.478 ± 0.026
ProLys: 1.015 ± 0.023
ProLeu: 5.536 ± 0.053
ProMet: 1.022 ± 0.021
ProAsn: 1.046 ± 0.021
ProPro: 3.284 ± 0.059
ProGln: 2.079 ± 0.034
ProArg: 3.614 ± 0.042
ProSer: 3.574 ± 0.041
ProThr: 3.612 ± 0.048
ProVal: 5.51 ± 0.047
ProTrp: 0.876 ± 0.018
ProTyr: 1.311 ± 0.025
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.372 ± 0.051
GlnCys: 0.187 ± 0.009
GlnAsp: 1.562 ± 0.024
GlnGlu: 1.446 ± 0.027
GlnPhe: 0.796 ± 0.019
GlnGly: 2.525 ± 0.03
GlnHis: 0.764 ± 0.019
GlnIle: 1.249 ± 0.022
GlnLys: 0.539 ± 0.016
GlnLeu: 3.596 ± 0.034
GlnMet: 0.513 ± 0.015
GlnAsn: 0.636 ± 0.02
GlnPro: 2.014 ± 0.033
GlnGln: 1.715 ± 0.037
GlnArg: 2.599 ± 0.035
GlnSer: 1.619 ± 0.027
GlnThr: 1.575 ± 0.03
GlnVal: 2.861 ± 0.027
GlnTrp: 0.528 ± 0.015
GlnTyr: 0.738 ± 0.019
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.187 ± 0.079
ArgCys: 0.56 ± 0.016
ArgAsp: 3.651 ± 0.044
ArgGlu: 4.208 ± 0.055
ArgPhe: 2.222 ± 0.027
ArgGly: 5.383 ± 0.057
ArgHis: 1.909 ± 0.028
ArgIle: 3.285 ± 0.038
ArgLys: 1.375 ± 0.026
ArgLeu: 8.544 ± 0.073
ArgMet: 1.687 ± 0.024
ArgAsn: 1.399 ± 0.022
ArgPro: 4.672 ± 0.048
ArgGln: 2.474 ± 0.032
ArgArg: 7.347 ± 0.072
ArgSer: 4.36 ± 0.048
ArgThr: 4.919 ± 0.048
ArgVal: 5.313 ± 0.051
ArgTrp: 1.276 ± 0.025
ArgTyr: 1.706 ± 0.026
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.822 ± 0.077
SerCys: 0.472 ± 0.014
SerAsp: 2.724 ± 0.032
SerGlu: 2.382 ± 0.031
SerPhe: 1.605 ± 0.026
SerGly: 6.721 ± 0.067
SerHis: 1.1 ± 0.02
SerIle: 1.727 ± 0.03
SerLys: 1.013 ± 0.025
SerLeu: 5.219 ± 0.058
SerMet: 1.139 ± 0.02
SerAsn: 1.208 ± 0.027
SerPro: 3.361 ± 0.044
SerGln: 1.525 ± 0.029
SerArg: 3.782 ± 0.042
SerSer: 4.028 ± 0.069
SerThr: 3.999 ± 0.058
SerVal: 4.571 ± 0.047
SerTrp: 1.067 ± 0.024
SerTyr: 1.388 ± 0.024
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 9.666 ± 0.084
ThrCys: 0.454 ± 0.013
ThrAsp: 3.42 ± 0.038
ThrGlu: 2.921 ± 0.039
ThrPhe: 1.65 ± 0.028
ThrGly: 6.983 ± 0.068
ThrHis: 1.124 ± 0.024
ThrIle: 1.878 ± 0.035
ThrLys: 1.043 ± 0.026
ThrLeu: 6.029 ± 0.059
ThrMet: 0.943 ± 0.021
ThrAsn: 1.176 ± 0.024
ThrPro: 4.205 ± 0.043
ThrGln: 1.55 ± 0.026
ThrArg: 3.709 ± 0.039
ThrSer: 3.687 ± 0.048
ThrThr: 4.308 ± 0.069
ThrVal: 6.404 ± 0.056
ThrTrp: 0.91 ± 0.019
ThrTyr: 1.237 ± 0.026
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.76 ± 0.083
ValCys: 0.74 ± 0.02
ValAsp: 4.86 ± 0.049
ValGlu: 4.419 ± 0.055
ValPhe: 2.463 ± 0.033
ValGly: 6.845 ± 0.058
ValHis: 1.923 ± 0.026
ValIle: 2.867 ± 0.038
ValLys: 1.54 ± 0.031
ValLeu: 9.81 ± 0.074
ValMet: 1.388 ± 0.022
ValAsn: 1.842 ± 0.035
ValPro: 5.122 ± 0.049
ValGln: 2.392 ± 0.03
ValArg: 6.438 ± 0.053
ValSer: 4.694 ± 0.052
ValThr: 5.499 ± 0.056
ValVal: 7.973 ± 0.083
ValTrp: 1.121 ± 0.023
ValTyr: 1.584 ± 0.032
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.63 ± 0.03
TrpCys: 0.155 ± 0.009
TrpAsp: 0.784 ± 0.02
TrpGlu: 0.701 ± 0.016
TrpPhe: 0.509 ± 0.015
TrpGly: 1.092 ± 0.025
TrpHis: 0.365 ± 0.012
TrpIle: 0.59 ± 0.017
TrpLys: 0.333 ± 0.011
TrpLeu: 1.798 ± 0.026
TrpMet: 0.304 ± 0.01
TrpAsn: 0.424 ± 0.014
TrpPro: 0.794 ± 0.018
TrpGln: 0.663 ± 0.016
TrpArg: 1.266 ± 0.023
TrpSer: 1.05 ± 0.021
TrpThr: 1.124 ± 0.02
TrpVal: 0.982 ± 0.021
TrpTrp: 0.319 ± 0.012
TrpTyr: 0.375 ± 0.012
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.673 ± 0.034
TyrCys: 0.192 ± 0.009
TyrAsp: 1.31 ± 0.024
TyrGlu: 0.954 ± 0.02
TyrPhe: 0.636 ± 0.017
TyrGly: 2.202 ± 0.028
TyrHis: 0.433 ± 0.013
TyrIle: 0.548 ± 0.018
TyrLys: 0.318 ± 0.013
TyrLeu: 2.345 ± 0.032
TyrMet: 0.25 ± 0.011
TyrAsn: 0.474 ± 0.015
TyrPro: 1.207 ± 0.026
TyrGln: 0.805 ± 0.02
TyrArg: 1.831 ± 0.028
TyrSer: 1.138 ± 0.025
TyrThr: 1.307 ± 0.025
TyrVal: 1.466 ± 0.025
TyrTrp: 0.379 ± 0.013
TyrTyr: 0.553 ± 0.017
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7726 proteins (2609667 amino acids)
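To give a feel for the scale behind the per mille values, the sketch below estimates how many dipeptide positions the proteome contains and roughly how many of them are AlaAla. It assumes that a protein of length n contributes n − 1 overlapping pairs, which the page does not state explicitly; the variable and function names are illustrative.

```python
N_PROTEINS = 7726        # from the statistics line
N_RESIDUES = 2609667     # from the statistics line

# Assumption: each protein of length n yields n - 1 overlapping dipeptides,
# so the total number of pairs is residues minus proteins.
n_pairs = N_RESIDUES - N_PROTEINS

def approx_count(value_permille, total_pairs=n_pairs):
    """Approximate number of occurrences for a frequency given in per mille."""
    return value_permille * 1e-3 * total_pairs

ala_ala_count = approx_count(22.896)   # AlaAla, the most frequent dipeptide
print(f"{n_pairs} dipeptide positions, about {ala_ala_count:.0f} AlaAla")
```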

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
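Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates serves as the error. The function names, the toy sequences, and the use of the standard deviation as the reported error are assumptions; the page only states that 100 bootstrap rounds were used.

```python
import random
from statistics import mean, stdev

def pair_permille(sequences, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    hits = total = 0
    for seq in sequences:
        for a, b in zip(seq, seq[1:]):   # overlapping pairs within a protein
            total += 1
            hits += (a + b == pair)
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) over protein-level bootstrap replicates."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return mean(estimates), stdev(estimates)

# Toy proteome for illustration only.
proteins = ["MAAL", "AAGA", "MKLV", "AALA"]
avg, err = bootstrap_error(proteins, "AA")
print(f"AA: {avg:.1f} ± {err:.1f} per mille")
```

Resampling proteins rather than individual residues keeps the pairs within each protein together, so the error reflects variation between proteins.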

The above dipeptide statistics (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski