Amino acid dipeptide frequency for Streptomyces sp. ICBB 8177

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
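The calculation can be illustrated with a minimal sketch. It assumes that overlapping adjacent pairs are counted within each protein (never across protein boundaries) and that any non-standard letter is mapped to X; the function name and the one-letter input format are illustrative choices, not the database's actual pipeline.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (the table uses three-letter codes).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille, pooled over all proteins.

    Assumption: overlapping adjacent pairs are counted inside each protein,
    and every non-standard letter is treated as 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # A protein of length n contributes n - 1 overlapping pairs.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input: "MAAL" gives MA, AA, AL and "AAG" gives AA, AG (5 pairs in total).
freqs = dipeptide_frequencies(["MAAL", "AAG"])
print(freqs["AA"])  # 2 of 5 pairs, i.e. 400.0 per mille
```

Because the frequencies are pooled over the whole proteome, long proteins contribute more pairs than short ones.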

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
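As a rough sketch of this comparison, the snippet below derives the uniform expectation and labels two values taken from the table. It assumes the baseline is the uniform 1/400 expectation described above; the function names and the "over"/"under" labels are hypothetical stand-ins for the red and blue markings.

```python
def uniform_expectation_permille(n_residue_types=20):
    """Expected per mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / (n_residue_types ** 2)

def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the expectation (red = over, blue = under)."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"

expected = uniform_expectation_permille()      # 1000 / 400 = 2.5 per mille
print(expected)
print(uniform_expectation_permille(21))        # with Xaa: 1000 / 441, about 2.27

# Two values from the table (in per mille).
print(classify(22.682, expected))  # AlaAla -> "over"
print(classify(0.107, expected))   # CysCys -> "under"
```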

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
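As a small worked example of the unit conversion, the sketch below turns the AlaAla value from the table into a plain fraction and a percentage. The helper names are illustrative only.

```python
def permille_to_fraction(value_permille):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert per mille to percent (divide by 10)."""
    return value_permille / 10.0

ala_ala = 22.682  # AlaAla frequency from the table, in per mille
print(permille_to_fraction(ala_ala))  # about 0.022682
print(permille_to_percent(ala_ala))   # about 2.2682
```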

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 22.682 ± 0.202
AlaCys: 1.245 ± 0.031
AlaAsp: 8.624 ± 0.075
AlaGlu: 8.43 ± 0.11
AlaPhe: 3.718 ± 0.047
AlaGly: 13.582 ± 0.107
AlaHis: 3.043 ± 0.04
AlaIle: 3.541 ± 0.052
AlaLys: 2.704 ± 0.051
AlaLeu: 14.801 ± 0.141
AlaMet: 2.596 ± 0.037
AlaAsn: 1.968 ± 0.041
AlaPro: 7.693 ± 0.092
AlaGln: 3.924 ± 0.057
AlaArg: 11.215 ± 0.112
AlaSer: 6.425 ± 0.084
AlaThr: 7.279 ± 0.08
AlaVal: 12.856 ± 0.119
AlaTrp: 1.899 ± 0.034
AlaTyr: 2.884 ± 0.046
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.225 ± 0.027
CysCys: 0.107 ± 0.008
CysAsp: 0.521 ± 0.016
CysGlu: 0.439 ± 0.016
CysPhe: 0.239 ± 0.011
CysGly: 1.085 ± 0.026
CysHis: 0.21 ± 0.012
CysIle: 0.126 ± 0.008
CysLys: 0.099 ± 0.007
CysLeu: 0.787 ± 0.024
CysMet: 0.118 ± 0.008
CysAsn: 0.133 ± 0.011
CysPro: 0.544 ± 0.018
CysGln: 0.164 ± 0.009
CysArg: 0.615 ± 0.019
CysSer: 0.44 ± 0.017
CysThr: 0.457 ± 0.017
CysVal: 0.771 ± 0.023
CysTrp: 0.145 ± 0.01
CysTyr: 0.161 ± 0.009
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.41 ± 0.07
AspCys: 0.408 ± 0.018
AspAsp: 3.746 ± 0.056
AspGlu: 3.896 ± 0.059
AspPhe: 1.571 ± 0.032
AspGly: 6.435 ± 0.076
AspHis: 1.477 ± 0.035
AspIle: 1.6 ± 0.039
AspLys: 0.984 ± 0.025
AspLeu: 6.159 ± 0.065
AspMet: 0.766 ± 0.021
AspAsn: 0.9 ± 0.024
AspPro: 4.736 ± 0.055
AspGln: 1.489 ± 0.034
AspArg: 4.913 ± 0.063
AspSer: 2.445 ± 0.042
AspThr: 3.02 ± 0.049
AspVal: 4.846 ± 0.064
AspTrp: 0.941 ± 0.023
AspTyr: 1.102 ± 0.028
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.386 ± 0.094
GluCys: 0.391 ± 0.017
GluAsp: 2.616 ± 0.043
GluGlu: 3.288 ± 0.064
GluPhe: 1.341 ± 0.03
GluGly: 4.351 ± 0.058
GluHis: 1.493 ± 0.034
GluIle: 2.012 ± 0.046
GluLys: 1.074 ± 0.032
GluLeu: 6.378 ± 0.069
GluMet: 0.871 ± 0.024
GluAsn: 0.864 ± 0.022
GluPro: 3.444 ± 0.054
GluGln: 2.008 ± 0.038
GluArg: 6.032 ± 0.081
GluSer: 2.301 ± 0.042
GluThr: 2.703 ± 0.046
GluVal: 4.54 ± 0.062
GluTrp: 0.753 ± 0.021
GluTyr: 1.004 ± 0.027
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.748 ± 0.049
PheCys: 0.288 ± 0.013
PheAsp: 1.909 ± 0.04
PheGlu: 1.361 ± 0.032
PhePhe: 0.868 ± 0.026
PheGly: 3.144 ± 0.041
PheHis: 0.655 ± 0.02
PheIle: 0.621 ± 0.021
PheLys: 0.46 ± 0.019
PheLeu: 2.458 ± 0.045
PheMet: 0.378 ± 0.015
PheAsn: 0.515 ± 0.019
PhePro: 1.343 ± 0.031
PheGln: 0.648 ± 0.019
PheArg: 1.797 ± 0.033
PheSer: 1.361 ± 0.026
PheThr: 2.069 ± 0.039
PheVal: 2.16 ± 0.039
PheTrp: 0.378 ± 0.016
PheTyr: 0.547 ± 0.018
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 11.818 ± 0.112
GlyCys: 0.896 ± 0.023
GlyAsp: 5.401 ± 0.067
GlyGlu: 5.244 ± 0.066
GlyPhe: 2.899 ± 0.047
GlyGly: 9.619 ± 0.122
GlyHis: 2.453 ± 0.041
GlyIle: 3.086 ± 0.046
GlyLys: 2.116 ± 0.041
GlyLeu: 9.475 ± 0.095
GlyMet: 2.028 ± 0.036
GlyAsn: 1.662 ± 0.036
GlyPro: 5.575 ± 0.071
GlyGln: 2.766 ± 0.044
GlyArg: 8.024 ± 0.081
GlySer: 5.407 ± 0.064
GlyThr: 6.581 ± 0.07
GlyVal: 8.0 ± 0.077
GlyTrp: 1.753 ± 0.038
GlyTyr: 2.365 ± 0.041
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.911 ± 0.051
HisCys: 0.207 ± 0.01
HisAsp: 1.46 ± 0.03
HisGlu: 1.189 ± 0.027
HisPhe: 0.635 ± 0.022
HisGly: 2.569 ± 0.041
HisHis: 0.734 ± 0.024
HisIle: 0.603 ± 0.02
HisLys: 0.327 ± 0.014
HisLeu: 2.386 ± 0.041
HisMet: 0.316 ± 0.013
HisAsn: 0.355 ± 0.015
HisPro: 1.847 ± 0.036
HisGln: 0.644 ± 0.021
HisArg: 2.072 ± 0.034
HisSer: 0.984 ± 0.025
HisThr: 1.335 ± 0.03
HisVal: 1.793 ± 0.035
HisTrp: 0.373 ± 0.015
HisTyr: 0.489 ± 0.017
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.555 ± 0.054
IleCys: 0.28 ± 0.013
IleAsp: 2.053 ± 0.04
IleGlu: 1.786 ± 0.037
IlePhe: 0.599 ± 0.02
IleGly: 3.392 ± 0.051
IleHis: 0.572 ± 0.017
IleIle: 0.757 ± 0.026
IleLys: 0.606 ± 0.019
IleLeu: 2.05 ± 0.036
IleMet: 0.395 ± 0.015
IleAsn: 0.65 ± 0.024
IlePro: 1.596 ± 0.027
IleGln: 0.65 ± 0.021
IleArg: 2.021 ± 0.043
IleSer: 1.493 ± 0.029
IleThr: 2.022 ± 0.041
IleVal: 2.571 ± 0.05
IleTrp: 0.318 ± 0.013
IleTyr: 0.458 ± 0.017
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.689 ± 0.05
LysCys: 0.118 ± 0.009
LysAsp: 1.107 ± 0.031
LysGlu: 0.969 ± 0.029
LysPhe: 0.395 ± 0.017
LysGly: 1.593 ± 0.037
LysHis: 0.37 ± 0.014
LysIle: 0.682 ± 0.023
LysLys: 0.621 ± 0.03
LysLeu: 1.764 ± 0.04
LysMet: 0.334 ± 0.016
LysAsn: 0.427 ± 0.016
LysPro: 1.212 ± 0.033
LysGln: 0.611 ± 0.022
LysArg: 1.364 ± 0.033
LysSer: 0.991 ± 0.029
LysThr: 1.118 ± 0.03
LysVal: 1.649 ± 0.037
LysTrp: 0.213 ± 0.011
LysTyr: 0.406 ± 0.016
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 15.225 ± 0.126
LeuCys: 0.863 ± 0.021
LeuAsp: 6.649 ± 0.071
LeuGlu: 4.64 ± 0.062
LeuPhe: 2.437 ± 0.046
LeuGly: 9.282 ± 0.086
LeuHis: 2.174 ± 0.034
LeuIle: 2.952 ± 0.05
LeuLys: 1.725 ± 0.038
LeuLeu: 11.042 ± 0.113
LeuMet: 1.58 ± 0.03
LeuAsn: 1.542 ± 0.033
LeuPro: 6.661 ± 0.063
LeuGln: 1.974 ± 0.038
LeuArg: 9.24 ± 0.092
LeuSer: 5.107 ± 0.06
LeuThr: 6.801 ± 0.072
LeuVal: 8.78 ± 0.085
LeuTrp: 1.239 ± 0.027
LeuTyr: 1.805 ± 0.036
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.323 ± 0.042
MetCys: 0.135 ± 0.009
MetAsp: 0.884 ± 0.025
MetGlu: 0.7 ± 0.022
MetPhe: 0.452 ± 0.017
MetGly: 1.316 ± 0.029
MetHis: 0.321 ± 0.013
MetIle: 0.597 ± 0.02
MetLys: 0.329 ± 0.013
MetLeu: 1.636 ± 0.028
MetMet: 0.294 ± 0.014
MetAsn: 0.398 ± 0.014
MetPro: 1.125 ± 0.025
MetGln: 0.375 ± 0.013
MetArg: 1.527 ± 0.028
MetSer: 1.255 ± 0.027
MetThr: 1.537 ± 0.027
MetVal: 1.315 ± 0.029
MetTrp: 0.221 ± 0.011
MetTyr: 0.315 ± 0.014
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.222 ± 0.039
AsnCys: 0.153 ± 0.011
AsnAsp: 0.883 ± 0.021
AsnGlu: 0.751 ± 0.021
AsnPhe: 0.419 ± 0.017
AsnGly: 1.847 ± 0.044
AsnHis: 0.367 ± 0.017
AsnIle: 0.517 ± 0.019
AsnLys: 0.316 ± 0.013
AsnLeu: 1.568 ± 0.033
AsnMet: 0.257 ± 0.013
AsnAsn: 0.35 ± 0.015
AsnPro: 1.303 ± 0.029
AsnGln: 0.48 ± 0.019
AsnArg: 1.161 ± 0.03
AsnSer: 0.879 ± 0.026
AsnThr: 0.995 ± 0.03
AsnVal: 1.38 ± 0.031
AsnTrp: 0.253 ± 0.014
AsnTyr: 0.38 ± 0.016
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 9.12 ± 0.098
ProCys: 0.404 ± 0.016
ProAsp: 4.541 ± 0.056
ProGlu: 3.988 ± 0.053
ProPhe: 1.554 ± 0.031
ProGly: 7.319 ± 0.081
ProHis: 1.5 ± 0.034
ProIle: 1.237 ± 0.027
ProLys: 1.074 ± 0.03
ProLeu: 5.38 ± 0.054
ProMet: 0.98 ± 0.024
ProAsn: 0.865 ± 0.025
ProPro: 3.746 ± 0.075
ProGln: 2.009 ± 0.05
ProArg: 4.517 ± 0.064
ProSer: 3.405 ± 0.06
ProThr: 3.093 ± 0.054
ProVal: 5.664 ± 0.061
ProTrp: 0.877 ± 0.025
ProTyr: 1.477 ± 0.031
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.904 ± 0.052
GlnCys: 0.177 ± 0.01
GlnAsp: 1.443 ± 0.036
GlnGlu: 1.342 ± 0.032
GlnPhe: 0.633 ± 0.019
GlnGly: 2.197 ± 0.037
GlnHis: 0.646 ± 0.023
GlnIle: 1.009 ± 0.028
GlnLys: 0.478 ± 0.017
GlnLeu: 2.912 ± 0.047
GlnMet: 0.474 ± 0.017
GlnAsn: 0.446 ± 0.016
GlnPro: 1.73 ± 0.046
GlnGln: 1.192 ± 0.044
GlnArg: 2.35 ± 0.046
GlnSer: 1.183 ± 0.026
GlnThr: 1.323 ± 0.034
GlnVal: 2.5 ± 0.049
GlnTrp: 0.459 ± 0.017
GlnTyr: 0.583 ± 0.018
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 10.718 ± 0.114
ArgCys: 0.611 ± 0.02
ArgAsp: 4.628 ± 0.057
ArgGlu: 5.155 ± 0.074
ArgPhe: 2.353 ± 0.036
ArgGly: 6.305 ± 0.064
ArgHis: 2.197 ± 0.038
ArgIle: 2.89 ± 0.039
ArgLys: 1.545 ± 0.03
ArgLeu: 9.241 ± 0.096
ArgMet: 1.821 ± 0.034
ArgAsn: 1.24 ± 0.027
ArgPro: 5.304 ± 0.069
ArgGln: 2.377 ± 0.044
ArgArg: 8.464 ± 0.102
ArgSer: 3.895 ± 0.051
ArgThr: 5.189 ± 0.063
ArgVal: 6.588 ± 0.066
ArgTrp: 1.429 ± 0.031
ArgTyr: 1.848 ± 0.038
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.933 ± 0.078
SerCys: 0.446 ± 0.016
SerAsp: 2.558 ± 0.04
SerGlu: 2.04 ± 0.032
SerPhe: 1.445 ± 0.029
SerGly: 6.304 ± 0.071
SerHis: 1.016 ± 0.025
SerIle: 1.281 ± 0.032
SerLys: 0.904 ± 0.027
SerLeu: 4.585 ± 0.063
SerMet: 1.015 ± 0.026
SerAsn: 0.82 ± 0.026
SerPro: 3.182 ± 0.051
SerGln: 1.265 ± 0.032
SerArg: 3.704 ± 0.052
SerSer: 2.818 ± 0.06
SerThr: 3.02 ± 0.06
SerVal: 4.211 ± 0.054
SerTrp: 0.9 ± 0.026
SerTyr: 1.238 ± 0.029
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 9.109 ± 0.087
ThrCys: 0.442 ± 0.017
ThrAsp: 3.49 ± 0.05
ThrGlu: 2.922 ± 0.044
ThrPhe: 1.519 ± 0.031
ThrGly: 6.866 ± 0.079
ThrHis: 1.22 ± 0.029
ThrIle: 1.596 ± 0.031
ThrLys: 1.04 ± 0.026
ThrLeu: 5.631 ± 0.057
ThrMet: 0.895 ± 0.018
ThrAsn: 0.911 ± 0.027
ThrPro: 4.223 ± 0.065
ThrGln: 1.332 ± 0.035
ThrArg: 4.105 ± 0.041
ThrSer: 3.161 ± 0.056
ThrThr: 3.968 ± 0.082
ThrVal: 5.879 ± 0.059
ThrTrp: 0.821 ± 0.024
ThrTyr: 1.297 ± 0.032
ThrXaa: 0.0 ± 0.0
Val
ValAla: 11.55 ± 0.097
ValCys: 0.857 ± 0.025
ValAsp: 5.017 ± 0.059
ValGlu: 4.819 ± 0.057
ValPhe: 2.465 ± 0.046
ValGly: 6.727 ± 0.068
ValHis: 1.975 ± 0.034
ValIle: 2.813 ± 0.052
ValLys: 1.612 ± 0.037
ValLeu: 9.555 ± 0.097
ValMet: 1.432 ± 0.029
ValAsn: 1.681 ± 0.036
ValPro: 5.623 ± 0.061
ValGln: 1.872 ± 0.031
ValArg: 7.495 ± 0.082
ValSer: 4.287 ± 0.056
ValThr: 5.751 ± 0.066
ValVal: 8.684 ± 0.09
ValTrp: 1.108 ± 0.026
ValTyr: 1.571 ± 0.031
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.663 ± 0.035
TrpCys: 0.167 ± 0.01
TrpAsp: 0.768 ± 0.022
TrpGlu: 0.683 ± 0.017
TrpPhe: 0.516 ± 0.017
TrpGly: 0.991 ± 0.024
TrpHis: 0.376 ± 0.014
TrpIle: 0.516 ± 0.019
TrpLys: 0.309 ± 0.014
TrpLeu: 1.745 ± 0.034
TrpMet: 0.267 ± 0.011
TrpAsn: 0.383 ± 0.013
TrpPro: 0.782 ± 0.022
TrpGln: 0.588 ± 0.02
TrpArg: 1.379 ± 0.033
TrpSer: 0.915 ± 0.022
TrpThr: 1.016 ± 0.027
TrpVal: 0.983 ± 0.024
TrpTrp: 0.338 ± 0.015
TrpTyr: 0.362 ± 0.016
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.915 ± 0.04
TyrCys: 0.192 ± 0.013
TyrAsp: 1.548 ± 0.042
TyrGlu: 1.207 ± 0.028
TyrPhe: 0.65 ± 0.02
TyrGly: 2.242 ± 0.033
TyrHis: 0.427 ± 0.016
TyrIle: 0.389 ± 0.017
TyrLys: 0.309 ± 0.013
TyrLeu: 2.164 ± 0.035
TyrMet: 0.238 ± 0.011
TyrAsn: 0.37 ± 0.018
TyrPro: 1.084 ± 0.025
TyrGln: 0.615 ± 0.021
TyrArg: 1.802 ± 0.034
TyrSer: 0.936 ± 0.027
TyrThr: 1.153 ± 0.029
TyrVal: 1.722 ± 0.032
TyrTrp: 0.329 ± 0.014
TyrTyr: 0.434 ± 0.019
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5334 proteins (1750405 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
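A protein-level bootstrap of this kind might look like the sketch below. It assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of 100 replicates, and that the reported error is the standard deviation across replicates; the function names, the seed, and the choice of standard deviation are assumptions, not the database's documented procedure.

```python
import random
from collections import Counter
from statistics import mean, stdev

def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_dipeptide(proteins, pair, n_boot=100, seed=0):
    """Estimate the per mille frequency of `pair` and its bootstrap error.

    Whole proteins are the resampling unit, so pair counts inside one
    protein always stay together in a replicate.
    """
    rng = random.Random(seed)
    # Precompute (hits, total pairs) per protein so each replicate is cheap.
    stats = []
    for seq in proteins:
        counts = pair_counts(seq)
        stats.append((counts[pair], sum(counts.values())))
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(stats, k=len(stats))  # sampling with replacement
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)

# Toy proteome in one-letter code; real input would be thousands of sequences.
toy = ["MAALAAG", "MKAAV", "MGGAAL", "MLLV"]
value, error = bootstrap_dipeptide(toy, "AA")
print(f"AA: {value:.3f} ± {error:.3f} per mille")
```

Resampling proteins rather than individual dipeptides keeps the dependence between pairs from the same protein, which generally gives wider error estimates when a few proteins dominate a given dipeptide.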

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski