Amino acid dipeptide frequency for Streptomyces lunaelactis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
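A minimal sketch of how such frequencies could be computed, assuming that dipeptides are counted as overlapping pairs of adjacent residues within each protein and pooled over the whole proteome (the page does not spell out the exact procedure). The function name `dipeptide_per_mille` and the toy sequences are illustrative only, and one-letter codes are used instead of the three-letter codes shown in the table.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return pooled dipeptide frequencies in per mille for one-letter sequences.

    Assumption: overlapping adjacent pairs are counted, so a protein of
    length n contributes n - 1 dipeptides.
    """
    counts = Counter()
    for seq in sequences:
        # zip(seq, seq[1:]) yields every adjacent residue pair in order
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real S. lunaelactis proteins
toy = ["MAAL", "AAG"]
freqs = dipeptide_per_mille(toy)
print(freqs["AA"])  # AA occurs 2 times among the 5 dipeptides
```

Pairs are never formed across protein boundaries in this sketch, because each sequence is processed separately.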

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
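The comparison against the uniform expectation can be sketched as follows. This assumes a plain comparison with 1/400 (2.5‰); the page does not state the exact rule or threshold used for the coloring, and the `classify` helper is hypothetical.

```python
# Uniform expectation: each of the 400 standard dipeptides equally likely
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def classify(observed_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"   # would be marked red
    if observed_per_mille < expected:
        return "under"  # would be marked blue
    return "as expected"


# A few values copied from the table (per mille)
observed = {"AlaAla": 20.683, "LeuLeu": 11.228, "CysCys": 0.105, "TrpTrp": 0.349}
for name, value in observed.items():
    print(name, classify(value), f"{value / EXPECTED_PER_MILLE:.2f}x")
```

With these values, AlaAla is roughly eight times more frequent than the uniform expectation, while CysCys is far below it.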

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
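As a small worked example of this conversion, the sketch below turns a per mille value from the table into a fraction and a percentage. The helper name `per_mille_to_fraction` is illustrative only.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_ala = per_mille_to_fraction(20.683)  # AlaAla value from the table
print(f"{ala_ala:.6f}")         # fraction of all dipeptides
print(f"{ala_ala * 100:.4f}%")  # the same value as a percentage
```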

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 20.683 ± 0.138
AlaCys: 1.027 ± 0.022
AlaAsp: 8.145 ± 0.067
AlaGlu: 8.777 ± 0.079
AlaPhe: 3.475 ± 0.038
AlaGly: 12.787 ± 0.099
AlaHis: 2.759 ± 0.04
AlaIle: 3.778 ± 0.049
AlaLys: 3.173 ± 0.046
AlaLeu: 13.933 ± 0.107
AlaMet: 2.539 ± 0.033
AlaAsn: 2.04 ± 0.033
AlaPro: 6.765 ± 0.069
AlaGln: 3.939 ± 0.048
AlaArg: 9.717 ± 0.076
AlaSer: 6.089 ± 0.051
AlaThr: 6.879 ± 0.058
AlaVal: 12.073 ± 0.098
AlaTrp: 1.812 ± 0.03
AlaTyr: 2.739 ± 0.036
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.076 ± 0.023
CysCys: 0.105 ± 0.007
CysAsp: 0.48 ± 0.014
CysGlu: 0.423 ± 0.014
CysPhe: 0.216 ± 0.011
CysGly: 0.955 ± 0.021
CysHis: 0.187 ± 0.009
CysIle: 0.208 ± 0.009
CysLys: 0.117 ± 0.006
CysLeu: 0.75 ± 0.018
CysMet: 0.126 ± 0.007
CysAsn: 0.145 ± 0.008
CysPro: 0.49 ± 0.014
CysGln: 0.193 ± 0.009
CysArg: 0.647 ± 0.018
CysSer: 0.454 ± 0.015
CysThr: 0.531 ± 0.017
CysVal: 0.654 ± 0.017
CysTrp: 0.128 ± 0.007
CysTyr: 0.157 ± 0.008
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.498 ± 0.062
AspCys: 0.445 ± 0.014
AspAsp: 3.462 ± 0.043
AspGlu: 3.999 ± 0.04
AspPhe: 1.66 ± 0.026
AspGly: 6.113 ± 0.064
AspHis: 1.354 ± 0.025
AspIle: 2.03 ± 0.032
AspLys: 1.3 ± 0.025
AspLeu: 5.991 ± 0.061
AspMet: 0.888 ± 0.019
AspAsn: 1.054 ± 0.02
AspPro: 4.301 ± 0.046
AspGln: 1.608 ± 0.029
AspArg: 4.723 ± 0.048
AspSer: 2.761 ± 0.035
AspThr: 3.079 ± 0.038
AspVal: 4.528 ± 0.045
AspTrp: 1.021 ± 0.021
AspTyr: 1.1 ± 0.027
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.31 ± 0.064
GluCys: 0.402 ± 0.013
GluAsp: 2.805 ± 0.041
GluGlu: 3.536 ± 0.046
GluPhe: 1.57 ± 0.03
GluGly: 4.424 ± 0.052
GluHis: 1.486 ± 0.029
GluIle: 2.413 ± 0.033
GluLys: 1.57 ± 0.031
GluLeu: 7.192 ± 0.073
GluMet: 0.931 ± 0.022
GluAsn: 1.088 ± 0.024
GluPro: 3.391 ± 0.04
GluGln: 2.439 ± 0.038
GluArg: 5.494 ± 0.052
GluSer: 2.837 ± 0.035
GluThr: 2.889 ± 0.044
GluVal: 4.471 ± 0.049
GluTrp: 0.806 ± 0.02
GluTyr: 1.171 ± 0.022
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.669 ± 0.044
PheCys: 0.281 ± 0.011
PheAsp: 1.955 ± 0.034
PheGlu: 1.49 ± 0.03
PhePhe: 0.872 ± 0.021
PheGly: 2.971 ± 0.043
PheHis: 0.607 ± 0.017
PheIle: 0.861 ± 0.021
PheLys: 0.563 ± 0.014
PheLeu: 2.55 ± 0.035
PheMet: 0.426 ± 0.013
PheAsn: 0.611 ± 0.017
PhePro: 1.414 ± 0.026
PheGln: 0.687 ± 0.016
PheArg: 1.819 ± 0.026
PheSer: 1.559 ± 0.028
PheThr: 2.001 ± 0.029
PheVal: 2.149 ± 0.035
PheTrp: 0.426 ± 0.013
PheTyr: 0.569 ± 0.017
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.74 ± 0.082
GlyCys: 0.855 ± 0.022
GlyAsp: 5.08 ± 0.051
GlyGlu: 5.211 ± 0.05
GlyPhe: 2.88 ± 0.035
GlyGly: 8.705 ± 0.084
GlyHis: 2.302 ± 0.034
GlyIle: 3.639 ± 0.043
GlyLys: 2.726 ± 0.042
GlyLeu: 9.33 ± 0.067
GlyMet: 1.976 ± 0.034
GlyAsn: 1.821 ± 0.035
GlyPro: 5.026 ± 0.046
GlyGln: 2.88 ± 0.042
GlyArg: 7.624 ± 0.063
GlySer: 5.592 ± 0.052
GlyThr: 6.151 ± 0.07
GlyVal: 7.147 ± 0.067
GlyTrp: 1.71 ± 0.03
GlyTyr: 2.231 ± 0.032
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.571 ± 0.036
HisCys: 0.225 ± 0.01
HisAsp: 1.294 ± 0.027
HisGlu: 1.212 ± 0.026
HisPhe: 0.633 ± 0.018
HisGly: 2.329 ± 0.032
HisHis: 0.686 ± 0.019
HisIle: 0.744 ± 0.018
HisLys: 0.385 ± 0.012
HisLeu: 2.361 ± 0.034
HisMet: 0.344 ± 0.012
HisAsn: 0.383 ± 0.014
HisPro: 1.724 ± 0.029
HisGln: 0.702 ± 0.018
HisArg: 2.009 ± 0.029
HisSer: 1.103 ± 0.021
HisThr: 1.338 ± 0.029
HisVal: 1.656 ± 0.026
HisTrp: 0.364 ± 0.013
HisTyr: 0.465 ± 0.014
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.203 ± 0.056
IleCys: 0.335 ± 0.012
IleAsp: 2.36 ± 0.035
IleGlu: 2.273 ± 0.036
IlePhe: 0.809 ± 0.019
IleGly: 3.797 ± 0.04
IleHis: 0.672 ± 0.017
IleIle: 0.965 ± 0.025
IleLys: 0.816 ± 0.02
IleLeu: 2.579 ± 0.039
IleMet: 0.475 ± 0.014
IleAsn: 0.793 ± 0.02
IlePro: 1.899 ± 0.028
IleGln: 0.817 ± 0.017
IleArg: 2.363 ± 0.037
IleSer: 1.915 ± 0.033
IleThr: 2.375 ± 0.032
IleVal: 2.894 ± 0.04
IleTrp: 0.413 ± 0.014
IleTyr: 0.612 ± 0.017
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.142 ± 0.045
LysCys: 0.136 ± 0.009
LysAsp: 1.461 ± 0.034
LysGlu: 1.336 ± 0.028
LysPhe: 0.497 ± 0.015
LysGly: 1.964 ± 0.036
LysHis: 0.476 ± 0.014
LysIle: 0.968 ± 0.022
LysLys: 0.953 ± 0.031
LysLeu: 2.213 ± 0.038
LysMet: 0.41 ± 0.014
LysAsn: 0.589 ± 0.019
LysPro: 1.519 ± 0.027
LysGln: 0.829 ± 0.022
LysArg: 1.602 ± 0.03
LysSer: 1.287 ± 0.024
LysThr: 1.353 ± 0.03
LysVal: 2.011 ± 0.032
LysTrp: 0.31 ± 0.012
LysTyr: 0.506 ± 0.016
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 14.754 ± 0.115
LeuCys: 0.837 ± 0.018
LeuAsp: 6.473 ± 0.058
LeuGlu: 4.929 ± 0.059
LeuPhe: 2.579 ± 0.039
LeuGly: 9.127 ± 0.072
LeuHis: 2.279 ± 0.033
LeuIle: 3.591 ± 0.048
LeuLys: 2.21 ± 0.036
LeuLeu: 11.228 ± 0.092
LeuMet: 1.699 ± 0.031
LeuAsn: 1.825 ± 0.028
LeuPro: 6.461 ± 0.065
LeuGln: 2.412 ± 0.031
LeuArg: 8.493 ± 0.075
LeuSer: 5.43 ± 0.046
LeuThr: 6.641 ± 0.063
LeuVal: 8.587 ± 0.075
LeuTrp: 1.287 ± 0.025
LeuTyr: 1.816 ± 0.031
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.283 ± 0.03
MetCys: 0.135 ± 0.008
MetAsp: 0.938 ± 0.021
MetGlu: 0.818 ± 0.021
MetPhe: 0.457 ± 0.014
MetGly: 1.378 ± 0.024
MetHis: 0.38 ± 0.014
MetIle: 0.67 ± 0.018
MetLys: 0.464 ± 0.015
MetLeu: 1.772 ± 0.027
MetMet: 0.31 ± 0.012
MetAsn: 0.463 ± 0.015
MetPro: 1.178 ± 0.021
MetGln: 0.494 ± 0.015
MetArg: 1.47 ± 0.029
MetSer: 1.326 ± 0.02
MetThr: 1.511 ± 0.028
MetVal: 1.315 ± 0.026
MetTrp: 0.218 ± 0.009
MetTyr: 0.32 ± 0.013
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.35 ± 0.036
AsnCys: 0.178 ± 0.007
AsnAsp: 1.033 ± 0.023
AsnGlu: 0.924 ± 0.021
AsnPhe: 0.533 ± 0.016
AsnGly: 2.023 ± 0.037
AsnHis: 0.41 ± 0.013
AsnIle: 0.684 ± 0.017
AsnLys: 0.465 ± 0.017
AsnLeu: 1.727 ± 0.03
AsnMet: 0.308 ± 0.012
AsnAsn: 0.475 ± 0.018
AsnPro: 1.425 ± 0.029
AsnGln: 0.568 ± 0.016
AsnArg: 1.352 ± 0.023
AsnSer: 1.049 ± 0.025
AsnThr: 1.201 ± 0.031
AsnVal: 1.397 ± 0.026
AsnTrp: 0.32 ± 0.012
AsnTyr: 0.442 ± 0.015
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.28 ± 0.068
ProCys: 0.34 ± 0.012
ProAsp: 4.325 ± 0.047
ProGlu: 4.302 ± 0.05
ProPhe: 1.509 ± 0.027
ProGly: 6.47 ± 0.057
ProHis: 1.29 ± 0.026
ProIle: 1.384 ± 0.026
ProLys: 1.347 ± 0.03
ProLeu: 5.293 ± 0.053
ProMet: 1.036 ± 0.023
ProAsn: 0.973 ± 0.022
ProPro: 3.428 ± 0.054
ProGln: 1.946 ± 0.034
ProArg: 3.827 ± 0.05
ProSer: 3.28 ± 0.041
ProThr: 3.129 ± 0.039
ProVal: 5.464 ± 0.057
ProTrp: 0.886 ± 0.022
ProTyr: 1.41 ± 0.026
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.936 ± 0.043
GlnCys: 0.196 ± 0.008
GlnAsp: 1.511 ± 0.027
GlnGlu: 1.523 ± 0.027
GlnPhe: 0.751 ± 0.018
GlnGly: 2.38 ± 0.039
GlnHis: 0.713 ± 0.017
GlnIle: 1.201 ± 0.022
GlnLys: 0.681 ± 0.021
GlnLeu: 3.428 ± 0.038
GlnMet: 0.532 ± 0.014
GlnAsn: 0.544 ± 0.015
GlnPro: 1.902 ± 0.033
GlnGln: 1.48 ± 0.043
GlnArg: 2.491 ± 0.037
GlnSer: 1.45 ± 0.027
GlnThr: 1.387 ± 0.025
GlnVal: 2.463 ± 0.036
GlnTrp: 0.501 ± 0.014
GlnTyr: 0.653 ± 0.019
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.321 ± 0.078
ArgCys: 0.587 ± 0.019
ArgAsp: 4.185 ± 0.049
ArgGlu: 4.677 ± 0.051
ArgPhe: 2.305 ± 0.034
ArgGly: 5.746 ± 0.06
ArgHis: 2.025 ± 0.033
ArgIle: 3.489 ± 0.04
ArgLys: 1.749 ± 0.029
ArgLeu: 8.568 ± 0.08
ArgMet: 1.756 ± 0.028
ArgAsn: 1.472 ± 0.026
ArgPro: 4.715 ± 0.054
ArgGln: 2.376 ± 0.035
ArgArg: 7.389 ± 0.087
ArgSer: 4.23 ± 0.045
ArgThr: 5.451 ± 0.047
ArgVal: 5.503 ± 0.05
ArgTrp: 1.352 ± 0.024
ArgTyr: 1.765 ± 0.026
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.966 ± 0.056
SerCys: 0.435 ± 0.013
SerAsp: 2.834 ± 0.033
SerGlu: 2.645 ± 0.038
SerPhe: 1.602 ± 0.026
SerGly: 6.242 ± 0.066
SerHis: 1.04 ± 0.02
SerIle: 1.63 ± 0.031
SerLys: 1.178 ± 0.023
SerLeu: 4.992 ± 0.044
SerMet: 1.131 ± 0.019
SerAsn: 0.986 ± 0.022
SerPro: 3.312 ± 0.046
SerGln: 1.384 ± 0.026
SerArg: 3.817 ± 0.05
SerSer: 3.048 ± 0.052
SerThr: 3.175 ± 0.04
SerVal: 4.385 ± 0.042
SerTrp: 0.933 ± 0.021
SerTyr: 1.247 ± 0.024
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.558 ± 0.076
ThrCys: 0.439 ± 0.013
ThrAsp: 3.503 ± 0.045
ThrGlu: 3.366 ± 0.041
ThrPhe: 1.636 ± 0.027
ThrGly: 6.525 ± 0.059
ThrHis: 1.178 ± 0.021
ThrIle: 1.872 ± 0.025
ThrLys: 1.206 ± 0.031
ThrLeu: 5.61 ± 0.056
ThrMet: 0.998 ± 0.022
ThrAsn: 1.051 ± 0.023
ThrPro: 3.997 ± 0.049
ThrGln: 1.477 ± 0.026
ThrArg: 3.751 ± 0.042
ThrSer: 3.299 ± 0.041
ThrThr: 3.798 ± 0.057
ThrVal: 5.962 ± 0.048
ThrTrp: 0.894 ± 0.02
ThrTyr: 1.356 ± 0.035
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.373 ± 0.08
ValCys: 0.735 ± 0.017
ValAsp: 4.771 ± 0.045
ValGlu: 4.676 ± 0.054
ValPhe: 2.339 ± 0.039
ValGly: 6.374 ± 0.054
ValHis: 1.953 ± 0.029
ValIle: 3.103 ± 0.037
ValLys: 1.852 ± 0.034
ValLeu: 9.247 ± 0.08
ValMet: 1.48 ± 0.025
ValAsn: 1.704 ± 0.032
ValPro: 5.117 ± 0.055
ValGln: 2.207 ± 0.027
ValArg: 6.946 ± 0.055
ValSer: 4.356 ± 0.046
ValThr: 5.379 ± 0.051
ValVal: 7.556 ± 0.077
ValTrp: 1.118 ± 0.022
ValTyr: 1.556 ± 0.025
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.641 ± 0.027
TrpCys: 0.166 ± 0.008
TrpAsp: 0.85 ± 0.022
TrpGlu: 0.744 ± 0.017
TrpPhe: 0.505 ± 0.013
TrpGly: 1.037 ± 0.022
TrpHis: 0.363 ± 0.012
TrpIle: 0.583 ± 0.018
TrpLys: 0.381 ± 0.014
TrpLeu: 1.797 ± 0.027
TrpMet: 0.288 ± 0.01
TrpAsn: 0.416 ± 0.015
TrpPro: 0.788 ± 0.019
TrpGln: 0.645 ± 0.016
TrpArg: 1.324 ± 0.027
TrpSer: 0.933 ± 0.02
TrpThr: 1.025 ± 0.02
TrpVal: 0.993 ± 0.022
TrpTrp: 0.349 ± 0.011
TrpTyr: 0.36 ± 0.013
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.78 ± 0.036
TyrCys: 0.181 ± 0.009
TyrAsp: 1.448 ± 0.031
TyrGlu: 1.363 ± 0.026
TyrPhe: 0.655 ± 0.018
TyrGly: 2.283 ± 0.027
TyrHis: 0.367 ± 0.013
TyrIle: 0.549 ± 0.017
TyrLys: 0.428 ± 0.015
TyrLeu: 2.097 ± 0.031
TyrMet: 0.249 ± 0.01
TyrAsn: 0.429 ± 0.015
TyrPro: 1.068 ± 0.021
TyrGln: 0.641 ± 0.018
TyrArg: 1.803 ± 0.029
TyrSer: 0.982 ± 0.023
TyrThr: 1.181 ± 0.03
TyrVal: 1.63 ± 0.028
TyrTrp: 0.338 ± 0.012
TyrTyr: 0.445 ± 0.017
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7136 proteins (2361430 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
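A protein-level bootstrap of this kind can be sketched as follows. The page only states that bootstrapping (x100) was done at the protein level, so two things here are assumptions: that whole proteins are resampled with replacement to the original proteome size, and that the reported error is the standard deviation over the replicates. Function names and toy sequences are illustrative only.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(sequences, pair):
    """Per mille frequency of one dipeptide, pooled over the given sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


toy = ["MAAL", "AAG", "MKAAV", "GLLA", "MAGA"]  # toy data only
print(pair_per_mille(toy, "AA"), "±", bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual dipeptides keeps the residues of each protein together, so the estimate reflects variation between proteins.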

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski