Amino acid dipeptide frequency for Micromonospora pattaloongensis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
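As an illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. It is a minimal example, not the pipeline used to build this table: the function name `dipeptide_frequencies`, the one-letter sequence encoding, and the toy sequences are assumptions made for illustration.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return overlapping dipeptide frequencies in per mille.

    Assumption: pairs are counted with a sliding window of width 2
    inside each protein and never span two proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome in one-letter codes (not real data).
toy_proteome = ["MAAL", "MAG"]
freqs = dipeptide_frequencies(toy_proteome)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The toy proteome contains five dipeptides in total (MA, AA, AL, MA, AG), so the frequencies sum to 1000 per mille.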

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
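The uniform expectation quoted above follows from simple arithmetic. The sketch below computes it and labels a value as over- or underrepresented relative to that baseline. The names `expected_per_mille` and `classify` are hypothetical, and the threshold used for the red and blue marking in the table is assumed here to be the uniform expectation.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / n_residue_types ** 2


def classify(observed_per_mille, n_residue_types=20):
    """Compare an observed value with the uniform expectation (assumed baseline)."""
    expected = expected_per_mille(n_residue_types)
    if observed_per_mille > expected:
        return "overrepresented"
    if observed_per_mille < expected:
        return "underrepresented"
    return "as expected"


print(expected_per_mille())    # 20 x 20 = 400 possible dipeptides
print(expected_per_mille(21))  # 21 x 21 = 441 when Xaa is included
print(classify(24.627))        # AlaAla value from the table
print(classify(0.121))         # CysCys value from the table
```

With 20 residue types the expectation is 2.5‰, so AlaAla (24.627‰) lies well above it and CysCys (0.121‰) well below.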

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.
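For readers who want the values on another scale, the conversion can be sketched as follows. The helper names are hypothetical; the example value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 24.627  # AlaAla frequency in per mille, taken from the table
print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(ala_ala))
```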

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 24.627 ± 0.229
AlaCys: 0.965 ± 0.03
AlaAsp: 8.52 ± 0.077
AlaGlu: 8.499 ± 0.093
AlaPhe: 3.393 ± 0.048
AlaGly: 14.475 ± 0.127
AlaHis: 2.574 ± 0.048
AlaIle: 4.266 ± 0.059
AlaLys: 2.405 ± 0.053
AlaLeu: 14.825 ± 0.142
AlaMet: 2.478 ± 0.04
AlaAsn: 2.208 ± 0.045
AlaPro: 7.987 ± 0.085
AlaGln: 3.81 ± 0.055
AlaArg: 11.076 ± 0.11
AlaSer: 5.631 ± 0.062
AlaThr: 8.323 ± 0.079
AlaVal: 13.031 ± 0.121
AlaTrp: 1.933 ± 0.039
AlaTyr: 2.737 ± 0.045
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.022 ± 0.029
CysCys: 0.121 ± 0.008
CysAsp: 0.517 ± 0.018
CysGlu: 0.327 ± 0.014
CysPhe: 0.216 ± 0.012
CysGly: 0.892 ± 0.026
CysHis: 0.196 ± 0.012
CysIle: 0.143 ± 0.009
CysLys: 0.096 ± 0.008
CysLeu: 0.676 ± 0.021
CysMet: 0.105 ± 0.008
CysAsn: 0.139 ± 0.01
CysPro: 0.49 ± 0.019
CysGln: 0.185 ± 0.01
CysArg: 0.666 ± 0.022
CysSer: 0.368 ± 0.015
CysThr: 0.41 ± 0.018
CysVal: 0.674 ± 0.023
CysTrp: 0.105 ± 0.008
CysTyr: 0.177 ± 0.012
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.049 ± 0.076
AspCys: 0.397 ± 0.017
AspAsp: 3.62 ± 0.065
AspGlu: 3.857 ± 0.063
AspPhe: 1.468 ± 0.029
AspGly: 6.316 ± 0.07
AspHis: 1.206 ± 0.028
AspIle: 1.662 ± 0.034
AspLys: 0.96 ± 0.032
AspLeu: 5.938 ± 0.066
AspMet: 0.765 ± 0.025
AspAsn: 0.953 ± 0.025
AspPro: 4.879 ± 0.058
AspGln: 1.495 ± 0.034
AspArg: 5.274 ± 0.066
AspSer: 2.134 ± 0.041
AspThr: 2.731 ± 0.042
AspVal: 5.087 ± 0.061
AspTrp: 0.911 ± 0.022
AspTyr: 1.146 ± 0.027
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.36 ± 0.081
GluCys: 0.328 ± 0.016
GluAsp: 1.967 ± 0.041
GluGlu: 2.53 ± 0.049
GluPhe: 1.531 ± 0.032
GluGly: 3.216 ± 0.058
GluHis: 1.302 ± 0.028
GluIle: 2.357 ± 0.047
GluLys: 1.087 ± 0.029
GluLeu: 6.868 ± 0.083
GluMet: 0.907 ± 0.026
GluAsn: 0.82 ± 0.026
GluPro: 3.419 ± 0.051
GluGln: 2.195 ± 0.038
GluArg: 5.215 ± 0.075
GluSer: 2.207 ± 0.039
GluThr: 2.65 ± 0.048
GluVal: 4.507 ± 0.067
GluTrp: 0.738 ± 0.023
GluTyr: 1.099 ± 0.029
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.883 ± 0.05
PheCys: 0.247 ± 0.014
PheAsp: 1.969 ± 0.036
PheGlu: 1.213 ± 0.03
PhePhe: 0.862 ± 0.025
PheGly: 3.009 ± 0.049
PheHis: 0.522 ± 0.017
PheIle: 0.759 ± 0.021
PheLys: 0.424 ± 0.017
PheLeu: 2.409 ± 0.045
PheMet: 0.357 ± 0.015
PheAsn: 0.635 ± 0.019
PhePro: 1.33 ± 0.03
PheGln: 0.61 ± 0.017
PheArg: 1.791 ± 0.031
PheSer: 1.314 ± 0.032
PheThr: 1.881 ± 0.036
PheVal: 2.453 ± 0.042
PheTrp: 0.438 ± 0.019
PheTyr: 0.591 ± 0.018
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 11.814 ± 0.11
GlyCys: 0.855 ± 0.024
GlyAsp: 5.306 ± 0.062
GlyGlu: 4.695 ± 0.064
GlyPhe: 2.855 ± 0.047
GlyGly: 8.815 ± 0.094
GlyHis: 2.108 ± 0.041
GlyIle: 3.2 ± 0.052
GlyLys: 1.968 ± 0.04
GlyLeu: 9.301 ± 0.082
GlyMet: 1.873 ± 0.034
GlyAsn: 1.853 ± 0.048
GlyPro: 5.182 ± 0.058
GlyGln: 2.66 ± 0.043
GlyArg: 8.213 ± 0.086
GlySer: 4.662 ± 0.068
GlyThr: 5.543 ± 0.066
GlyVal: 8.106 ± 0.084
GlyTrp: 1.785 ± 0.038
GlyTyr: 2.48 ± 0.041
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.627 ± 0.041
HisCys: 0.178 ± 0.011
HisAsp: 1.316 ± 0.029
HisGlu: 1.051 ± 0.022
HisPhe: 0.564 ± 0.018
HisGly: 2.135 ± 0.035
HisHis: 0.58 ± 0.022
HisIle: 0.519 ± 0.018
HisLys: 0.257 ± 0.012
HisLeu: 2.202 ± 0.044
HisMet: 0.285 ± 0.014
HisAsn: 0.371 ± 0.016
HisPro: 1.627 ± 0.035
HisGln: 0.558 ± 0.016
HisArg: 2.049 ± 0.04
HisSer: 0.8 ± 0.023
HisThr: 1.06 ± 0.022
HisVal: 1.729 ± 0.036
HisTrp: 0.314 ± 0.017
HisTyr: 0.456 ± 0.017
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.13 ± 0.068
IleCys: 0.303 ± 0.016
IleAsp: 2.44 ± 0.039
IleGlu: 2.069 ± 0.04
IlePhe: 0.861 ± 0.024
IleGly: 3.643 ± 0.057
IleHis: 0.537 ± 0.019
IleIle: 1.011 ± 0.031
IleLys: 0.658 ± 0.021
IleLeu: 2.381 ± 0.046
IleMet: 0.484 ± 0.018
IleAsn: 0.816 ± 0.022
IlePro: 1.726 ± 0.036
IleGln: 0.729 ± 0.022
IleArg: 2.399 ± 0.037
IleSer: 1.716 ± 0.032
IleThr: 2.226 ± 0.037
IleVal: 3.137 ± 0.045
IleTrp: 0.411 ± 0.016
IleTyr: 0.647 ± 0.023
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.183 ± 0.055
LysCys: 0.115 ± 0.009
LysAsp: 0.83 ± 0.025
LysGlu: 0.765 ± 0.021
LysPhe: 0.421 ± 0.017
LysGly: 1.345 ± 0.033
LysHis: 0.339 ± 0.015
LysIle: 0.835 ± 0.021
LysLys: 0.57 ± 0.029
LysLeu: 1.831 ± 0.038
LysMet: 0.343 ± 0.014
LysAsn: 0.401 ± 0.018
LysPro: 1.148 ± 0.028
LysGln: 0.627 ± 0.023
LysArg: 1.32 ± 0.033
LysSer: 0.906 ± 0.022
LysThr: 1.082 ± 0.03
LysVal: 1.584 ± 0.039
LysTrp: 0.246 ± 0.014
LysTyr: 0.345 ± 0.016
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 16.251 ± 0.148
LeuCys: 0.76 ± 0.021
LeuAsp: 6.524 ± 0.074
LeuGlu: 3.979 ± 0.058
LeuPhe: 2.563 ± 0.05
LeuGly: 9.253 ± 0.095
LeuHis: 2.173 ± 0.038
LeuIle: 3.43 ± 0.048
LeuLys: 1.436 ± 0.033
LeuLeu: 11.361 ± 0.131
LeuMet: 1.424 ± 0.031
LeuAsn: 1.688 ± 0.036
LeuPro: 6.578 ± 0.072
LeuGln: 2.138 ± 0.041
LeuArg: 9.513 ± 0.098
LeuSer: 4.952 ± 0.062
LeuThr: 6.643 ± 0.079
LeuVal: 9.385 ± 0.098
LeuTrp: 1.208 ± 0.032
LeuTyr: 1.698 ± 0.039
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.193 ± 0.038
MetCys: 0.123 ± 0.009
MetAsp: 0.736 ± 0.021
MetGlu: 0.635 ± 0.02
MetPhe: 0.475 ± 0.021
MetGly: 1.148 ± 0.029
MetHis: 0.29 ± 0.014
MetIle: 0.695 ± 0.022
MetLys: 0.357 ± 0.016
MetLeu: 1.733 ± 0.036
MetMet: 0.278 ± 0.013
MetAsn: 0.348 ± 0.015
MetPro: 1.157 ± 0.028
MetGln: 0.449 ± 0.014
MetArg: 1.426 ± 0.031
MetSer: 1.216 ± 0.028
MetThr: 1.609 ± 0.029
MetVal: 1.336 ± 0.03
MetTrp: 0.214 ± 0.013
MetTyr: 0.285 ± 0.013
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.315 ± 0.046
AsnCys: 0.166 ± 0.01
AsnAsp: 1.036 ± 0.029
AsnGlu: 0.791 ± 0.023
AsnPhe: 0.508 ± 0.021
AsnGly: 1.915 ± 0.044
AsnHis: 0.39 ± 0.016
AsnIle: 0.628 ± 0.025
AsnLys: 0.342 ± 0.017
AsnLeu: 1.802 ± 0.035
AsnMet: 0.28 ± 0.016
AsnAsn: 0.483 ± 0.023
AsnPro: 1.53 ± 0.035
AsnGln: 0.569 ± 0.02
AsnArg: 1.421 ± 0.033
AsnSer: 0.88 ± 0.028
AsnThr: 1.059 ± 0.03
AsnVal: 1.538 ± 0.034
AsnTrp: 0.328 ± 0.015
AsnTyr: 0.423 ± 0.017
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 10.02 ± 0.109
ProCys: 0.309 ± 0.013
ProAsp: 4.485 ± 0.061
ProGlu: 3.936 ± 0.06
ProPhe: 1.451 ± 0.032
ProGly: 6.604 ± 0.075
ProHis: 1.254 ± 0.029
ProIle: 1.734 ± 0.038
ProLys: 0.972 ± 0.025
ProLeu: 5.33 ± 0.062
ProMet: 1.004 ± 0.024
ProAsn: 1.026 ± 0.03
ProPro: 4.644 ± 0.092
ProGln: 1.708 ± 0.034
ProArg: 4.487 ± 0.056
ProSer: 3.127 ± 0.049
ProThr: 3.967 ± 0.062
ProVal: 5.787 ± 0.069
ProTrp: 0.925 ± 0.027
ProTyr: 1.326 ± 0.029
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.516 ± 0.047
GlnCys: 0.158 ± 0.011
GlnAsp: 0.948 ± 0.026
GlnGlu: 1.148 ± 0.029
GlnPhe: 0.715 ± 0.021
GlnGly: 1.793 ± 0.036
GlnHis: 0.622 ± 0.019
GlnIle: 1.162 ± 0.032
GlnLys: 0.519 ± 0.019
GlnLeu: 3.411 ± 0.043
GlnMet: 0.506 ± 0.019
GlnAsn: 0.534 ± 0.02
GlnPro: 1.913 ± 0.046
GlnGln: 1.206 ± 0.032
GlnArg: 2.903 ± 0.045
GlnSer: 1.16 ± 0.033
GlnThr: 1.419 ± 0.032
GlnVal: 2.552 ± 0.047
GlnTrp: 0.497 ± 0.018
GlnTyr: 0.54 ± 0.018
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 10.967 ± 0.108
ArgCys: 0.647 ± 0.022
ArgAsp: 4.844 ± 0.058
ArgGlu: 4.362 ± 0.062
ArgPhe: 2.442 ± 0.036
ArgGly: 6.334 ± 0.08
ArgHis: 2.132 ± 0.04
ArgIle: 3.196 ± 0.049
ArgLys: 1.364 ± 0.035
ArgLeu: 9.335 ± 0.094
ArgMet: 1.912 ± 0.037
ArgAsn: 1.506 ± 0.034
ArgPro: 5.646 ± 0.079
ArgGln: 2.598 ± 0.045
ArgArg: 9.125 ± 0.119
ArgSer: 3.98 ± 0.048
ArgThr: 4.805 ± 0.059
ArgVal: 6.654 ± 0.07
ArgTrp: 1.607 ± 0.037
ArgTyr: 2.018 ± 0.042
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.513 ± 0.076
SerCys: 0.395 ± 0.018
SerAsp: 2.462 ± 0.041
SerGlu: 2.015 ± 0.038
SerPhe: 1.371 ± 0.032
SerGly: 5.279 ± 0.062
SerHis: 0.877 ± 0.025
SerIle: 1.502 ± 0.036
SerLys: 0.78 ± 0.025
SerLeu: 4.145 ± 0.055
SerMet: 1.007 ± 0.026
SerAsn: 0.873 ± 0.027
SerPro: 3.075 ± 0.046
SerGln: 1.107 ± 0.033
SerArg: 3.611 ± 0.052
SerSer: 2.352 ± 0.049
SerThr: 2.945 ± 0.047
SerVal: 3.986 ± 0.056
SerTrp: 0.893 ± 0.028
SerTyr: 1.165 ± 0.031
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.816 ± 0.088
ThrCys: 0.403 ± 0.017
ThrAsp: 3.51 ± 0.047
ThrGlu: 2.923 ± 0.049
ThrPhe: 1.576 ± 0.034
ThrGly: 6.488 ± 0.072
ThrHis: 1.025 ± 0.027
ThrIle: 2.048 ± 0.04
ThrLys: 0.976 ± 0.027
ThrLeu: 5.747 ± 0.061
ThrMet: 0.977 ± 0.021
ThrAsn: 1.13 ± 0.036
ThrPro: 4.287 ± 0.064
ThrGln: 1.332 ± 0.032
ThrArg: 4.191 ± 0.055
ThrSer: 2.766 ± 0.053
ThrThr: 3.755 ± 0.055
ThrVal: 6.323 ± 0.072
ThrTrp: 0.856 ± 0.025
ThrTyr: 1.303 ± 0.036
ThrXaa: 0.0 ± 0.0
Val
ValAla: 12.963 ± 0.112
ValCys: 0.715 ± 0.021
ValAsp: 5.634 ± 0.06
ValGlu: 4.82 ± 0.064
ValPhe: 2.313 ± 0.045
ValGly: 7.432 ± 0.091
ValHis: 1.748 ± 0.036
ValIle: 3.165 ± 0.047
ValLys: 1.472 ± 0.033
ValLeu: 9.557 ± 0.088
ValMet: 1.256 ± 0.032
ValAsn: 1.809 ± 0.039
ValPro: 5.368 ± 0.06
ValGln: 1.992 ± 0.038
ValArg: 7.305 ± 0.073
ValSer: 4.271 ± 0.056
ValThr: 6.247 ± 0.073
ValVal: 8.877 ± 0.096
ValTrp: 1.085 ± 0.027
ValTyr: 1.633 ± 0.034
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.619 ± 0.03
TrpCys: 0.161 ± 0.009
TrpAsp: 0.718 ± 0.021
TrpGlu: 0.611 ± 0.022
TrpPhe: 0.5 ± 0.02
TrpGly: 0.993 ± 0.027
TrpHis: 0.392 ± 0.016
TrpIle: 0.561 ± 0.019
TrpLys: 0.295 ± 0.017
TrpLeu: 1.867 ± 0.038
TrpMet: 0.268 ± 0.012
TrpAsn: 0.387 ± 0.017
TrpPro: 0.911 ± 0.024
TrpGln: 0.618 ± 0.022
TrpArg: 1.562 ± 0.034
TrpSer: 0.992 ± 0.029
TrpThr: 0.947 ± 0.028
TrpVal: 1.054 ± 0.027
TrpTrp: 0.376 ± 0.016
TrpTyr: 0.36 ± 0.019
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.893 ± 0.046
TyrCys: 0.178 ± 0.011
TyrAsp: 1.466 ± 0.043
TyrGlu: 1.078 ± 0.03
TyrPhe: 0.615 ± 0.022
TyrGly: 2.188 ± 0.045
TyrHis: 0.41 ± 0.017
TyrIle: 0.451 ± 0.016
TyrLys: 0.302 ± 0.016
TyrLeu: 2.239 ± 0.038
TyrMet: 0.194 ± 0.01
TyrAsn: 0.423 ± 0.018
TyrPro: 1.199 ± 0.029
TyrGln: 0.639 ± 0.022
TyrArg: 1.927 ± 0.037
TyrSer: 0.918 ± 0.031
TyrThr: 1.131 ± 0.035
TyrVal: 1.862 ± 0.037
TyrTrp: 0.32 ± 0.015
TyrTyr: 0.43 ± 0.017
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4874 proteins (1584710 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
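A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the statistic is recomputed on each resample, and the spread of the recomputed values serves as the error. This is only an illustration under stated assumptions. The source does not say which spread measure is reported, so the standard deviation is assumed here, and the function names, one-letter encoding, and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences, pair):
    """Frequency of one dipeptide (overlapping pairs within proteins), in per mille."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Standard deviation of the statistic over protein-level resamples (assumed error measure)."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter codes (not real data).
toy_proteome = ["MAAL", "MAG", "AAAA", "GLLA"]
print(dipeptide_per_mille(toy_proteome, "AA"))
print(bootstrap_error(toy_proteome, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.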

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski