Amino acid dipeptide frequency for Kribbella soli

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility, dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
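The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: sequences are given in one-letter code, dipeptides are counted as overlapping windows of two residues within each protein, and the uniform expectation of 2.5‰ is used as the threshold. The function names `dipeptide_permille` and `classify` are hypothetical and are not taken from Proteome-pI.

```python
from collections import Counter

# Uniform expectation for 20 x 20 = 400 dipeptides: 1000 / 400 = 2.5 per mille (0.25%).
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Assumption: windows never span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the expected value."""
    if freq_permille > expected:
        return "over"   # would be marked in red
    if freq_permille < expected:
        return "under"  # would be marked in blue
    return "expected"


# Toy input, not real proteome data.
toy = ["AAAL", "ALG"]
freqs = dipeptide_permille(toy)
```

With the values from the table, `classify(18.538)` (AlaAla) returns `"over"` and `classify(0.092)` (CysCys) returns `"under"`.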

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
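As a small worked example of the unit conversion, the following sketch turns a per mille value into a fraction and a percentage. The helper names are hypothetical; the input value is the AlaAla entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala_fraction = permille_to_fraction(18.538)  # about 0.018538
ala_ala_percent = permille_to_percent(18.538)    # about 1.8538 %
```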

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 18.538 ± 0.111
AlaCys: 0.874 ± 0.018
AlaAsp: 8.124 ± 0.063
AlaGlu: 7.608 ± 0.066
AlaPhe: 3.611 ± 0.038
AlaGly: 11.952 ± 0.068
AlaHis: 2.211 ± 0.031
AlaIle: 5.063 ± 0.045
AlaLys: 3.226 ± 0.047
AlaLeu: 12.608 ± 0.087
AlaMet: 2.481 ± 0.029
AlaAsn: 2.452 ± 0.036
AlaPro: 5.504 ± 0.059
AlaGln: 3.704 ± 0.042
AlaArg: 8.014 ± 0.062
AlaSer: 5.893 ± 0.046
AlaThr: 7.378 ± 0.047
AlaVal: 11.483 ± 0.077
AlaTrp: 1.865 ± 0.025
AlaTyr: 2.691 ± 0.032
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.801 ± 0.017
CysCys: 0.092 ± 0.006
CysAsp: 0.416 ± 0.013
CysGlu: 0.368 ± 0.011
CysPhe: 0.212 ± 0.009
CysGly: 0.81 ± 0.018
CysHis: 0.177 ± 0.009
CysIle: 0.214 ± 0.009
CysLys: 0.129 ± 0.007
CysLeu: 0.662 ± 0.016
CysMet: 0.104 ± 0.006
CysAsn: 0.155 ± 0.008
CysPro: 0.404 ± 0.014
CysGln: 0.162 ± 0.008
CysArg: 0.47 ± 0.014
CysSer: 0.421 ± 0.013
CysThr: 0.455 ± 0.013
CysVal: 0.535 ± 0.014
CysTrp: 0.122 ± 0.007
CysTyr: 0.155 ± 0.008
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.332 ± 0.066
AspCys: 0.371 ± 0.013
AspAsp: 3.948 ± 0.045
AspGlu: 4.049 ± 0.043
AspPhe: 1.662 ± 0.023
AspGly: 6.232 ± 0.055
AspHis: 1.431 ± 0.023
AspIle: 1.964 ± 0.027
AspLys: 1.427 ± 0.03
AspLeu: 6.892 ± 0.055
AspMet: 0.709 ± 0.016
AspAsn: 1.229 ± 0.023
AspPro: 4.35 ± 0.041
AspGln: 1.995 ± 0.031
AspArg: 4.576 ± 0.039
AspSer: 2.622 ± 0.029
AspThr: 2.822 ± 0.033
AspVal: 5.523 ± 0.044
AspTrp: 1.062 ± 0.02
AspTyr: 1.263 ± 0.026
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.256 ± 0.062
GluCys: 0.302 ± 0.01
GluAsp: 2.561 ± 0.031
GluGlu: 2.687 ± 0.042
GluPhe: 1.597 ± 0.026
GluGly: 3.335 ± 0.035
GluHis: 1.428 ± 0.022
GluIle: 2.389 ± 0.036
GluLys: 1.295 ± 0.027
GluLeu: 7.19 ± 0.056
GluMet: 0.865 ± 0.019
GluAsn: 1.087 ± 0.023
GluPro: 3.12 ± 0.04
GluGln: 2.399 ± 0.036
GluArg: 4.651 ± 0.048
GluSer: 2.681 ± 0.036
GluThr: 2.834 ± 0.032
GluVal: 4.511 ± 0.047
GluTrp: 0.864 ± 0.018
GluTyr: 1.251 ± 0.023
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.829 ± 0.04
PheCys: 0.258 ± 0.01
PheAsp: 2.161 ± 0.032
PheGlu: 1.578 ± 0.025
PhePhe: 0.975 ± 0.023
PheGly: 3.21 ± 0.037
PheHis: 0.635 ± 0.015
PheIle: 0.895 ± 0.021
PheLys: 0.64 ± 0.017
PheLeu: 2.75 ± 0.034
PheMet: 0.411 ± 0.012
PheAsn: 0.709 ± 0.019
PhePro: 1.392 ± 0.027
PheGln: 0.806 ± 0.018
PheArg: 1.926 ± 0.028
PheSer: 1.642 ± 0.025
PheThr: 2.081 ± 0.028
PheVal: 2.681 ± 0.032
PheTrp: 0.52 ± 0.013
PheTyr: 0.698 ± 0.016
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.364 ± 0.071
GlyCys: 0.766 ± 0.015
GlyAsp: 4.946 ± 0.051
GlyGlu: 4.393 ± 0.04
GlyPhe: 3.054 ± 0.032
GlyGly: 7.599 ± 0.068
GlyHis: 1.894 ± 0.025
GlyIle: 3.881 ± 0.042
GlyLys: 2.671 ± 0.04
GlyLeu: 9.453 ± 0.073
GlyMet: 1.764 ± 0.028
GlyAsn: 2.1 ± 0.034
GlyPro: 4.488 ± 0.046
GlyGln: 2.937 ± 0.042
GlyArg: 6.383 ± 0.051
GlySer: 5.263 ± 0.053
GlyThr: 5.848 ± 0.068
GlyVal: 7.626 ± 0.059
GlyTrp: 1.87 ± 0.033
GlyTyr: 2.502 ± 0.032
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.258 ± 0.033
HisCys: 0.173 ± 0.008
HisAsp: 1.318 ± 0.022
HisGlu: 1.168 ± 0.022
HisPhe: 0.625 ± 0.017
HisGly: 2.034 ± 0.031
HisHis: 0.636 ± 0.017
HisIle: 0.609 ± 0.016
HisLys: 0.387 ± 0.012
HisLeu: 2.378 ± 0.028
HisMet: 0.284 ± 0.01
HisAsn: 0.439 ± 0.013
HisPro: 1.556 ± 0.027
HisGln: 0.713 ± 0.014
HisArg: 1.782 ± 0.026
HisSer: 0.985 ± 0.019
HisThr: 1.144 ± 0.022
HisVal: 1.676 ± 0.026
HisTrp: 0.39 ± 0.012
HisTyr: 0.469 ± 0.014
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.763 ± 0.053
IleCys: 0.32 ± 0.011
IleAsp: 2.87 ± 0.032
IleGlu: 2.354 ± 0.032
IlePhe: 1.032 ± 0.022
IleGly: 4.292 ± 0.043
IleHis: 0.724 ± 0.017
IleIle: 1.116 ± 0.022
IleLys: 0.866 ± 0.021
IleLeu: 3.055 ± 0.046
IleMet: 0.506 ± 0.015
IleAsn: 0.861 ± 0.02
IlePro: 2.11 ± 0.031
IleGln: 0.985 ± 0.019
IleArg: 2.582 ± 0.033
IleSer: 2.234 ± 0.033
IleThr: 2.769 ± 0.033
IleVal: 3.568 ± 0.035
IleTrp: 0.539 ± 0.013
IleTyr: 0.75 ± 0.016
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.213 ± 0.044
LysCys: 0.115 ± 0.008
LysAsp: 1.389 ± 0.026
LysGlu: 1.02 ± 0.021
LysPhe: 0.65 ± 0.018
LysGly: 1.68 ± 0.026
LysHis: 0.53 ± 0.013
LysIle: 0.992 ± 0.02
LysLys: 0.794 ± 0.027
LysLeu: 2.643 ± 0.035
LysMet: 0.433 ± 0.013
LysAsn: 0.608 ± 0.017
LysPro: 1.673 ± 0.033
LysGln: 0.904 ± 0.022
LysArg: 1.438 ± 0.022
LysSer: 1.402 ± 0.028
LysThr: 1.573 ± 0.032
LysVal: 2.222 ± 0.033
LysTrp: 0.343 ± 0.012
LysTyr: 0.652 ± 0.016
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 14.165 ± 0.094
LeuCys: 0.671 ± 0.017
LeuAsp: 6.799 ± 0.052
LeuGlu: 5.063 ± 0.051
LeuPhe: 2.813 ± 0.037
LeuGly: 9.117 ± 0.069
LeuHis: 2.181 ± 0.026
LeuIle: 4.234 ± 0.049
LeuLys: 2.245 ± 0.034
LeuLeu: 10.857 ± 0.091
LeuMet: 1.715 ± 0.026
LeuAsn: 2.124 ± 0.027
LeuPro: 6.088 ± 0.047
LeuGln: 2.932 ± 0.034
LeuArg: 7.756 ± 0.063
LeuSer: 5.696 ± 0.046
LeuThr: 7.333 ± 0.058
LeuVal: 9.248 ± 0.081
LeuTrp: 1.37 ± 0.02
LeuTyr: 1.974 ± 0.026
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.04 ± 0.029
MetCys: 0.101 ± 0.007
MetAsp: 0.846 ± 0.018
MetGlu: 0.668 ± 0.015
MetPhe: 0.525 ± 0.014
MetGly: 1.148 ± 0.022
MetHis: 0.359 ± 0.012
MetIle: 0.816 ± 0.02
MetLys: 0.492 ± 0.015
MetLeu: 1.823 ± 0.028
MetMet: 0.31 ± 0.011
MetAsn: 0.499 ± 0.014
MetPro: 1.076 ± 0.018
MetGln: 0.488 ± 0.012
MetArg: 1.319 ± 0.025
MetSer: 1.378 ± 0.022
MetThr: 1.592 ± 0.027
MetVal: 1.424 ± 0.023
MetTrp: 0.196 ± 0.009
MetTyr: 0.376 ± 0.013
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.523 ± 0.034
AsnCys: 0.183 ± 0.009
AsnAsp: 1.259 ± 0.023
AsnGlu: 0.993 ± 0.02
AsnPhe: 0.638 ± 0.016
AsnGly: 2.314 ± 0.042
AsnHis: 0.487 ± 0.014
AsnIle: 0.77 ± 0.021
AsnLys: 0.518 ± 0.016
AsnLeu: 2.249 ± 0.03
AsnMet: 0.322 ± 0.011
AsnAsn: 0.576 ± 0.016
AsnPro: 1.684 ± 0.028
AsnGln: 0.748 ± 0.018
AsnArg: 1.464 ± 0.028
AsnSer: 1.138 ± 0.022
AsnThr: 1.301 ± 0.025
AsnVal: 1.693 ± 0.03
AsnTrp: 0.423 ± 0.013
AsnTyr: 0.567 ± 0.015
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.327 ± 0.068
ProCys: 0.289 ± 0.011
ProAsp: 4.377 ± 0.041
ProGlu: 3.749 ± 0.045
ProPhe: 1.557 ± 0.023
ProGly: 5.507 ± 0.057
ProHis: 1.15 ± 0.022
ProIle: 2.007 ± 0.03
ProLys: 1.378 ± 0.027
ProLeu: 4.865 ± 0.048
ProMet: 1.017 ± 0.02
ProAsn: 1.272 ± 0.023
ProPro: 3.089 ± 0.042
ProGln: 1.702 ± 0.032
ProArg: 3.231 ± 0.038
ProSer: 3.3 ± 0.034
ProThr: 3.76 ± 0.051
ProVal: 4.977 ± 0.047
ProTrp: 0.976 ± 0.021
ProTyr: 1.379 ± 0.024
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.006 ± 0.047
GlnCys: 0.159 ± 0.007
GlnAsp: 1.431 ± 0.024
GlnGlu: 1.333 ± 0.025
GlnPhe: 0.897 ± 0.018
GlnGly: 2.108 ± 0.031
GlnHis: 0.737 ± 0.017
GlnIle: 1.363 ± 0.021
GlnLys: 0.734 ± 0.018
GlnLeu: 3.825 ± 0.039
GlnMet: 0.524 ± 0.013
GlnAsn: 0.698 ± 0.016
GlnPro: 2.012 ± 0.031
GlnGln: 1.521 ± 0.039
GlnArg: 2.479 ± 0.034
GlnSer: 1.634 ± 0.03
GlnThr: 1.769 ± 0.025
GlnVal: 2.909 ± 0.036
GlnTrp: 0.531 ± 0.015
GlnTyr: 0.842 ± 0.016
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.764 ± 0.06
ArgCys: 0.456 ± 0.014
ArgAsp: 3.965 ± 0.045
ArgGlu: 3.92 ± 0.04
ArgPhe: 2.284 ± 0.029
ArgGly: 5.028 ± 0.043
ArgHis: 1.651 ± 0.026
ArgIle: 3.383 ± 0.034
ArgLys: 1.696 ± 0.026
ArgLeu: 7.865 ± 0.059
ArgMet: 1.636 ± 0.024
ArgAsn: 1.543 ± 0.025
ArgPro: 4.013 ± 0.043
ArgGln: 2.324 ± 0.028
ArgArg: 6.837 ± 0.062
ArgSer: 4.135 ± 0.043
ArgThr: 4.839 ± 0.045
ArgVal: 5.277 ± 0.048
ArgTrp: 1.381 ± 0.022
ArgTyr: 1.811 ± 0.025
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.449 ± 0.055
SerCys: 0.365 ± 0.011
SerAsp: 2.947 ± 0.036
SerGlu: 2.559 ± 0.034
SerPhe: 1.728 ± 0.029
SerGly: 5.829 ± 0.055
SerHis: 1.01 ± 0.019
SerIle: 2.108 ± 0.03
SerLys: 1.332 ± 0.025
SerLeu: 5.126 ± 0.043
SerMet: 1.241 ± 0.018
SerAsn: 1.269 ± 0.024
SerPro: 3.029 ± 0.039
SerGln: 1.519 ± 0.024
SerArg: 3.71 ± 0.039
SerSer: 3.244 ± 0.045
SerThr: 3.781 ± 0.05
SerVal: 4.494 ± 0.044
SerTrp: 1.083 ± 0.02
SerTyr: 1.491 ± 0.028
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.364 ± 0.062
ThrCys: 0.418 ± 0.013
ThrAsp: 3.724 ± 0.041
ThrGlu: 3.127 ± 0.033
ThrPhe: 2.0 ± 0.032
ThrGly: 6.149 ± 0.058
ThrHis: 1.171 ± 0.02
ThrIle: 2.597 ± 0.035
ThrLys: 1.536 ± 0.027
ThrLeu: 5.868 ± 0.051
ThrMet: 1.09 ± 0.018
ThrAsn: 1.343 ± 0.024
ThrPro: 4.184 ± 0.049
ThrGln: 1.698 ± 0.029
ThrArg: 3.734 ± 0.037
ThrSer: 3.634 ± 0.04
ThrThr: 4.481 ± 0.057
ThrVal: 6.013 ± 0.054
ThrTrp: 1.086 ± 0.024
ThrTyr: 1.647 ± 0.029
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.953 ± 0.08
ValCys: 0.615 ± 0.015
ValAsp: 5.606 ± 0.051
ValGlu: 4.798 ± 0.045
ValPhe: 2.544 ± 0.038
ValGly: 7.04 ± 0.056
ValHis: 1.751 ± 0.025
ValIle: 3.568 ± 0.037
ValLys: 1.966 ± 0.034
ValLeu: 9.867 ± 0.073
ValMet: 1.474 ± 0.027
ValAsn: 1.87 ± 0.029
ValPro: 5.022 ± 0.046
ValGln: 2.521 ± 0.033
ValArg: 6.323 ± 0.043
ValSer: 4.627 ± 0.046
ValThr: 5.679 ± 0.05
ValVal: 9.053 ± 0.073
ValTrp: 1.155 ± 0.023
ValTyr: 1.618 ± 0.025
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.616 ± 0.023
TrpCys: 0.14 ± 0.006
TrpAsp: 0.88 ± 0.02
TrpGlu: 0.688 ± 0.017
TrpPhe: 0.619 ± 0.016
TrpGly: 1.083 ± 0.022
TrpHis: 0.423 ± 0.013
TrpIle: 0.744 ± 0.016
TrpLys: 0.435 ± 0.013
TrpLeu: 1.96 ± 0.03
TrpMet: 0.342 ± 0.013
TrpAsn: 0.487 ± 0.017
TrpPro: 0.895 ± 0.019
TrpGln: 0.704 ± 0.017
TrpArg: 1.271 ± 0.025
TrpSer: 1.185 ± 0.022
TrpThr: 1.159 ± 0.02
TrpVal: 1.13 ± 0.019
TrpTrp: 0.401 ± 0.012
TrpTyr: 0.428 ± 0.012
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.721 ± 0.031
TyrCys: 0.195 ± 0.01
TyrAsp: 1.891 ± 0.03
TyrGlu: 1.181 ± 0.024
TyrPhe: 0.773 ± 0.019
TyrGly: 2.334 ± 0.031
TyrHis: 0.442 ± 0.013
TyrIle: 0.58 ± 0.016
TyrLys: 0.508 ± 0.015
TyrLeu: 2.445 ± 0.033
TyrMet: 0.246 ± 0.009
TyrAsn: 0.531 ± 0.017
TyrPro: 1.219 ± 0.021
TyrGln: 0.749 ± 0.017
TyrArg: 1.863 ± 0.027
TyrSer: 1.199 ± 0.019
TyrThr: 1.268 ± 0.024
TyrVal: 2.009 ± 0.03
TyrTrp: 0.413 ± 0.013
TyrTyr: 0.577 ± 0.013
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8271 proteins (2671487 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
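Protein-level bootstrapping can be sketched as follows. This is a minimal illustration, not the Proteome-pI implementation: it assumes that "x100" means 100 replicates, that each replicate resamples whole proteins with replacement, and that the reported error is the sample standard deviation of the replicate estimates. The function names and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(sequences, dipeptide):
    """Frequency (per mille) of one dipeptide over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of the frequency across bootstrap replicates.

    Each replicate draws len(sequences) whole proteins with replacement,
    so the resampling unit is the protein, not the individual residue.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy sequences in one-letter code, not real proteome data.
proteome = ["MAALGA", "MKVLAA", "MGAAL", "MSTAV"]
err = bootstrap_error(proteome, "AA")
```

Resampling whole proteins rather than single dipeptides keeps residues from the same protein together, so the estimate reflects variation between proteins.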

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski