Amino acid dipeptide frequency for Pseudomonas guineae

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely to occur in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
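As a rough illustration of the calculation described above, the following Python sketch counts overlapping dipeptides in a set of protein sequences and compares each frequency with the uniform expectation of 1/400 = 2.5‰. The function names and the toy sequences are hypothetical and are not taken from Proteome-pI; the database's own pipeline may differ, for instance in how it handles non-standard residues or defines the expected value.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # 20 standard amino acids (one-letter codes)
UNIFORM_PER_MILLE = 1000.0 / 400        # 2.5 per mille if all 400 dipeptides were equally likely


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues is mapped to 'X' (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille):
    """Label a dipeptide relative to the uniform expectation."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"   # corresponds to red in the table
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"  # corresponds to blue in the table
    return "expected"


# Toy input, not real proteome data.
toy = ["MALLA", "MKLA"]
freqs = dipeptide_per_mille(toy)
```

With the table values, `classify(13.144)` (AlaAla) gives "over" and `classify(0.144)` (CysCys) gives "under".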

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
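The unit conversion can be checked with a few lines of Python. The value 13.144 is the AlaAla entry from the table; the helper names are illustrative only.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 13.144                              # AlaAla frequency in per mille
fraction = per_mille_to_fraction(ala_ala)     # about 0.013144
percent = per_mille_to_percent(ala_ala)       # about 1.3144 percent
uniform_percent = per_mille_to_percent(2.5)   # uniform expectation, 0.25 percent
```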

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 13.144 ± 0.131
AlaCys: 1.285 ± 0.033
AlaAsp: 5.978 ± 0.078
AlaGlu: 7.749 ± 0.095
AlaPhe: 3.863 ± 0.057
AlaGly: 8.947 ± 0.092
AlaHis: 2.249 ± 0.047
AlaIle: 5.332 ± 0.083
AlaLys: 4.06 ± 0.072
AlaLeu: 14.164 ± 0.142
AlaMet: 2.917 ± 0.056
AlaAsn: 3.085 ± 0.052
AlaPro: 4.436 ± 0.07
AlaGln: 5.582 ± 0.082
AlaArg: 6.557 ± 0.091
AlaSer: 6.143 ± 0.09
AlaThr: 4.6 ± 0.071
AlaVal: 7.572 ± 0.093
AlaTrp: 1.572 ± 0.042
AlaTyr: 2.529 ± 0.048
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.196 ± 0.033
CysCys: 0.144 ± 0.011
CysAsp: 0.553 ± 0.023
CysGlu: 0.573 ± 0.021
CysPhe: 0.339 ± 0.017
CysGly: 0.986 ± 0.034
CysHis: 0.288 ± 0.017
CysIle: 0.478 ± 0.019
CysLys: 0.322 ± 0.016
CysLeu: 1.135 ± 0.028
CysMet: 0.228 ± 0.013
CysAsn: 0.292 ± 0.014
CysPro: 0.499 ± 0.023
CysGln: 0.436 ± 0.021
CysArg: 0.608 ± 0.026
CysSer: 0.645 ± 0.026
CysThr: 0.474 ± 0.02
CysVal: 0.671 ± 0.024
CysTrp: 0.166 ± 0.013
CysTyr: 0.279 ± 0.016
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.705 ± 0.069
AspCys: 0.557 ± 0.022
AspAsp: 2.804 ± 0.066
AspGlu: 3.522 ± 0.062
AspPhe: 2.071 ± 0.046
AspGly: 4.292 ± 0.072
AspHis: 1.041 ± 0.032
AspIle: 2.703 ± 0.053
AspLys: 2.003 ± 0.042
AspLeu: 5.76 ± 0.075
AspMet: 1.252 ± 0.029
AspAsn: 1.578 ± 0.039
AspPro: 2.571 ± 0.049
AspGln: 2.148 ± 0.043
AspArg: 2.713 ± 0.048
AspSer: 3.143 ± 0.056
AspThr: 2.22 ± 0.047
AspVal: 3.491 ± 0.053
AspTrp: 0.953 ± 0.031
AspTyr: 1.727 ± 0.041
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.531 ± 0.093
GluCys: 0.458 ± 0.019
GluAsp: 2.523 ± 0.045
GluGlu: 3.099 ± 0.058
GluPhe: 1.847 ± 0.045
GluGly: 3.902 ± 0.06
GluHis: 1.636 ± 0.038
GluIle: 2.82 ± 0.056
GluLys: 2.117 ± 0.045
GluLeu: 7.657 ± 0.09
GluMet: 1.398 ± 0.038
GluAsn: 1.516 ± 0.04
GluPro: 2.411 ± 0.044
GluGln: 4.355 ± 0.072
GluArg: 4.836 ± 0.067
GluSer: 2.703 ± 0.05
GluThr: 2.441 ± 0.046
GluVal: 4.374 ± 0.056
GluTrp: 0.706 ± 0.024
GluTyr: 1.215 ± 0.032
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.243 ± 0.065
PheCys: 0.44 ± 0.021
PheAsp: 2.374 ± 0.048
PheGlu: 2.053 ± 0.046
PhePhe: 1.417 ± 0.04
PheGly: 3.108 ± 0.056
PheHis: 0.714 ± 0.025
PheIle: 1.972 ± 0.047
PheLys: 1.328 ± 0.029
PheLeu: 3.273 ± 0.056
PheMet: 0.833 ± 0.031
PheAsn: 1.437 ± 0.036
PhePro: 1.36 ± 0.032
PheGln: 1.225 ± 0.031
PheArg: 1.757 ± 0.038
PheSer: 2.696 ± 0.05
PheThr: 1.827 ± 0.041
PheVal: 2.454 ± 0.051
PheTrp: 0.551 ± 0.025
PheTyr: 1.046 ± 0.029
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.128 ± 0.091
GlyCys: 0.983 ± 0.03
GlyAsp: 3.862 ± 0.064
GlyGlu: 4.823 ± 0.068
GlyPhe: 3.251 ± 0.064
GlyGly: 5.862 ± 0.098
GlyHis: 1.817 ± 0.043
GlyIle: 4.215 ± 0.068
GlyLys: 3.415 ± 0.062
GlyLeu: 9.409 ± 0.114
GlyMet: 2.3 ± 0.046
GlyAsn: 2.418 ± 0.057
GlyPro: 2.416 ± 0.048
GlyGln: 3.717 ± 0.054
GlyArg: 4.566 ± 0.066
GlySer: 4.623 ± 0.066
GlyThr: 3.49 ± 0.065
GlyVal: 5.832 ± 0.079
GlyTrp: 1.283 ± 0.041
GlyTyr: 2.394 ± 0.044
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.359 ± 0.043
HisCys: 0.356 ± 0.018
HisAsp: 1.14 ± 0.034
HisGlu: 1.165 ± 0.036
HisPhe: 1.001 ± 0.026
HisGly: 1.857 ± 0.041
HisHis: 0.575 ± 0.024
HisIle: 1.049 ± 0.03
HisLys: 0.714 ± 0.022
HisLeu: 2.659 ± 0.048
HisMet: 0.518 ± 0.019
HisAsn: 0.734 ± 0.027
HisPro: 1.346 ± 0.036
HisGln: 0.995 ± 0.03
HisArg: 1.238 ± 0.03
HisSer: 1.328 ± 0.037
HisThr: 0.987 ± 0.032
HisVal: 1.314 ± 0.034
HisTrp: 0.404 ± 0.019
HisTyr: 0.74 ± 0.028
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.801 ± 0.076
IleCys: 0.517 ± 0.022
IleAsp: 3.284 ± 0.056
IleGlu: 3.455 ± 0.053
IlePhe: 1.523 ± 0.038
IleGly: 4.508 ± 0.06
IleHis: 0.937 ± 0.027
IleIle: 2.284 ± 0.056
IleLys: 1.912 ± 0.048
IleLeu: 4.207 ± 0.067
IleMet: 0.909 ± 0.03
IleAsn: 1.884 ± 0.046
IlePro: 2.237 ± 0.048
IleGln: 1.744 ± 0.039
IleArg: 2.858 ± 0.05
IleSer: 3.241 ± 0.057
IleThr: 2.565 ± 0.05
IleVal: 3.019 ± 0.054
IleTrp: 0.513 ± 0.022
IleTyr: 1.169 ± 0.031
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.128 ± 0.067
LysCys: 0.215 ± 0.014
LysAsp: 1.727 ± 0.04
LysGlu: 1.696 ± 0.042
LysPhe: 0.887 ± 0.029
LysGly: 2.552 ± 0.049
LysHis: 0.809 ± 0.027
LysIle: 1.697 ± 0.043
LysLys: 1.364 ± 0.046
LysLeu: 3.987 ± 0.057
LysMet: 0.741 ± 0.029
LysAsn: 1.057 ± 0.031
LysPro: 2.183 ± 0.051
LysGln: 1.839 ± 0.038
LysArg: 2.514 ± 0.048
LysSer: 1.865 ± 0.043
LysThr: 1.918 ± 0.038
LysVal: 2.837 ± 0.054
LysTrp: 0.354 ± 0.017
LysTyr: 0.691 ± 0.024
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 14.723 ± 0.141
LeuCys: 1.295 ± 0.037
LeuAsp: 6.713 ± 0.079
LeuGlu: 6.584 ± 0.083
LeuPhe: 4.306 ± 0.082
LeuGly: 9.358 ± 0.107
LeuHis: 2.622 ± 0.049
LeuIle: 5.782 ± 0.082
LeuLys: 4.551 ± 0.063
LeuLeu: 15.229 ± 0.214
LeuMet: 2.626 ± 0.053
LeuAsn: 4.001 ± 0.06
LeuPro: 6.435 ± 0.088
LeuGln: 5.407 ± 0.088
LeuArg: 7.349 ± 0.092
LeuSer: 7.782 ± 0.104
LeuThr: 5.74 ± 0.073
LeuVal: 7.791 ± 0.08
LeuTrp: 1.456 ± 0.037
LeuTyr: 2.576 ± 0.045
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.743 ± 0.05
MetCys: 0.167 ± 0.011
MetAsp: 1.01 ± 0.032
MetGlu: 0.939 ± 0.025
MetPhe: 0.678 ± 0.021
MetGly: 1.76 ± 0.045
MetHis: 0.595 ± 0.02
MetIle: 1.117 ± 0.032
MetLys: 0.817 ± 0.025
MetLeu: 2.929 ± 0.058
MetMet: 0.474 ± 0.022
MetAsn: 0.83 ± 0.024
MetPro: 1.426 ± 0.038
MetGln: 1.217 ± 0.033
MetArg: 1.553 ± 0.032
MetSer: 1.727 ± 0.034
MetThr: 1.385 ± 0.035
MetVal: 1.553 ± 0.032
MetTrp: 0.191 ± 0.011
MetTyr: 0.354 ± 0.018
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.387 ± 0.05
AsnCys: 0.328 ± 0.018
AsnAsp: 1.562 ± 0.04
AsnGlu: 1.492 ± 0.037
AsnPhe: 1.057 ± 0.03
AsnGly: 2.474 ± 0.052
AsnHis: 0.629 ± 0.022
AsnIle: 1.47 ± 0.036
AsnLys: 1.03 ± 0.032
AsnLeu: 3.659 ± 0.055
AsnMet: 0.647 ± 0.022
AsnAsn: 1.012 ± 0.032
AsnPro: 1.976 ± 0.045
AsnGln: 1.486 ± 0.039
AsnArg: 1.877 ± 0.035
AsnSer: 1.8 ± 0.043
AsnThr: 1.52 ± 0.038
AsnVal: 1.933 ± 0.051
AsnTrp: 0.494 ± 0.017
AsnTyr: 0.774 ± 0.028
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.521 ± 0.076
ProCys: 0.39 ± 0.019
ProAsp: 2.524 ± 0.047
ProGlu: 3.167 ± 0.052
ProPhe: 1.748 ± 0.039
ProGly: 3.628 ± 0.056
ProHis: 1.004 ± 0.033
ProIle: 2.031 ± 0.041
ProLys: 1.555 ± 0.046
ProLeu: 5.856 ± 0.079
ProMet: 1.171 ± 0.03
ProAsn: 1.422 ± 0.036
ProPro: 1.724 ± 0.038
ProGln: 2.324 ± 0.044
ProArg: 2.429 ± 0.048
ProSer: 2.666 ± 0.043
ProThr: 2.099 ± 0.046
ProVal: 3.56 ± 0.058
ProTrp: 0.728 ± 0.025
ProTyr: 1.139 ± 0.032
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 6.452 ± 0.093
GlnCys: 0.355 ± 0.019
GlnAsp: 1.899 ± 0.038
GlnGlu: 2.001 ± 0.047
GlnPhe: 1.505 ± 0.043
GlnGly: 3.631 ± 0.058
GlnHis: 1.358 ± 0.035
GlnIle: 2.15 ± 0.049
GlnLys: 1.217 ± 0.032
GlnLeu: 6.522 ± 0.109
GlnMet: 1.083 ± 0.03
GlnAsn: 1.104 ± 0.03
GlnPro: 2.69 ± 0.056
GlnGln: 3.312 ± 0.083
GlnArg: 4.034 ± 0.082
GlnSer: 2.471 ± 0.046
GlnThr: 1.922 ± 0.042
GlnVal: 3.787 ± 0.061
GlnTrp: 0.684 ± 0.024
GlnTyr: 0.96 ± 0.03
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.866 ± 0.075
ArgCys: 0.598 ± 0.022
ArgAsp: 3.406 ± 0.055
ArgGlu: 4.394 ± 0.062
ArgPhe: 2.62 ± 0.052
ArgGly: 3.949 ± 0.055
ArgHis: 1.559 ± 0.037
ArgIle: 3.31 ± 0.06
ArgLys: 2.015 ± 0.047
ArgLeu: 8.223 ± 0.11
ArgMet: 1.572 ± 0.028
ArgAsn: 1.971 ± 0.039
ArgPro: 2.434 ± 0.046
ArgGln: 3.295 ± 0.057
ArgArg: 3.846 ± 0.06
ArgSer: 3.426 ± 0.046
ArgThr: 2.512 ± 0.05
ArgVal: 4.253 ± 0.065
ArgTrp: 0.981 ± 0.032
ArgTyr: 2.002 ± 0.04
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.519 ± 0.086
SerCys: 0.551 ± 0.025
SerAsp: 2.903 ± 0.047
SerGlu: 3.349 ± 0.054
SerPhe: 2.251 ± 0.048
SerGly: 5.178 ± 0.078
SerHis: 1.323 ± 0.037
SerIle: 2.779 ± 0.049
SerLys: 1.943 ± 0.047
SerLeu: 7.426 ± 0.099
SerMet: 1.415 ± 0.035
SerAsn: 1.949 ± 0.043
SerPro: 2.573 ± 0.045
SerGln: 2.675 ± 0.052
SerArg: 3.533 ± 0.049
SerSer: 3.644 ± 0.066
SerThr: 2.76 ± 0.059
SerVal: 3.903 ± 0.061
SerTrp: 0.883 ± 0.027
SerTyr: 1.425 ± 0.036
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.124 ± 0.07
ThrCys: 0.479 ± 0.022
ThrAsp: 2.337 ± 0.048
ThrGlu: 2.376 ± 0.045
ThrPhe: 1.707 ± 0.039
ThrGly: 3.868 ± 0.07
ThrHis: 0.992 ± 0.028
ThrIle: 1.913 ± 0.044
ThrLys: 1.1 ± 0.034
ThrLeu: 6.485 ± 0.082
ThrMet: 0.802 ± 0.029
ThrAsn: 1.139 ± 0.035
ThrPro: 2.9 ± 0.053
ThrGln: 2.015 ± 0.045
ThrArg: 2.835 ± 0.05
ThrSer: 2.497 ± 0.06
ThrThr: 2.186 ± 0.043
ThrVal: 3.151 ± 0.057
ThrTrp: 0.645 ± 0.023
ThrTyr: 1.175 ± 0.036
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.451 ± 0.11
ValCys: 0.716 ± 0.025
ValAsp: 3.751 ± 0.056
ValGlu: 4.459 ± 0.069
ValPhe: 2.543 ± 0.047
ValGly: 5.051 ± 0.061
ValHis: 1.38 ± 0.039
ValIle: 3.863 ± 0.058
ValLys: 2.384 ± 0.054
ValLeu: 8.338 ± 0.083
ValMet: 1.766 ± 0.041
ValAsn: 2.078 ± 0.048
ValPro: 3.08 ± 0.053
ValGln: 2.875 ± 0.057
ValArg: 4.21 ± 0.061
ValSer: 4.219 ± 0.063
ValThr: 3.421 ± 0.066
ValVal: 5.196 ± 0.071
ValTrp: 0.802 ± 0.025
ValTyr: 1.545 ± 0.041
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.205 ± 0.036
TrpCys: 0.162 ± 0.012
TrpAsp: 0.608 ± 0.026
TrpGlu: 0.538 ± 0.023
TrpPhe: 0.506 ± 0.02
TrpGly: 0.884 ± 0.028
TrpHis: 0.381 ± 0.02
TrpIle: 0.611 ± 0.024
TrpLys: 0.382 ± 0.019
TrpLeu: 2.349 ± 0.06
TrpMet: 0.354 ± 0.018
TrpAsn: 0.393 ± 0.019
TrpPro: 0.693 ± 0.024
TrpGln: 1.06 ± 0.033
TrpArg: 1.062 ± 0.032
TrpSer: 0.761 ± 0.022
TrpThr: 0.57 ± 0.019
TrpVal: 0.931 ± 0.031
TrpTrp: 0.227 ± 0.014
TrpTyr: 0.343 ± 0.017
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.535 ± 0.046
TyrCys: 0.314 ± 0.015
TyrAsp: 1.297 ± 0.035
TyrGlu: 1.111 ± 0.034
TyrPhe: 0.989 ± 0.03
TyrGly: 2.009 ± 0.038
TyrHis: 0.57 ± 0.021
TyrIle: 0.989 ± 0.029
TyrLys: 0.83 ± 0.028
TyrLeu: 3.057 ± 0.057
TyrMet: 0.457 ± 0.018
TyrAsn: 0.71 ± 0.026
TyrPro: 1.298 ± 0.034
TyrGln: 1.431 ± 0.034
TyrArg: 1.861 ± 0.039
TyrSer: 1.601 ± 0.038
TyrThr: 1.09 ± 0.029
TyrVal: 1.507 ± 0.036
TyrTrp: 0.43 ± 0.019
TyrTyr: 0.705 ± 0.027
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3794 proteins (1233122 amino acids)

Note: The error was estimated by bootstrapping (100 rounds) at the protein level.
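One way such an error could be estimated is sketched below in Python: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread over the resamples is reported. This is a minimal sketch under assumptions. The page only states that bootstrapping was done 100 times at the protein level, so the use of the standard deviation as the error, the function names, and the toy sequences are all hypothetical.

```python
import random
import statistics


def dipeptide_frequency(sequences, dipeptide):
    """Frequency (per mille) of one dipeptide over overlapping residue pairs."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_frequency(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy input, not real proteome data.
toy = ["MALLA", "MKLA", "MAAAL", "MLLK"]
err = bootstrap_error(toy, "LL")
```

Resampling proteins rather than individual residue pairs keeps the pairs within one protein together, which is presumably why the note specifies the protein level.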

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski