Amino acid dipeptide frequency for Sinomonas atrocyanea

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies, i.e. how often each ordered pair of adjacent residues occurs in the proteome.
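As a rough illustration of how such frequencies can be computed, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. It is not the database's own code: the function name `dipeptide_permille` is hypothetical, sequences are assumed to use one-letter codes (the table uses three-letter codes), and pairs are assumed never to span two proteins.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins.

    Assumption: every overlapping window of two adjacent residues counts
    once, and windows never cross the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: "AAG" yields AA and AG, "AG" yields AG, so 3 pairs in total.
freqs = dipeptide_permille(["AAG", "AG"])
```

In this toy example `AG` accounts for two of the three pairs and `AA` for one, so the frequencies always sum to 1000‰.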

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
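The unit conversion and the comparison with the uniform expectation can be sketched as follows. The helper `classify` is hypothetical, and it assumes the simple 1/400 baseline described above; whether the page's colouring also takes the ± error into account is not stated, so this sketch ignores it.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille
# (1000 / 400 = 2.5, i.e. 0.25 %).
EXPECTED_PERMILLE = 1000.0 / 400


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Return (fraction, ratio to expectation, label) for one table value."""
    fraction = value_permille * 1e-3       # per mille -> fraction
    ratio = value_permille / expected      # >1 means more than expected
    if value_permille > expected:
        label = "over"
    elif value_permille < expected:
        label = "under"
    else:
        label = "as expected"
    return fraction, ratio, label


# Two values taken from the table: AlaAla (24.895) and CysCys (0.074).
ala_ala = classify(24.895)
cys_cys = classify(0.074)
```

Under this baseline AlaAla is roughly ten times more frequent than expected, while CysCys falls far below the 2.5‰ mark.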

For more information, see the sequence space article on Wikipedia.

Rows give the first residue, columns the second: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 24.895 ± 0.259
AlaCys: 0.922 ± 0.027
AlaAsp: 7.615 ± 0.086
AlaGlu: 9.227 ± 0.122
AlaPhe: 4.05 ± 0.062
AlaGly: 14.278 ± 0.127
AlaHis: 2.917 ± 0.052
AlaIle: 4.778 ± 0.064
AlaLys: 3.137 ± 0.062
AlaLeu: 14.628 ± 0.138
AlaMet: 2.736 ± 0.045
AlaAsn: 2.386 ± 0.042
AlaPro: 7.702 ± 0.106
AlaGln: 4.389 ± 0.057
AlaArg: 9.833 ± 0.109
AlaSer: 7.703 ± 0.095
AlaThr: 6.741 ± 0.097
AlaVal: 12.655 ± 0.131
AlaTrp: 1.956 ± 0.042
AlaTyr: 2.614 ± 0.049
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.866 ± 0.028
CysCys: 0.074 ± 0.009
CysAsp: 0.31 ± 0.014
CysGlu: 0.304 ± 0.016
CysPhe: 0.201 ± 0.012
CysGly: 0.784 ± 0.025
CysHis: 0.15 ± 0.012
CysIle: 0.199 ± 0.014
CysLys: 0.094 ± 0.008
CysLeu: 0.592 ± 0.02
CysMet: 0.103 ± 0.009
CysAsn: 0.113 ± 0.009
CysPro: 0.4 ± 0.019
CysGln: 0.164 ± 0.011
CysArg: 0.483 ± 0.022
CysSer: 0.404 ± 0.017
CysThr: 0.436 ± 0.019
CysVal: 0.448 ± 0.019
CysTrp: 0.097 ± 0.011
CysTyr: 0.146 ± 0.011
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.971 ± 0.093
AspCys: 0.285 ± 0.017
AspAsp: 2.829 ± 0.049
AspGlu: 3.463 ± 0.056
AspPhe: 1.693 ± 0.039
AspGly: 5.753 ± 0.084
AspHis: 1.202 ± 0.038
AspIle: 1.925 ± 0.046
AspLys: 0.94 ± 0.036
AspLeu: 5.676 ± 0.074
AspMet: 0.787 ± 0.025
AspAsn: 0.777 ± 0.026
AspPro: 4.039 ± 0.054
AspGln: 1.456 ± 0.032
AspArg: 3.886 ± 0.064
AspSer: 2.45 ± 0.051
AspThr: 2.53 ± 0.044
AspVal: 4.627 ± 0.072
AspTrp: 0.839 ± 0.026
AspTyr: 1.241 ± 0.029
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.451 ± 0.108
GluCys: 0.3 ± 0.016
GluAsp: 3.4 ± 0.056
GluGlu: 3.553 ± 0.062
GluPhe: 1.64 ± 0.038
GluGly: 4.563 ± 0.067
GluHis: 1.62 ± 0.036
GluIle: 2.31 ± 0.046
GluLys: 1.462 ± 0.037
GluLeu: 6.441 ± 0.091
GluMet: 0.894 ± 0.024
GluAsn: 1.112 ± 0.032
GluPro: 3.082 ± 0.056
GluGln: 2.118 ± 0.038
GluArg: 4.966 ± 0.071
GluSer: 2.567 ± 0.049
GluThr: 2.685 ± 0.049
GluVal: 4.389 ± 0.063
GluTrp: 0.778 ± 0.026
GluTyr: 1.103 ± 0.031
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.131 ± 0.07
PheCys: 0.248 ± 0.012
PheAsp: 1.993 ± 0.045
PheGlu: 1.692 ± 0.04
PhePhe: 1.03 ± 0.032
PheGly: 3.358 ± 0.055
PheHis: 0.625 ± 0.022
PheIle: 1.073 ± 0.033
PheLys: 0.55 ± 0.023
PheLeu: 2.885 ± 0.048
PheMet: 0.526 ± 0.02
PheAsn: 0.672 ± 0.026
PhePro: 1.519 ± 0.034
PheGln: 0.742 ± 0.031
PheArg: 1.797 ± 0.037
PheSer: 1.751 ± 0.044
PheThr: 2.0 ± 0.044
PheVal: 2.499 ± 0.044
PheTrp: 0.446 ± 0.018
PheTyr: 0.718 ± 0.025
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 12.414 ± 0.125
GlyCys: 0.68 ± 0.023
GlyAsp: 4.215 ± 0.061
GlyGlu: 4.847 ± 0.066
GlyPhe: 3.13 ± 0.061
GlyGly: 8.597 ± 0.089
GlyHis: 2.246 ± 0.04
GlyIle: 4.441 ± 0.062
GlyLys: 2.327 ± 0.049
GlyLeu: 9.768 ± 0.096
GlyMet: 2.013 ± 0.04
GlyAsn: 1.813 ± 0.042
GlyPro: 5.031 ± 0.071
GlyGln: 3.027 ± 0.05
GlyArg: 7.362 ± 0.09
GlySer: 5.823 ± 0.065
GlyThr: 6.416 ± 0.08
GlyVal: 7.427 ± 0.076
GlyTrp: 1.724 ± 0.038
GlyTyr: 2.402 ± 0.049
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.692 ± 0.045
HisCys: 0.164 ± 0.01
HisAsp: 1.232 ± 0.032
HisGlu: 1.297 ± 0.03
HisPhe: 0.616 ± 0.021
HisGly: 2.352 ± 0.04
HisHis: 0.597 ± 0.025
HisIle: 0.658 ± 0.021
HisLys: 0.307 ± 0.015
HisLeu: 2.213 ± 0.041
HisMet: 0.36 ± 0.017
HisAsn: 0.36 ± 0.019
HisPro: 1.593 ± 0.034
HisGln: 0.556 ± 0.023
HisArg: 1.814 ± 0.039
HisSer: 1.092 ± 0.031
HisThr: 1.102 ± 0.031
HisVal: 1.674 ± 0.037
HisTrp: 0.351 ± 0.015
HisTyr: 0.471 ± 0.018
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.66 ± 0.069
IleCys: 0.259 ± 0.014
IleAsp: 2.39 ± 0.049
IleGlu: 2.325 ± 0.048
IlePhe: 1.001 ± 0.033
IleGly: 4.044 ± 0.063
IleHis: 0.692 ± 0.022
IleIle: 1.404 ± 0.039
IleLys: 0.786 ± 0.03
IleLeu: 3.456 ± 0.053
IleMet: 0.676 ± 0.024
IleAsn: 0.833 ± 0.027
IlePro: 2.075 ± 0.039
IleGln: 0.95 ± 0.027
IleArg: 2.434 ± 0.045
IleSer: 2.048 ± 0.035
IleThr: 2.249 ± 0.053
IleVal: 3.623 ± 0.051
IleTrp: 0.426 ± 0.018
IleTyr: 0.677 ± 0.026
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.1 ± 0.062
LysCys: 0.092 ± 0.007
LysAsp: 1.454 ± 0.037
LysGlu: 1.143 ± 0.033
LysPhe: 0.56 ± 0.023
LysGly: 1.857 ± 0.04
LysHis: 0.424 ± 0.018
LysIle: 0.953 ± 0.031
LysLys: 0.773 ± 0.029
LysLeu: 1.892 ± 0.044
LysMet: 0.431 ± 0.019
LysAsn: 0.507 ± 0.022
LysPro: 1.234 ± 0.035
LysGln: 0.658 ± 0.024
LysArg: 1.336 ± 0.034
LysSer: 1.122 ± 0.035
LysThr: 1.243 ± 0.032
LysVal: 1.984 ± 0.046
LysTrp: 0.22 ± 0.013
LysTyr: 0.5 ± 0.019
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 16.291 ± 0.141
LeuCys: 0.636 ± 0.023
LeuAsp: 6.044 ± 0.08
LeuGlu: 5.7 ± 0.079
LeuPhe: 2.793 ± 0.052
LeuGly: 10.347 ± 0.107
LeuHis: 2.008 ± 0.039
LeuIle: 3.52 ± 0.056
LeuLys: 2.101 ± 0.047
LeuLeu: 9.912 ± 0.118
LeuMet: 1.719 ± 0.041
LeuAsn: 1.872 ± 0.044
LeuPro: 5.784 ± 0.066
LeuGln: 2.369 ± 0.042
LeuArg: 7.439 ± 0.073
LeuSer: 5.464 ± 0.07
LeuThr: 5.791 ± 0.072
LeuVal: 9.067 ± 0.103
LeuTrp: 1.29 ± 0.033
LeuTyr: 1.68 ± 0.038
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.57 ± 0.042
MetCys: 0.116 ± 0.009
MetAsp: 1.007 ± 0.029
MetGlu: 0.788 ± 0.025
MetPhe: 0.523 ± 0.023
MetGly: 1.588 ± 0.036
MetHis: 0.324 ± 0.017
MetIle: 0.692 ± 0.026
MetLys: 0.48 ± 0.019
MetLeu: 1.758 ± 0.037
MetMet: 0.313 ± 0.018
MetAsn: 0.468 ± 0.019
MetPro: 1.085 ± 0.029
MetGln: 0.48 ± 0.018
MetArg: 1.177 ± 0.029
MetSer: 1.431 ± 0.035
MetThr: 1.435 ± 0.031
MetVal: 1.449 ± 0.038
MetTrp: 0.207 ± 0.013
MetTyr: 0.305 ± 0.016
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.462 ± 0.047
AsnCys: 0.135 ± 0.011
AsnAsp: 0.942 ± 0.03
AsnGlu: 0.924 ± 0.029
AsnPhe: 0.607 ± 0.023
AsnGly: 1.862 ± 0.046
AsnHis: 0.416 ± 0.018
AsnIle: 0.781 ± 0.025
AsnLys: 0.387 ± 0.017
AsnLeu: 1.941 ± 0.04
AsnMet: 0.321 ± 0.016
AsnAsn: 0.459 ± 0.02
AsnPro: 1.493 ± 0.035
AsnGln: 0.572 ± 0.023
AsnArg: 1.215 ± 0.03
AsnSer: 1.002 ± 0.03
AsnThr: 1.056 ± 0.034
AsnVal: 1.589 ± 0.036
AsnTrp: 0.307 ± 0.016
AsnTyr: 0.483 ± 0.019
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 9.043 ± 0.113
ProCys: 0.261 ± 0.016
ProAsp: 3.637 ± 0.056
ProGlu: 4.272 ± 0.05
ProPhe: 1.674 ± 0.033
ProGly: 6.235 ± 0.084
ProHis: 1.24 ± 0.03
ProIle: 1.697 ± 0.034
ProLys: 1.107 ± 0.036
ProLeu: 5.215 ± 0.066
ProMet: 0.956 ± 0.028
ProAsn: 0.984 ± 0.025
ProPro: 2.652 ± 0.068
ProGln: 1.765 ± 0.045
ProArg: 3.694 ± 0.067
ProSer: 3.611 ± 0.067
ProThr: 3.149 ± 0.072
ProVal: 4.751 ± 0.06
ProTrp: 0.891 ± 0.029
ProTyr: 1.109 ± 0.03
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.889 ± 0.059
GlnCys: 0.16 ± 0.013
GlnAsp: 1.437 ± 0.036
GlnGlu: 1.455 ± 0.036
GlnPhe: 0.873 ± 0.024
GlnGly: 2.425 ± 0.048
GlnHis: 0.67 ± 0.023
GlnIle: 1.394 ± 0.033
GlnLys: 0.765 ± 0.029
GlnLeu: 3.196 ± 0.056
GlnMet: 0.573 ± 0.021
GlnAsn: 0.64 ± 0.026
GlnPro: 1.674 ± 0.042
GlnGln: 1.226 ± 0.039
GlnArg: 2.348 ± 0.044
GlnSer: 1.437 ± 0.035
GlnThr: 1.477 ± 0.036
GlnVal: 2.155 ± 0.041
GlnTrp: 0.447 ± 0.015
GlnTyr: 0.667 ± 0.023
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.323 ± 0.103
ArgCys: 0.453 ± 0.021
ArgAsp: 3.577 ± 0.053
ArgGlu: 4.414 ± 0.063
ArgPhe: 2.272 ± 0.047
ArgGly: 5.803 ± 0.068
ArgHis: 1.738 ± 0.036
ArgIle: 3.287 ± 0.051
ArgLys: 1.439 ± 0.035
ArgLeu: 8.0 ± 0.104
ArgMet: 1.538 ± 0.036
ArgAsn: 1.281 ± 0.034
ArgPro: 4.311 ± 0.075
ArgGln: 2.091 ± 0.039
ArgArg: 7.079 ± 0.105
ArgSer: 3.986 ± 0.067
ArgThr: 4.232 ± 0.072
ArgVal: 5.306 ± 0.063
ArgTrp: 1.219 ± 0.036
ArgTyr: 1.521 ± 0.042
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.724 ± 0.094
SerCys: 0.34 ± 0.016
SerAsp: 2.501 ± 0.052
SerGlu: 2.691 ± 0.047
SerPhe: 1.806 ± 0.04
SerGly: 5.898 ± 0.079
SerHis: 1.058 ± 0.03
SerIle: 2.155 ± 0.04
SerLys: 1.181 ± 0.035
SerLeu: 5.386 ± 0.069
SerMet: 1.15 ± 0.027
SerAsn: 1.013 ± 0.03
SerPro: 3.491 ± 0.062
SerGln: 1.598 ± 0.042
SerArg: 3.756 ± 0.054
SerSer: 3.498 ± 0.069
SerThr: 3.213 ± 0.054
SerVal: 4.529 ± 0.061
SerTrp: 0.904 ± 0.026
SerTyr: 1.172 ± 0.031
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.205 ± 0.095
ThrCys: 0.342 ± 0.016
ThrAsp: 2.955 ± 0.054
ThrGlu: 2.879 ± 0.044
ThrPhe: 1.751 ± 0.04
ThrGly: 5.76 ± 0.078
ThrHis: 1.061 ± 0.028
ThrIle: 2.07 ± 0.044
ThrLys: 1.16 ± 0.03
ThrLeu: 5.381 ± 0.075
ThrMet: 0.967 ± 0.029
ThrAsn: 0.999 ± 0.029
ThrPro: 3.674 ± 0.051
ThrGln: 1.435 ± 0.03
ThrArg: 3.272 ± 0.048
ThrSer: 3.178 ± 0.057
ThrThr: 3.309 ± 0.081
ThrVal: 5.701 ± 0.074
ThrTrp: 0.802 ± 0.025
ThrTyr: 1.168 ± 0.04
ThrXaa: 0.0 ± 0.0
Val
ValAla: 11.211 ± 0.1
ValCys: 0.622 ± 0.024
ValAsp: 4.883 ± 0.067
ValGlu: 4.612 ± 0.075
ValPhe: 2.722 ± 0.043
ValGly: 7.013 ± 0.083
ValHis: 1.826 ± 0.038
ValIle: 3.322 ± 0.063
ValLys: 1.768 ± 0.039
ValLeu: 9.614 ± 0.107
ValMet: 1.472 ± 0.036
ValAsn: 1.81 ± 0.036
ValPro: 5.267 ± 0.066
ValGln: 2.169 ± 0.044
ValArg: 6.125 ± 0.083
ValSer: 4.493 ± 0.064
ValThr: 4.927 ± 0.066
ValVal: 8.27 ± 0.11
ValTrp: 1.115 ± 0.028
ValTyr: 1.571 ± 0.041
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.763 ± 0.041
TrpCys: 0.119 ± 0.009
TrpAsp: 0.81 ± 0.026
TrpGlu: 0.707 ± 0.022
TrpPhe: 0.543 ± 0.02
TrpGly: 1.165 ± 0.034
TrpHis: 0.336 ± 0.016
TrpIle: 0.679 ± 0.023
TrpLys: 0.353 ± 0.017
TrpLeu: 1.721 ± 0.041
TrpMet: 0.328 ± 0.016
TrpAsn: 0.393 ± 0.019
TrpPro: 0.723 ± 0.025
TrpGln: 0.515 ± 0.019
TrpArg: 1.154 ± 0.029
TrpSer: 0.817 ± 0.029
TrpThr: 0.865 ± 0.029
TrpVal: 1.106 ± 0.031
TrpTrp: 0.321 ± 0.018
TrpTyr: 0.291 ± 0.016
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.499 ± 0.051
TyrCys: 0.161 ± 0.011
TyrAsp: 1.14 ± 0.034
TyrGlu: 1.137 ± 0.034
TyrPhe: 0.772 ± 0.025
TyrGly: 2.02 ± 0.035
TyrHis: 0.351 ± 0.017
TyrIle: 0.667 ± 0.022
TyrLys: 0.366 ± 0.016
TyrLeu: 2.152 ± 0.038
TyrMet: 0.332 ± 0.017
TyrAsn: 0.459 ± 0.02
TyrPro: 1.116 ± 0.036
TyrGln: 0.631 ± 0.026
TyrArg: 1.701 ± 0.038
TyrSer: 1.186 ± 0.028
TyrThr: 1.215 ± 0.036
TyrVal: 1.563 ± 0.031
TyrTrp: 0.37 ± 0.017
TyrTyr: 0.529 ± 0.023
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4104 proteins (1325928 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
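A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread across replicates serves as the error. This is only an illustration under stated assumptions: the names `pair_counts` and `bootstrap_error` are hypothetical, sequences use one-letter codes, and the error is taken to be the standard deviation over 100 replicates, which the page does not specify.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs within one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a pair's per mille frequency.

    Each replicate draws len(proteins) whole proteins with replacement,
    so residues from the same protein are always resampled together.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy proteome; real input would be the full set of protein sequences.
mean_aa, sd_aa = bootstrap_error(["AAG", "AGA", "GAAA", "AG"], "AA")
```

Resampling whole proteins rather than individual dipeptides keeps the correlation between pairs from the same sequence, which is presumably why the error is estimated at the protein level.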

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski