Amino acid dipeptide frequency for Methylomonas sp. LWB

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
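As a rough illustration of how such frequencies can be computed, the sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. It is not the Proteome-pI implementation: the function names, the toy sequences, and the choice to map every non-standard one-letter code to Xaa are assumptions made for this example.

```python
from collections import Counter

# Three-letter codes for the 20 standard amino acids, keyed by one-letter code.
THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def dipeptide_counts(sequence):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    # Assumption: any code outside the 20 standard letters becomes Xaa.
    residues = [THREE_LETTER.get(aa, "Xaa") for aa in sequence.upper()]
    return Counter(a + b for a, b in zip(residues, residues[1:]))


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille over all given proteins."""
    total = Counter()
    for seq in sequences:
        # Pairs are counted within a protein only, never across proteins.
        total.update(dipeptide_counts(seq))
    n = sum(total.values())
    if n == 0:
        return {}
    return {pair: 1000.0 * count / n for pair, count in total.items()}


# Hypothetical toy proteome, not taken from Methylomonas sp. LWB.
toy_proteome = ["MAAL", "MALX"]
freqs = dipeptide_frequencies(toy_proteome)
for pair in sorted(freqs):
    print(f"{pair}: {freqs[pair]:.3f}")
```

Counting pairs per protein keeps the last residue of one sequence from being paired with the first residue of the next.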

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
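The uniform baseline can be checked with a few lines of Python. The sketch below compares a handful of values from the table with the 1/400 expectation; the `classify` helper and its simple greater-than or less-than rule are assumptions, since the page does not state the exact threshold used for the red and blue marking.

```python
N_STANDARD = 20

# Expected frequency of each dipeptide if all 400 were equally likely,
# expressed in per mille to match the table.
expected_permille = 1000.0 / N_STANDARD**2  # 2.5


def classify(observed_permille, expected=expected_permille):
    """Label a value relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# A few values copied from the table (per mille).
examples = {"AlaAla": 12.751, "LeuLeu": 11.976, "CysCys": 0.155, "TrpTrp": 0.213}

for pair, value in examples.items():
    ratio = value / expected_permille
    print(f"{pair}: {value} per mille, {classify(value)}represented"
          if classify(value) != "as expected"
          else f"{pair}: {value} per mille, as expected",
          f"(ratio to uniform: {ratio:.2f})")
```

A uniform baseline ignores the fact that single amino acids differ in abundance, so a dipeptide of two common residues such as AlaAla will exceed 2.5‰ almost by default.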

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
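To make the unit conversion concrete, the following sketch reads a single table entry containing text such as `AlaAla: 12.751 ± 0.139` and converts both the value and its error from per mille to plain fractions. The regular expression and the `parse_entry` name are assumptions for this example, not part of Proteome-pI.

```python
import re

# Matches "<First><Second>: <value> ± <error>" with three-letter residue codes.
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([\d.]+)\s*±\s*([\d.]+)")


def parse_entry(line):
    """Return (first, second, fraction, fraction_error), or None if no match."""
    match = ENTRY.search(line)
    if match is None:
        return None
    first, second, value, error = match.groups()
    # Per mille to fraction: multiply by 10^-3.
    return first, second, float(value) * 1e-3, float(error) * 1e-3


first, second, fraction, error = parse_entry("AlaAla: 12.751 ± 0.139")
print(first, second, fraction, error)
```

So an entry of 12.751‰ corresponds to a fraction of about 0.012751, or roughly 1.28% of all dipeptides.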

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 12.751 ± 0.139
AlaCys: 1.202 ± 0.031
AlaAsp: 7.018 ± 0.078
AlaGlu: 7.635 ± 0.089
AlaPhe: 3.676 ± 0.051
AlaGly: 9.167 ± 0.118
AlaHis: 1.948 ± 0.039
AlaIle: 5.795 ± 0.065
AlaLys: 4.533 ± 0.063
AlaLeu: 12.087 ± 0.103
AlaMet: 2.687 ± 0.05
AlaAsn: 3.775 ± 0.056
AlaPro: 3.936 ± 0.056
AlaGln: 3.877 ± 0.058
AlaArg: 5.472 ± 0.069
AlaSer: 5.722 ± 0.08
AlaThr: 4.897 ± 0.08
AlaVal: 7.401 ± 0.076
AlaTrp: 1.607 ± 0.035
AlaTyr: 2.885 ± 0.049
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.927 ± 0.025
CysCys: 0.155 ± 0.011
CysAsp: 0.57 ± 0.02
CysGlu: 0.521 ± 0.019
CysPhe: 0.394 ± 0.018
CysGly: 0.971 ± 0.026
CysHis: 0.331 ± 0.018
CysIle: 0.456 ± 0.018
CysLys: 0.338 ± 0.017
CysLeu: 1.161 ± 0.032
CysMet: 0.18 ± 0.011
CysAsn: 0.311 ± 0.014
CysPro: 0.53 ± 0.022
CysGln: 0.387 ± 0.014
CysArg: 0.782 ± 0.022
CysSer: 0.599 ± 0.022
CysThr: 0.393 ± 0.016
CysVal: 0.639 ± 0.021
CysTrp: 0.161 ± 0.01
CysTyr: 0.335 ± 0.015
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.931 ± 0.069
AspCys: 0.637 ± 0.024
AspAsp: 3.27 ± 0.056
AspGlu: 3.21 ± 0.054
AspPhe: 2.587 ± 0.043
AspGly: 4.72 ± 0.073
AspHis: 1.16 ± 0.027
AspIle: 3.588 ± 0.052
AspLys: 2.463 ± 0.044
AspLeu: 5.974 ± 0.071
AspMet: 1.167 ± 0.028
AspAsn: 1.941 ± 0.043
AspPro: 2.677 ± 0.052
AspGln: 2.256 ± 0.041
AspArg: 3.373 ± 0.053
AspSer: 3.381 ± 0.052
AspThr: 2.744 ± 0.052
AspVal: 3.642 ± 0.057
AspTrp: 1.122 ± 0.031
AspTyr: 2.095 ± 0.041
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.881 ± 0.076
GluCys: 0.478 ± 0.017
GluAsp: 2.466 ± 0.04
GluGlu: 2.595 ± 0.056
GluPhe: 2.32 ± 0.038
GluGly: 2.901 ± 0.051
GluHis: 1.549 ± 0.03
GluIle: 3.593 ± 0.056
GluLys: 2.733 ± 0.058
GluLeu: 6.737 ± 0.079
GluMet: 1.361 ± 0.032
GluAsn: 2.062 ± 0.043
GluPro: 2.307 ± 0.044
GluGln: 3.445 ± 0.06
GluArg: 4.006 ± 0.066
GluSer: 2.954 ± 0.046
GluThr: 3.246 ± 0.047
GluVal: 3.642 ± 0.053
GluTrp: 0.73 ± 0.025
GluTyr: 1.543 ± 0.037
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.926 ± 0.053
PheCys: 0.509 ± 0.018
PheAsp: 2.741 ± 0.046
PheGlu: 2.332 ± 0.04
PhePhe: 1.628 ± 0.039
PheGly: 3.434 ± 0.05
PheHis: 0.772 ± 0.023
PheIle: 1.93 ± 0.039
PheLys: 1.619 ± 0.037
PheLeu: 3.538 ± 0.057
PheMet: 0.779 ± 0.025
PheAsn: 1.695 ± 0.039
PhePro: 1.598 ± 0.037
PheGln: 1.384 ± 0.028
PheArg: 2.122 ± 0.038
PheSer: 2.877 ± 0.048
PheThr: 1.876 ± 0.035
PheVal: 2.72 ± 0.047
PheTrp: 0.598 ± 0.021
PheTyr: 1.269 ± 0.032
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.742 ± 0.104
GlyCys: 0.942 ± 0.029
GlyAsp: 4.468 ± 0.075
GlyGlu: 4.399 ± 0.057
GlyPhe: 3.48 ± 0.055
GlyGly: 6.282 ± 0.163
GlyHis: 1.652 ± 0.034
GlyIle: 4.349 ± 0.058
GlyLys: 3.731 ± 0.053
GlyLeu: 8.322 ± 0.074
GlyMet: 1.843 ± 0.043
GlyAsn: 2.897 ± 0.087
GlyPro: 2.107 ± 0.041
GlyGln: 3.14 ± 0.048
GlyArg: 4.55 ± 0.063
GlySer: 4.479 ± 0.099
GlyThr: 3.563 ± 0.107
GlyVal: 5.421 ± 0.067
GlyTrp: 1.218 ± 0.036
GlyTyr: 2.641 ± 0.044
GlyXaa: 0.001 ± 0.001
His
HisAla: 2.161 ± 0.036
HisCys: 0.348 ± 0.016
HisAsp: 1.22 ± 0.033
HisGlu: 1.051 ± 0.03
HisPhe: 1.05 ± 0.029
HisGly: 1.826 ± 0.034
HisHis: 0.649 ± 0.023
HisIle: 1.145 ± 0.033
HisLys: 0.752 ± 0.021
HisLeu: 2.323 ± 0.043
HisMet: 0.432 ± 0.018
HisAsn: 0.652 ± 0.022
HisPro: 1.335 ± 0.031
HisGln: 0.976 ± 0.029
HisArg: 1.409 ± 0.033
HisSer: 1.189 ± 0.031
HisThr: 0.932 ± 0.027
HisVal: 1.254 ± 0.028
HisTrp: 0.433 ± 0.017
HisTyr: 0.815 ± 0.025
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.6 ± 0.076
IleCys: 0.535 ± 0.022
IleAsp: 3.941 ± 0.057
IleGlu: 3.672 ± 0.059
IlePhe: 1.751 ± 0.034
IleGly: 4.612 ± 0.054
IleHis: 1.086 ± 0.029
IleIle: 2.245 ± 0.052
IleLys: 2.436 ± 0.04
IleLeu: 4.783 ± 0.072
IleMet: 0.855 ± 0.025
IleAsn: 2.197 ± 0.042
IlePro: 2.504 ± 0.047
IleGln: 1.859 ± 0.037
IleArg: 3.146 ± 0.045
IleSer: 3.309 ± 0.052
IleThr: 2.504 ± 0.052
IleVal: 3.97 ± 0.061
IleTrp: 0.614 ± 0.024
IleTyr: 1.449 ± 0.032
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.168 ± 0.07
LysCys: 0.274 ± 0.015
LysAsp: 1.993 ± 0.042
LysGlu: 1.848 ± 0.038
LysPhe: 1.243 ± 0.03
LysGly: 2.344 ± 0.042
LysHis: 1.027 ± 0.028
LysIle: 2.6 ± 0.048
LysLys: 1.86 ± 0.047
LysLeu: 4.765 ± 0.067
LysMet: 0.905 ± 0.028
LysAsn: 1.584 ± 0.038
LysPro: 2.397 ± 0.046
LysGln: 2.319 ± 0.048
LysArg: 2.372 ± 0.039
LysSer: 2.361 ± 0.044
LysThr: 2.61 ± 0.043
LysVal: 2.631 ± 0.052
LysTrp: 0.412 ± 0.019
LysTyr: 1.028 ± 0.027
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 13.416 ± 0.136
LeuCys: 1.005 ± 0.023
LeuAsp: 6.277 ± 0.062
LeuGlu: 5.875 ± 0.084
LeuPhe: 4.174 ± 0.056
LeuGly: 7.626 ± 0.068
LeuHis: 2.238 ± 0.044
LeuIle: 5.721 ± 0.07
LeuLys: 4.577 ± 0.063
LeuLeu: 11.976 ± 0.145
LeuMet: 2.162 ± 0.045
LeuAsn: 4.251 ± 0.058
LeuPro: 5.735 ± 0.069
LeuGln: 4.538 ± 0.064
LeuArg: 6.432 ± 0.078
LeuSer: 7.242 ± 0.071
LeuThr: 6.274 ± 0.082
LeuVal: 6.548 ± 0.075
LeuTrp: 1.191 ± 0.03
LeuTyr: 2.649 ± 0.048
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.436 ± 0.049
MetCys: 0.124 ± 0.01
MetAsp: 1.038 ± 0.025
MetGlu: 1.008 ± 0.029
MetPhe: 0.651 ± 0.024
MetGly: 1.314 ± 0.037
MetHis: 0.426 ± 0.018
MetIle: 1.06 ± 0.032
MetLys: 0.991 ± 0.031
MetLeu: 2.496 ± 0.047
MetMet: 0.501 ± 0.021
MetAsn: 0.934 ± 0.026
MetPro: 1.297 ± 0.03
MetGln: 0.938 ± 0.03
MetArg: 1.295 ± 0.035
MetSer: 1.396 ± 0.032
MetThr: 1.402 ± 0.03
MetVal: 1.289 ± 0.033
MetTrp: 0.149 ± 0.01
MetTyr: 0.403 ± 0.016
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.749 ± 0.066
AsnCys: 0.335 ± 0.015
AsnAsp: 2.01 ± 0.047
AsnGlu: 1.676 ± 0.033
AsnPhe: 1.287 ± 0.028
AsnGly: 2.959 ± 0.068
AsnHis: 0.828 ± 0.024
AsnIle: 2.002 ± 0.04
AsnLys: 1.347 ± 0.036
AsnLeu: 3.954 ± 0.058
AsnMet: 0.672 ± 0.023
AsnAsn: 1.362 ± 0.044
AsnPro: 2.35 ± 0.041
AsnGln: 1.662 ± 0.037
AsnArg: 2.427 ± 0.04
AsnSer: 2.016 ± 0.049
AsnThr: 1.816 ± 0.046
AsnVal: 2.344 ± 0.045
AsnTrp: 0.552 ± 0.019
AsnTyr: 1.081 ± 0.031
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.351 ± 0.076
ProCys: 0.37 ± 0.016
ProAsp: 3.164 ± 0.055
ProGlu: 3.366 ± 0.053
ProPhe: 1.771 ± 0.036
ProGly: 3.522 ± 0.053
ProHis: 0.915 ± 0.026
ProIle: 2.381 ± 0.041
ProLys: 1.678 ± 0.036
ProLeu: 4.768 ± 0.061
ProMet: 0.94 ± 0.026
ProAsn: 1.693 ± 0.034
ProPro: 2.02 ± 0.05
ProGln: 1.578 ± 0.039
ProArg: 2.133 ± 0.041
ProSer: 2.457 ± 0.042
ProThr: 2.29 ± 0.048
ProVal: 3.56 ± 0.063
ProTrp: 0.593 ± 0.02
ProTyr: 1.243 ± 0.033
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.589 ± 0.077
GlnCys: 0.358 ± 0.016
GlnAsp: 1.906 ± 0.036
GlnGlu: 1.771 ± 0.031
GlnPhe: 1.609 ± 0.032
GlnGly: 2.726 ± 0.043
GlnHis: 1.049 ± 0.025
GlnIle: 2.476 ± 0.042
GlnLys: 1.508 ± 0.035
GlnLeu: 4.654 ± 0.062
GlnMet: 0.864 ± 0.021
GlnAsn: 1.387 ± 0.03
GlnPro: 2.06 ± 0.04
GlnGln: 2.474 ± 0.055
GlnArg: 2.959 ± 0.052
GlnSer: 2.381 ± 0.042
GlnThr: 2.567 ± 0.039
GlnVal: 2.903 ± 0.043
GlnTrp: 0.573 ± 0.022
GlnTyr: 1.177 ± 0.029
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.211 ± 0.076
ArgCys: 0.624 ± 0.019
ArgAsp: 3.45 ± 0.053
ArgGlu: 3.545 ± 0.062
ArgPhe: 2.842 ± 0.051
ArgGly: 3.665 ± 0.054
ArgHis: 1.706 ± 0.038
ArgIle: 3.673 ± 0.047
ArgLys: 2.283 ± 0.036
ArgLeu: 7.167 ± 0.097
ArgMet: 1.42 ± 0.031
ArgAsn: 2.092 ± 0.039
ArgPro: 2.544 ± 0.046
ArgGln: 3.34 ± 0.057
ArgArg: 4.164 ± 0.065
ArgSer: 2.787 ± 0.045
ArgThr: 2.39 ± 0.039
ArgVal: 3.781 ± 0.055
ArgTrp: 0.968 ± 0.027
ArgTyr: 2.197 ± 0.051
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.492 ± 0.092
SerCys: 0.556 ± 0.022
SerAsp: 3.173 ± 0.047
SerGlu: 3.207 ± 0.056
SerPhe: 2.342 ± 0.044
SerGly: 5.519 ± 0.121
SerHis: 1.339 ± 0.03
SerIle: 2.917 ± 0.05
SerLys: 2.049 ± 0.044
SerLeu: 6.27 ± 0.076
SerMet: 1.093 ± 0.026
SerAsn: 2.013 ± 0.045
SerPro: 2.591 ± 0.04
SerGln: 2.284 ± 0.042
SerArg: 3.445 ± 0.047
SerSer: 3.355 ± 0.063
SerThr: 2.65 ± 0.041
SerVal: 4.282 ± 0.066
SerTrp: 0.731 ± 0.023
SerTyr: 1.673 ± 0.036
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.518 ± 0.072
ThrCys: 0.433 ± 0.016
ThrAsp: 2.908 ± 0.05
ThrGlu: 2.72 ± 0.045
ThrPhe: 1.774 ± 0.043
ThrGly: 4.575 ± 0.087
ThrHis: 1.027 ± 0.027
ThrIle: 2.681 ± 0.05
ThrLys: 1.537 ± 0.036
ThrLeu: 6.277 ± 0.1
ThrMet: 0.948 ± 0.026
ThrAsn: 1.535 ± 0.042
ThrPro: 2.891 ± 0.054
ThrGln: 1.897 ± 0.042
ThrArg: 2.487 ± 0.042
ThrSer: 2.578 ± 0.054
ThrThr: 2.534 ± 0.053
ThrVal: 4.035 ± 0.059
ThrTrp: 0.605 ± 0.022
ThrTyr: 1.276 ± 0.038
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.481 ± 0.082
ValCys: 0.749 ± 0.021
ValAsp: 4.016 ± 0.057
ValGlu: 4.259 ± 0.056
ValPhe: 2.807 ± 0.051
ValGly: 5.046 ± 0.063
ValHis: 1.197 ± 0.033
ValIle: 3.614 ± 0.059
ValLys: 2.944 ± 0.046
ValLeu: 7.009 ± 0.069
ValMet: 1.579 ± 0.037
ValAsn: 2.613 ± 0.045
ValPro: 2.827 ± 0.043
ValGln: 2.163 ± 0.042
ValArg: 3.565 ± 0.052
ValSer: 4.287 ± 0.059
ValThr: 3.601 ± 0.055
ValVal: 5.125 ± 0.065
ValTrp: 0.919 ± 0.022
ValTyr: 1.918 ± 0.033
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.006 ± 0.028
TrpCys: 0.138 ± 0.01
TrpAsp: 0.673 ± 0.022
TrpGlu: 0.605 ± 0.024
TrpPhe: 0.616 ± 0.019
TrpGly: 0.804 ± 0.026
TrpHis: 0.389 ± 0.015
TrpIle: 0.644 ± 0.023
TrpLys: 0.421 ± 0.018
TrpLeu: 2.279 ± 0.049
TrpMet: 0.281 ± 0.016
TrpAsn: 0.495 ± 0.021
TrpPro: 0.687 ± 0.026
TrpGln: 0.898 ± 0.026
TrpArg: 1.186 ± 0.037
TrpSer: 0.813 ± 0.023
TrpThr: 0.611 ± 0.023
TrpVal: 0.821 ± 0.022
TrpTrp: 0.213 ± 0.012
TrpTyr: 0.362 ± 0.015
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.733 ± 0.044
TyrCys: 0.371 ± 0.016
TyrAsp: 1.636 ± 0.05
TyrGlu: 1.254 ± 0.031
TyrPhe: 1.346 ± 0.032
TyrGly: 2.215 ± 0.042
TyrHis: 0.662 ± 0.022
TyrIle: 1.195 ± 0.032
TyrLys: 0.941 ± 0.028
TyrLeu: 3.368 ± 0.042
TyrMet: 0.477 ± 0.022
TyrAsn: 0.94 ± 0.028
TyrPro: 1.399 ± 0.032
TyrGln: 1.576 ± 0.032
TyrArg: 2.524 ± 0.055
TyrSer: 1.799 ± 0.036
TyrThr: 1.338 ± 0.035
TyrVal: 1.714 ± 0.038
TyrTrp: 0.551 ± 0.022
TyrTyr: 0.929 ± 0.028
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.001 ± 0.001
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.001 ± 0.002
Statistics based on 4433 proteins (1481426 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
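A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. This is only an illustration under stated assumptions. The function names are hypothetical, one-letter codes are used for brevity, and reporting the standard deviation of the resampled estimates is an assumption, since the page does not say which statistic the ± value represents.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(sequence):
    """Count overlapping adjacent residue pairs (one-letter codes)."""
    return Counter(sequence[i:i + 2] for i in range(len(sequence) - 1))


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Estimate the error of one dipeptide frequency (per mille).

    Proteins, not individual dipeptides, are the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the error is the standard deviation across resamples.
    return stdev(estimates)


# Hypothetical toy sequences, not taken from the proteome.
toy = ["MAALAA", "MKLLV", "MAAG", "MLLAAK"]
print(round(bootstrap_error(toy, "AA"), 3))
```

Resampling whole proteins preserves the correlation between dipeptides within a protein, which resampling individual dipeptides would ignore.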

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski