Amino acid dipeptide frequency for Pyrinomonas methylaliphatogenes

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies, that is, how often each ordered pair of adjacent residues occurs.
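As an illustration of the idea, here is a minimal Python sketch that counts overlapping adjacent residue pairs in a set of protein sequences and reports them in per mille. The function name dipeptide_permille, the toy sequences, and the choice to map every non-standard letter to X (Xaa) are assumptions made for this example; the exact counting procedure used to build this page is not described here.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping adjacent pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: any non-standard or unknown letter is treated as X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Pairs are counted within a protein only, never across protein boundaries.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MAAL", "ALX"]
freqs = dipeptide_permille(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

The sketch uses one-letter codes, whereas the table uses three-letter codes (for example, AL corresponds to AlaLeu).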

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
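The comparison against the uniform expectation can be sketched in a few lines of Python. The sample values are copied from the table below; the classify helper and the use of exactly 1/400 as the threshold are assumptions based on the description above, not a statement of how the page colours its cells.

```python
# Uniform expectation for 20 x 20 dipeptides, expressed in per mille.
EXPECTED_PERMILLE = 1000.0 / (20 * 20)  # 0.25% = 2.5 per mille


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide as over- or underrepresented relative to the expectation."""
    if value_permille > expected:
        return "red"   # more frequent than expected
    if value_permille < expected:
        return "blue"  # less frequent than expected
    return "neutral"


# A few values taken from the table (per mille).
sample = {"AlaAla": 12.523, "CysCys": 0.146, "AspAsp": 2.733, "TrpTrp": 0.251}
labels = {pair: classify(value) for pair, value in sample.items()}
for pair, label in labels.items():
    print(pair, sample[pair], label)
```

Note that a uniform expectation ignores the very different abundances of the individual amino acids, so a rare residue such as Cys or Trp will appear underrepresented in almost every pair.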

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
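For readers who want to work with the numbers directly, the unit conversion looks like this in Python. The helper names are hypothetical, and the example value is the AlaAla entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 12.523  # AlaAla frequency from the table, in per mille
print(permille_to_fraction(ala_ala))
print(permille_to_percent(ala_ala))
```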

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 12.523 ± 0.146
AlaCys: 1.045 ± 0.03
AlaAsp: 4.941 ± 0.064
AlaGlu: 8.129 ± 0.108
AlaPhe: 3.834 ± 0.062
AlaGly: 7.434 ± 0.1
AlaHis: 2.181 ± 0.044
AlaIle: 6.777 ± 0.095
AlaLys: 3.299 ± 0.057
AlaLeu: 12.672 ± 0.134
AlaMet: 2.21 ± 0.05
AlaAsn: 2.794 ± 0.062
AlaPro: 4.761 ± 0.079
AlaGln: 4.403 ± 0.076
AlaArg: 11.441 ± 0.119
AlaSer: 5.784 ± 0.084
AlaThr: 5.259 ± 0.078
AlaVal: 7.004 ± 0.092
AlaTrp: 1.208 ± 0.037
AlaTyr: 2.57 ± 0.054
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.096 ± 0.032
CysCys: 0.146 ± 0.013
CysAsp: 0.514 ± 0.021
CysGlu: 0.598 ± 0.024
CysPhe: 0.384 ± 0.019
CysGly: 1.002 ± 0.037
CysHis: 0.212 ± 0.016
CysIle: 0.371 ± 0.018
CysLys: 0.173 ± 0.012
CysLeu: 0.827 ± 0.029
CysMet: 0.13 ± 0.01
CysAsn: 0.202 ± 0.015
CysPro: 0.455 ± 0.021
CysGln: 0.254 ± 0.015
CysArg: 0.709 ± 0.025
CysSer: 0.457 ± 0.02
CysThr: 0.315 ± 0.017
CysVal: 0.611 ± 0.025
CysTrp: 0.129 ± 0.01
CysTyr: 0.257 ± 0.017
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.079 ± 0.074
AspCys: 0.447 ± 0.024
AspAsp: 2.733 ± 0.045
AspGlu: 4.533 ± 0.075
AspPhe: 2.164 ± 0.047
AspGly: 4.657 ± 0.089
AspHis: 0.98 ± 0.031
AspIle: 2.244 ± 0.047
AspLys: 1.324 ± 0.037
AspLeu: 5.822 ± 0.067
AspMet: 0.746 ± 0.026
AspAsn: 0.985 ± 0.033
AspPro: 3.294 ± 0.063
AspGln: 1.639 ± 0.047
AspArg: 4.181 ± 0.056
AspSer: 1.884 ± 0.043
AspThr: 1.723 ± 0.043
AspVal: 3.57 ± 0.059
AspTrp: 0.904 ± 0.034
AspTyr: 1.642 ± 0.038
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.474 ± 0.107
GluCys: 0.441 ± 0.022
GluAsp: 3.057 ± 0.054
GluGlu: 5.654 ± 0.102
GluPhe: 2.341 ± 0.049
GluGly: 4.45 ± 0.064
GluHis: 1.25 ± 0.035
GluIle: 4.711 ± 0.086
GluLys: 2.683 ± 0.061
GluLeu: 7.43 ± 0.096
GluMet: 1.78 ± 0.038
GluAsn: 1.759 ± 0.036
GluPro: 2.669 ± 0.046
GluGln: 2.41 ± 0.052
GluArg: 8.77 ± 0.111
GluSer: 2.974 ± 0.057
GluThr: 3.582 ± 0.052
GluVal: 4.838 ± 0.078
GluTrp: 0.85 ± 0.027
GluTyr: 1.701 ± 0.043
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.51 ± 0.071
PheCys: 0.399 ± 0.019
PheAsp: 2.68 ± 0.046
PheGlu: 2.382 ± 0.046
PhePhe: 1.781 ± 0.053
PheGly: 3.556 ± 0.063
PheHis: 0.756 ± 0.028
PheIle: 2.275 ± 0.047
PheLys: 1.217 ± 0.033
PheLeu: 3.422 ± 0.064
PheMet: 0.623 ± 0.023
PheAsn: 1.481 ± 0.048
PhePro: 1.633 ± 0.038
PheGln: 1.026 ± 0.033
PheArg: 2.809 ± 0.046
PheSer: 2.293 ± 0.057
PheThr: 2.042 ± 0.044
PheVal: 2.719 ± 0.05
PheTrp: 0.514 ± 0.021
PheTyr: 1.285 ± 0.037
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.084 ± 0.087
GlyCys: 0.732 ± 0.025
GlyAsp: 3.681 ± 0.067
GlyGlu: 5.446 ± 0.076
GlyPhe: 3.121 ± 0.061
GlyGly: 6.173 ± 0.094
GlyHis: 1.468 ± 0.035
GlyIle: 4.302 ± 0.06
GlyLys: 2.702 ± 0.056
GlyLeu: 7.146 ± 0.083
GlyMet: 1.648 ± 0.037
GlyAsn: 1.882 ± 0.046
GlyPro: 2.76 ± 0.057
GlyGln: 2.554 ± 0.054
GlyArg: 6.885 ± 0.092
GlySer: 4.045 ± 0.073
GlyThr: 4.144 ± 0.067
GlyVal: 5.476 ± 0.084
GlyTrp: 1.221 ± 0.033
GlyTyr: 2.339 ± 0.051
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.905 ± 0.039
HisCys: 0.164 ± 0.013
HisAsp: 0.91 ± 0.031
HisGlu: 1.256 ± 0.038
HisPhe: 0.767 ± 0.029
HisGly: 1.518 ± 0.038
HisHis: 0.532 ± 0.022
HisIle: 0.964 ± 0.032
HisLys: 0.476 ± 0.021
HisLeu: 2.181 ± 0.05
HisMet: 0.306 ± 0.017
HisAsn: 0.526 ± 0.022
HisPro: 1.294 ± 0.036
HisGln: 0.58 ± 0.023
HisArg: 1.561 ± 0.036
HisSer: 0.832 ± 0.026
HisThr: 0.88 ± 0.028
HisVal: 1.257 ± 0.035
HisTrp: 0.256 ± 0.017
HisTyr: 0.624 ± 0.021
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.421 ± 0.087
IleCys: 0.564 ± 0.025
IleAsp: 3.093 ± 0.055
IleGlu: 5.299 ± 0.081
IlePhe: 2.483 ± 0.052
IleGly: 4.865 ± 0.072
IleHis: 0.934 ± 0.03
IleIle: 3.687 ± 0.069
IleLys: 2.003 ± 0.046
IleLeu: 4.52 ± 0.078
IleMet: 0.855 ± 0.025
IleAsn: 1.982 ± 0.055
IlePro: 2.637 ± 0.047
IleGln: 1.28 ± 0.036
IleArg: 3.794 ± 0.06
IleSer: 3.271 ± 0.054
IleThr: 2.941 ± 0.057
IleVal: 4.752 ± 0.074
IleTrp: 0.586 ± 0.024
IleTyr: 1.743 ± 0.037
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.873 ± 0.059
LysCys: 0.208 ± 0.015
LysAsp: 1.483 ± 0.042
LysGlu: 2.179 ± 0.052
LysPhe: 0.94 ± 0.032
LysGly: 2.167 ± 0.045
LysHis: 0.582 ± 0.026
LysIle: 2.079 ± 0.047
LysLys: 1.193 ± 0.042
LysLeu: 3.544 ± 0.065
LysMet: 0.755 ± 0.028
LysAsn: 0.903 ± 0.03
LysPro: 1.519 ± 0.039
LysGln: 1.05 ± 0.035
LysArg: 3.457 ± 0.065
LysSer: 1.484 ± 0.036
LysThr: 1.579 ± 0.039
LysVal: 2.19 ± 0.047
LysTrp: 0.312 ± 0.018
LysTyr: 0.829 ± 0.028
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 12.034 ± 0.12
LeuCys: 1.048 ± 0.031
LeuAsp: 5.469 ± 0.082
LeuGlu: 6.49 ± 0.095
LeuPhe: 4.068 ± 0.066
LeuGly: 7.173 ± 0.092
LeuHis: 1.804 ± 0.043
LeuIle: 6.292 ± 0.081
LeuLys: 3.329 ± 0.064
LeuLeu: 10.627 ± 0.133
LeuMet: 1.798 ± 0.041
LeuAsn: 2.928 ± 0.062
LeuPro: 5.291 ± 0.071
LeuGln: 3.036 ± 0.062
LeuArg: 9.546 ± 0.118
LeuSer: 6.066 ± 0.087
LeuThr: 5.146 ± 0.067
LeuVal: 6.752 ± 0.091
LeuTrp: 1.23 ± 0.039
LeuTyr: 2.456 ± 0.05
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.97 ± 0.048
MetCys: 0.151 ± 0.013
MetAsp: 0.767 ± 0.028
MetGlu: 1.083 ± 0.033
MetPhe: 0.53 ± 0.022
MetGly: 1.384 ± 0.038
MetHis: 0.298 ± 0.016
MetIle: 1.222 ± 0.033
MetLys: 0.869 ± 0.028
MetLeu: 1.992 ± 0.042
MetMet: 0.456 ± 0.02
MetAsn: 0.668 ± 0.023
MetPro: 0.927 ± 0.03
MetGln: 0.605 ± 0.025
MetArg: 2.234 ± 0.042
MetSer: 1.24 ± 0.037
MetThr: 1.126 ± 0.034
MetVal: 1.206 ± 0.034
MetTrp: 0.164 ± 0.011
MetTyr: 0.262 ± 0.016
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.93 ± 0.065
AsnCys: 0.267 ± 0.016
AsnAsp: 1.404 ± 0.041
AsnGlu: 1.772 ± 0.038
AsnPhe: 1.284 ± 0.046
AsnGly: 2.412 ± 0.055
AsnHis: 0.46 ± 0.018
AsnIle: 1.5 ± 0.039
AsnLys: 0.745 ± 0.03
AsnLeu: 3.099 ± 0.064
AsnMet: 0.483 ± 0.021
AsnAsn: 0.909 ± 0.043
AsnPro: 1.937 ± 0.052
AsnGln: 0.89 ± 0.035
AsnArg: 2.308 ± 0.053
AsnSer: 1.363 ± 0.036
AsnThr: 1.18 ± 0.038
AsnVal: 2.178 ± 0.047
AsnTrp: 0.374 ± 0.017
AsnTyr: 0.971 ± 0.035
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.278 ± 0.067
ProCys: 0.373 ± 0.019
ProAsp: 2.888 ± 0.058
ProGlu: 3.859 ± 0.067
ProPhe: 2.132 ± 0.046
ProGly: 3.37 ± 0.064
ProHis: 1.045 ± 0.03
ProIle: 2.622 ± 0.056
ProLys: 1.488 ± 0.037
ProLeu: 4.688 ± 0.06
ProMet: 0.826 ± 0.025
ProAsn: 1.709 ± 0.047
ProPro: 2.624 ± 0.061
ProGln: 1.976 ± 0.046
ProArg: 3.513 ± 0.067
ProSer: 2.767 ± 0.049
ProThr: 2.694 ± 0.056
ProVal: 3.568 ± 0.068
ProTrp: 0.49 ± 0.021
ProTyr: 1.361 ± 0.038
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.638 ± 0.071
GlnCys: 0.188 ± 0.012
GlnAsp: 1.182 ± 0.031
GlnGlu: 1.826 ± 0.046
GlnPhe: 1.12 ± 0.032
GlnGly: 2.067 ± 0.042
GlnHis: 0.574 ± 0.024
GlnIle: 2.275 ± 0.045
GlnLys: 1.169 ± 0.036
GlnLeu: 3.421 ± 0.061
GlnMet: 0.821 ± 0.03
GlnAsn: 0.941 ± 0.031
GlnPro: 1.671 ± 0.043
GlnGln: 1.337 ± 0.043
GlnArg: 3.278 ± 0.06
GlnSer: 1.654 ± 0.04
GlnThr: 2.091 ± 0.047
GlnVal: 2.103 ± 0.05
GlnTrp: 0.342 ± 0.018
GlnTyr: 0.774 ± 0.028
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 11.311 ± 0.12
ArgCys: 0.718 ± 0.027
ArgAsp: 4.571 ± 0.07
ArgGlu: 6.993 ± 0.08
ArgPhe: 3.879 ± 0.066
ArgGly: 5.979 ± 0.079
ArgHis: 1.718 ± 0.042
ArgIle: 5.18 ± 0.063
ArgLys: 2.446 ± 0.052
ArgLeu: 9.772 ± 0.107
ArgMet: 1.88 ± 0.037
ArgAsn: 2.386 ± 0.046
ArgPro: 3.682 ± 0.058
ArgGln: 2.95 ± 0.059
ArgArg: 8.78 ± 0.118
ArgSer: 4.537 ± 0.066
ArgThr: 4.184 ± 0.06
ArgVal: 6.372 ± 0.086
ArgTrp: 1.204 ± 0.034
ArgTyr: 2.633 ± 0.05
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.534 ± 0.075
SerCys: 0.474 ± 0.021
SerAsp: 2.791 ± 0.051
SerGlu: 3.234 ± 0.054
SerPhe: 2.556 ± 0.054
SerGly: 4.626 ± 0.074
SerHis: 0.918 ± 0.027
SerIle: 2.765 ± 0.052
SerLys: 1.449 ± 0.036
SerLeu: 5.788 ± 0.078
SerMet: 0.919 ± 0.03
SerAsn: 1.571 ± 0.041
SerPro: 2.884 ± 0.06
SerGln: 1.464 ± 0.042
SerArg: 3.732 ± 0.056
SerSer: 3.431 ± 0.067
SerThr: 2.51 ± 0.051
SerVal: 3.73 ± 0.06
SerTrp: 0.682 ± 0.026
SerTyr: 1.548 ± 0.043
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.909 ± 0.07
ThrCys: 0.391 ± 0.019
ThrAsp: 2.671 ± 0.052
ThrGlu: 3.063 ± 0.053
ThrPhe: 1.97 ± 0.054
ThrGly: 4.581 ± 0.068
ThrHis: 0.979 ± 0.026
ThrIle: 3.161 ± 0.057
ThrLys: 1.409 ± 0.034
ThrLeu: 5.295 ± 0.074
ThrMet: 0.829 ± 0.029
ThrAsn: 1.514 ± 0.045
ThrPro: 3.022 ± 0.055
ThrGln: 1.395 ± 0.041
ThrArg: 3.633 ± 0.062
ThrSer: 2.714 ± 0.06
ThrThr: 2.59 ± 0.06
ThrVal: 3.957 ± 0.066
ThrTrp: 0.538 ± 0.023
ThrTyr: 1.283 ± 0.031
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.773 ± 0.093
ValCys: 0.719 ± 0.023
ValAsp: 3.396 ± 0.053
ValGlu: 5.467 ± 0.086
ValPhe: 2.418 ± 0.056
ValGly: 5.24 ± 0.082
ValHis: 1.209 ± 0.034
ValIle: 4.385 ± 0.067
ValLys: 2.216 ± 0.046
ValLeu: 6.155 ± 0.08
ValMet: 1.396 ± 0.039
ValAsn: 2.113 ± 0.051
ValPro: 3.426 ± 0.064
ValGln: 2.136 ± 0.049
ValArg: 6.545 ± 0.08
ValSer: 3.797 ± 0.054
ValThr: 3.817 ± 0.066
ValVal: 5.113 ± 0.068
ValTrp: 0.823 ± 0.03
ValTyr: 1.861 ± 0.043
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.093 ± 0.031
TrpCys: 0.105 ± 0.01
TrpAsp: 0.565 ± 0.026
TrpGlu: 0.709 ± 0.028
TrpPhe: 0.468 ± 0.023
TrpGly: 0.817 ± 0.027
TrpHis: 0.285 ± 0.017
TrpIle: 0.634 ± 0.03
TrpLys: 0.386 ± 0.019
TrpLeu: 1.337 ± 0.04
TrpMet: 0.289 ± 0.017
TrpAsn: 0.367 ± 0.02
TrpPro: 0.537 ± 0.021
TrpGln: 0.602 ± 0.023
TrpArg: 1.487 ± 0.04
TrpSer: 0.697 ± 0.027
TrpThr: 0.78 ± 0.028
TrpVal: 0.697 ± 0.021
TrpTrp: 0.251 ± 0.018
TrpTyr: 0.351 ± 0.019
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.833 ± 0.052
TyrCys: 0.263 ± 0.019
TyrAsp: 1.755 ± 0.059
TyrGlu: 1.853 ± 0.039
TyrPhe: 1.162 ± 0.03
TyrGly: 2.334 ± 0.05
TyrHis: 0.605 ± 0.025
TyrIle: 1.264 ± 0.033
TyrLys: 0.747 ± 0.032
TyrLeu: 2.847 ± 0.054
TyrMet: 0.404 ± 0.019
TyrAsn: 0.844 ± 0.03
TyrPro: 1.247 ± 0.031
TyrGln: 0.888 ± 0.031
TyrArg: 2.512 ± 0.045
TyrSer: 1.318 ± 0.034
TyrThr: 1.321 ± 0.04
TyrVal: 1.912 ± 0.045
TyrTrp: 0.378 ± 0.018
TyrTyr: 0.906 ± 0.035
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3179 proteins (1112582 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
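A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate estimates serves as the error. This is only an illustration under stated assumptions. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the reported error are all hypothetical choices, and the page does not specify these details beyond 100 replicates at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy proteome (hypothetical sequences in one-letter code).
proteins = ["MAALAAG", "MKAAR", "MLLAAV", "MGGS"]
mean, err = bootstrap_error(proteins, "AA")
print(f"AA: {mean:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual residue pairs keeps the pairs within a protein together, so the estimated error reflects variation between proteins.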

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski