Amino acid dipepetide frequency for Kosmotoga pacifica

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.754AlaAla: 4.754 ± 0.099
0.483AlaCys: 0.483 ± 0.027
2.788AlaAsp: 2.788 ± 0.077
4.243AlaGlu: 4.243 ± 0.105
3.823AlaPhe: 3.823 ± 0.073
5.045AlaGly: 5.045 ± 0.097
1.005AlaHis: 1.005 ± 0.045
5.77AlaIle: 5.77 ± 0.107
4.13AlaLys: 4.13 ± 0.09
7.392AlaLeu: 7.392 ± 0.106
1.787AlaMet: 1.787 ± 0.058
2.118AlaAsn: 2.118 ± 0.06
1.815AlaPro: 1.815 ± 0.054
1.429AlaGln: 1.429 ± 0.056
3.342AlaArg: 3.342 ± 0.087
3.871AlaSer: 3.871 ± 0.087
3.189AlaThr: 3.189 ± 0.085
5.407AlaVal: 5.407 ± 0.1
0.577AlaTrp: 0.577 ± 0.031
2.199AlaTyr: 2.199 ± 0.059
0.0AlaXaa: 0.0 ± 0.0
Cys
0.395CysAla: 0.395 ± 0.031
0.058CysCys: 0.058 ± 0.009
0.375CysAsp: 0.375 ± 0.023
0.428CysGlu: 0.428 ± 0.026
0.281CysPhe: 0.281 ± 0.021
0.805CysGly: 0.805 ± 0.042
0.185CysHis: 0.185 ± 0.023
0.475CysIle: 0.475 ± 0.032
0.417CysLys: 0.417 ± 0.026
0.479CysLeu: 0.479 ± 0.028
0.126CysMet: 0.126 ± 0.015
0.27CysAsn: 0.27 ± 0.023
0.466CysPro: 0.466 ± 0.032
0.152CysGln: 0.152 ± 0.016
0.337CysArg: 0.337 ± 0.022
0.489CysSer: 0.489 ± 0.034
0.344CysThr: 0.344 ± 0.026
0.438CysVal: 0.438 ± 0.027
0.079CysTrp: 0.079 ± 0.012
0.246CysTyr: 0.246 ± 0.022
0.0CysXaa: 0.0 ± 0.0
Asp
2.971AspAla: 2.971 ± 0.079
0.367AspCys: 0.367 ± 0.023
2.182AspAsp: 2.182 ± 0.06
4.0AspGlu: 4.0 ± 0.081
3.022AspPhe: 3.022 ± 0.072
3.548AspGly: 3.548 ± 0.087
0.761AspHis: 0.761 ± 0.04
4.34AspIle: 4.34 ± 0.092
2.642AspLys: 2.642 ± 0.079
4.773AspLeu: 4.773 ± 0.086
1.204AspMet: 1.204 ± 0.05
1.85AspAsn: 1.85 ± 0.061
2.449AspPro: 2.449 ± 0.067
0.84AspGln: 0.84 ± 0.039
2.555AspArg: 2.555 ± 0.063
2.917AspSer: 2.917 ± 0.073
2.33AspThr: 2.33 ± 0.071
3.624AspVal: 3.624 ± 0.072
0.645AspTrp: 0.645 ± 0.035
2.204AspTyr: 2.204 ± 0.064
0.0AspXaa: 0.0 ± 0.0
Glu
5.006GluAla: 5.006 ± 0.1
0.374GluCys: 0.374 ± 0.025
3.413GluAsp: 3.413 ± 0.078
6.969GluGlu: 6.969 ± 0.142
3.555GluPhe: 3.555 ± 0.086
4.59GluGly: 4.59 ± 0.089
1.161GluHis: 1.161 ± 0.037
6.913GluIle: 6.913 ± 0.113
7.386GluLys: 7.386 ± 0.104
9.056GluLeu: 9.056 ± 0.148
2.011GluMet: 2.011 ± 0.058
3.57GluAsn: 3.57 ± 0.07
2.216GluPro: 2.216 ± 0.07
1.541GluGln: 1.541 ± 0.062
4.026GluArg: 4.026 ± 0.08
3.705GluSer: 3.705 ± 0.085
3.421GluThr: 3.421 ± 0.074
5.466GluVal: 5.466 ± 0.095
0.685GluTrp: 0.685 ± 0.039
2.746GluTyr: 2.746 ± 0.072
0.0GluXaa: 0.0 ± 0.0
Phe
3.201PheAla: 3.201 ± 0.084
0.387PheCys: 0.387 ± 0.027
2.877PheAsp: 2.877 ± 0.067
3.79PheGlu: 3.79 ± 0.081
2.85PhePhe: 2.85 ± 0.094
3.866PheGly: 3.866 ± 0.092
0.762PheHis: 0.762 ± 0.036
3.776PheIle: 3.776 ± 0.098
3.137PheLys: 3.137 ± 0.077
5.534PheLeu: 5.534 ± 0.115
1.296PheMet: 1.296 ± 0.047
2.11PheAsn: 2.11 ± 0.058
2.068PhePro: 2.068 ± 0.062
1.141PheGln: 1.141 ± 0.044
2.211PheArg: 2.211 ± 0.062
4.18PheSer: 4.18 ± 0.102
2.419PheThr: 2.419 ± 0.068
3.649PheVal: 3.649 ± 0.075
0.613PheTrp: 0.613 ± 0.04
1.887PheTyr: 1.887 ± 0.065
0.0PheXaa: 0.0 ± 0.0
Gly
4.56GlyAla: 4.56 ± 0.097
0.587GlyCys: 0.587 ± 0.032
3.282GlyAsp: 3.282 ± 0.086
4.873GlyGlu: 4.873 ± 0.08
3.889GlyPhe: 3.889 ± 0.085
4.959GlyGly: 4.959 ± 0.101
1.243GlyHis: 1.243 ± 0.051
6.819GlyIle: 6.819 ± 0.105
5.827GlyLys: 5.827 ± 0.105
6.637GlyLeu: 6.637 ± 0.117
2.037GlyMet: 2.037 ± 0.056
2.821GlyAsn: 2.821 ± 0.073
1.89GlyPro: 1.89 ± 0.065
1.455GlyGln: 1.455 ± 0.047
3.188GlyArg: 3.188 ± 0.078
3.943GlySer: 3.943 ± 0.082
3.967GlyThr: 3.967 ± 0.094
5.246GlyVal: 5.246 ± 0.102
0.734GlyTrp: 0.734 ± 0.04
3.179GlyTyr: 3.179 ± 0.072
0.0GlyXaa: 0.0 ± 0.0
His
0.952HisAla: 0.952 ± 0.037
0.16HisCys: 0.16 ± 0.017
0.774HisAsp: 0.774 ± 0.036
1.075HisGlu: 1.075 ± 0.042
0.866HisPhe: 0.866 ± 0.037
1.313HisGly: 1.313 ± 0.053
0.344HisHis: 0.344 ± 0.024
1.238HisIle: 1.238 ± 0.048
0.812HisLys: 0.812 ± 0.037
1.597HisLeu: 1.597 ± 0.046
0.359HisMet: 0.359 ± 0.023
0.661HisAsn: 0.661 ± 0.035
0.947HisPro: 0.947 ± 0.042
0.372HisGln: 0.372 ± 0.028
0.845HisArg: 0.845 ± 0.037
0.928HisSer: 0.928 ± 0.038
0.785HisThr: 0.785 ± 0.035
1.032HisVal: 1.032 ± 0.048
0.228HisTrp: 0.228 ± 0.022
0.704HisTyr: 0.704 ± 0.029
0.0HisXaa: 0.0 ± 0.0
Ile
6.075IleAla: 6.075 ± 0.106
0.522IleCys: 0.522 ± 0.031
4.712IleAsp: 4.712 ± 0.1
6.624IleGlu: 6.624 ± 0.109
4.046IlePhe: 4.046 ± 0.097
5.504IleGly: 5.504 ± 0.105
1.146IleHis: 1.146 ± 0.046
6.111IleIle: 6.111 ± 0.085
5.195IleLys: 5.195 ± 0.091
7.94IleLeu: 7.94 ± 0.141
1.724IleMet: 1.724 ± 0.057
2.999IleAsn: 2.999 ± 0.063
3.928IlePro: 3.928 ± 0.078
1.624IleGln: 1.624 ± 0.05
3.588IleArg: 3.588 ± 0.074
5.782IleSer: 5.782 ± 0.115
4.373IleThr: 4.373 ± 0.099
6.073IleVal: 6.073 ± 0.099
0.718IleTrp: 0.718 ± 0.038
2.784IleTyr: 2.784 ± 0.071
0.0IleXaa: 0.0 ± 0.0
Lys
5.006LysAla: 5.006 ± 0.101
0.44LysCys: 0.44 ± 0.027
3.553LysAsp: 3.553 ± 0.07
6.647LysGlu: 6.647 ± 0.125
2.735LysPhe: 2.735 ± 0.071
4.729LysGly: 4.729 ± 0.106
1.227LysHis: 1.227 ± 0.055
5.84LysIle: 5.84 ± 0.091
6.334LysLys: 6.334 ± 0.102
7.358LysLeu: 7.358 ± 0.127
1.908LysMet: 1.908 ± 0.056
3.198LysAsn: 3.198 ± 0.081
2.422LysPro: 2.422 ± 0.057
1.491LysGln: 1.491 ± 0.05
4.077LysArg: 4.077 ± 0.095
3.654LysSer: 3.654 ± 0.071
3.424LysThr: 3.424 ± 0.077
5.23LysVal: 5.23 ± 0.09
0.632LysTrp: 0.632 ± 0.032
2.677LysTyr: 2.677 ± 0.063
0.0LysXaa: 0.0 ± 0.0
Leu
6.402LeuAla: 6.402 ± 0.1
0.627LeuCys: 0.627 ± 0.037
4.864LeuAsp: 4.864 ± 0.087
7.835LeuGlu: 7.835 ± 0.137
5.304LeuPhe: 5.304 ± 0.13
6.906LeuGly: 6.906 ± 0.115
1.495LeuHis: 1.495 ± 0.054
7.576LeuIle: 7.576 ± 0.131
9.32LeuLys: 9.32 ± 0.127
10.688LeuLeu: 10.688 ± 0.183
2.593LeuMet: 2.593 ± 0.065
4.464LeuAsn: 4.464 ± 0.092
4.167LeuPro: 4.167 ± 0.091
2.227LeuGln: 2.227 ± 0.063
4.866LeuArg: 4.866 ± 0.098
7.807LeuSer: 7.807 ± 0.133
4.997LeuThr: 4.997 ± 0.098
6.595LeuVal: 6.595 ± 0.104
0.997LeuTrp: 0.997 ± 0.043
3.474LeuTyr: 3.474 ± 0.072
0.0LeuXaa: 0.0 ± 0.0
Met
1.817MetAla: 1.817 ± 0.055
0.142MetCys: 0.142 ± 0.017
1.21MetAsp: 1.21 ± 0.045
1.83MetGlu: 1.83 ± 0.058
0.914MetPhe: 0.914 ± 0.045
1.764MetGly: 1.764 ± 0.058
0.334MetHis: 0.334 ± 0.022
1.93MetIle: 1.93 ± 0.059
2.43MetLys: 2.43 ± 0.071
2.508MetLeu: 2.508 ± 0.078
0.622MetMet: 0.622 ± 0.038
1.232MetAsn: 1.232 ± 0.049
0.926MetPro: 0.926 ± 0.046
0.435MetGln: 0.435 ± 0.028
1.271MetArg: 1.271 ± 0.049
1.402MetSer: 1.402 ± 0.042
1.217MetThr: 1.217 ± 0.046
1.804MetVal: 1.804 ± 0.052
0.184MetTrp: 0.184 ± 0.018
0.668MetTyr: 0.668 ± 0.03
0.0MetXaa: 0.0 ± 0.0
Asn
2.74AsnAla: 2.74 ± 0.074
0.322AsnCys: 0.322 ± 0.027
1.832AsnAsp: 1.832 ± 0.056
2.75AsnGlu: 2.75 ± 0.067
2.247AsnPhe: 2.247 ± 0.064
2.857AsnGly: 2.857 ± 0.073
0.686AsnHis: 0.686 ± 0.032
3.418AsnIle: 3.418 ± 0.078
2.128AsnLys: 2.128 ± 0.062
4.14AsnLeu: 4.14 ± 0.086
0.903AsnMet: 0.903 ± 0.04
1.615AsnAsn: 1.615 ± 0.059
2.182AsnPro: 2.182 ± 0.067
0.813AsnGln: 0.813 ± 0.036
2.002AsnArg: 2.002 ± 0.062
2.498AsnSer: 2.498 ± 0.069
1.944AsnThr: 1.944 ± 0.058
2.895AsnVal: 2.895 ± 0.074
0.522AsnTrp: 0.522 ± 0.033
1.629AsnTyr: 1.629 ± 0.049
0.0AsnXaa: 0.0 ± 0.0
Pro
2.211ProAla: 2.211 ± 0.068
0.2ProCys: 0.2 ± 0.019
2.531ProAsp: 2.531 ± 0.074
4.143ProGlu: 4.143 ± 0.085
2.115ProPhe: 2.115 ± 0.067
3.057ProGly: 3.057 ± 0.082
0.719ProHis: 0.719 ± 0.032
2.602ProIle: 2.602 ± 0.071
2.146ProLys: 2.146 ± 0.06
3.614ProLeu: 3.614 ± 0.072
0.838ProMet: 0.838 ± 0.036
1.349ProAsn: 1.349 ± 0.048
1.283ProPro: 1.283 ± 0.04
1.009ProGln: 1.009 ± 0.039
1.476ProArg: 1.476 ± 0.053
2.265ProSer: 2.265 ± 0.059
1.882ProThr: 1.882 ± 0.055
3.337ProVal: 3.337 ± 0.069
0.392ProTrp: 0.392 ± 0.028
1.523ProTyr: 1.523 ± 0.057
0.0ProXaa: 0.0 ± 0.0
Gln
1.392GlnAla: 1.392 ± 0.054
0.149GlnCys: 0.149 ± 0.018
0.921GlnAsp: 0.921 ± 0.043
1.652GlnGlu: 1.652 ± 0.054
0.908GlnPhe: 0.908 ± 0.035
1.301GlnGly: 1.301 ± 0.04
0.339GlnHis: 0.339 ± 0.026
1.809GlnIle: 1.809 ± 0.058
1.797GlnLys: 1.797 ± 0.055
2.364GlnLeu: 2.364 ± 0.075
0.531GlnMet: 0.531 ± 0.03
0.937GlnAsn: 0.937 ± 0.043
0.721GlnPro: 0.721 ± 0.039
0.637GlnGln: 0.637 ± 0.036
1.204GlnArg: 1.204 ± 0.043
1.133GlnSer: 1.133 ± 0.046
0.952GlnThr: 0.952 ± 0.034
1.386GlnVal: 1.386 ± 0.056
0.243GlnTrp: 0.243 ± 0.022
0.794GlnTyr: 0.794 ± 0.041
0.0GlnXaa: 0.0 ± 0.0
Arg
2.978ArgAla: 2.978 ± 0.068
0.355ArgCys: 0.355 ± 0.022
2.204ArgAsp: 2.204 ± 0.073
4.272ArgGlu: 4.272 ± 0.095
2.46ArgPhe: 2.46 ± 0.063
3.059ArgGly: 3.059 ± 0.075
0.737ArgHis: 0.737 ± 0.037
4.238ArgIle: 4.238 ± 0.089
4.352ArgLys: 4.352 ± 0.087
4.625ArgLeu: 4.625 ± 0.1
1.227ArgMet: 1.227 ± 0.044
2.14ArgAsn: 2.14 ± 0.064
1.486ArgPro: 1.486 ± 0.05
1.06ArgGln: 1.06 ± 0.042
2.558ArgArg: 2.558 ± 0.079
2.528ArgSer: 2.528 ± 0.063
2.235ArgThr: 2.235 ± 0.064
3.537ArgVal: 3.537 ± 0.078
0.481ArgTrp: 0.481 ± 0.031
2.143ArgTyr: 2.143 ± 0.072
0.0ArgXaa: 0.0 ± 0.0
Ser
3.692SerAla: 3.692 ± 0.088
0.458SerCys: 0.458 ± 0.033
2.918SerAsp: 2.918 ± 0.083
4.514SerGlu: 4.514 ± 0.092
3.576SerPhe: 3.576 ± 0.079
5.24SerGly: 5.24 ± 0.098
1.018SerHis: 1.018 ± 0.04
4.934SerIle: 4.934 ± 0.104
4.029SerLys: 4.029 ± 0.078
6.567SerLeu: 6.567 ± 0.116
1.506SerMet: 1.506 ± 0.054
2.254SerAsn: 2.254 ± 0.071
2.617SerPro: 2.617 ± 0.073
1.329SerGln: 1.329 ± 0.042
3.031SerArg: 3.031 ± 0.078
3.958SerSer: 3.958 ± 0.104
3.067SerThr: 3.067 ± 0.085
4.443SerVal: 4.443 ± 0.096
0.699SerTrp: 0.699 ± 0.038
2.247SerTyr: 2.247 ± 0.055
0.0SerXaa: 0.0 ± 0.0
Thr
3.376ThrAla: 3.376 ± 0.083
0.319ThrCys: 0.319 ± 0.023
2.335ThrAsp: 2.335 ± 0.067
3.216ThrGlu: 3.216 ± 0.081
2.52ThrPhe: 2.52 ± 0.066
4.434ThrGly: 4.434 ± 0.085
0.805ThrHis: 0.805 ± 0.038
3.975ThrIle: 3.975 ± 0.088
2.589ThrLys: 2.589 ± 0.067
5.417ThrLeu: 5.417 ± 0.123
1.095ThrMet: 1.095 ± 0.042
1.682ThrAsn: 1.682 ± 0.054
2.406ThrPro: 2.406 ± 0.071
0.962ThrGln: 0.962 ± 0.04
2.214ThrArg: 2.214 ± 0.05
2.936ThrSer: 2.936 ± 0.074
2.588ThrThr: 2.588 ± 0.077
3.925ThrVal: 3.925 ± 0.087
0.438ThrTrp: 0.438 ± 0.029
1.748ThrTyr: 1.748 ± 0.06
0.0ThrXaa: 0.0 ± 0.0
Val
4.854ValAla: 4.854 ± 0.11
0.506ValCys: 0.506 ± 0.029
3.881ValAsp: 3.881 ± 0.088
5.636ValGlu: 5.636 ± 0.109
4.005ValPhe: 4.005 ± 0.084
4.735ValGly: 4.735 ± 0.105
1.124ValHis: 1.124 ± 0.044
6.235ValIle: 6.235 ± 0.106
5.064ValLys: 5.064 ± 0.115
7.339ValLeu: 7.339 ± 0.102
1.801ValMet: 1.801 ± 0.057
2.875ValAsn: 2.875 ± 0.077
2.849ValPro: 2.849 ± 0.068
1.481ValGln: 1.481 ± 0.047
3.34ValArg: 3.34 ± 0.075
4.972ValSer: 4.972 ± 0.099
3.532ValThr: 3.532 ± 0.087
5.845ValVal: 5.845 ± 0.118
0.597ValTrp: 0.597 ± 0.029
2.426ValTyr: 2.426 ± 0.066
0.0ValXaa: 0.0 ± 0.0
Trp
0.526TrpAla: 0.526 ± 0.03
0.074TrpCys: 0.074 ± 0.01
0.529TrpAsp: 0.529 ± 0.027
0.663TrpGlu: 0.663 ± 0.036
0.476TrpPhe: 0.476 ± 0.028
0.676TrpGly: 0.676 ± 0.036
0.197TrpHis: 0.197 ± 0.021
0.721TrpIle: 0.721 ± 0.033
0.916TrpLys: 0.916 ± 0.038
1.169TrpLeu: 1.169 ± 0.046
0.299TrpMet: 0.299 ± 0.023
0.503TrpAsn: 0.503 ± 0.033
0.339TrpPro: 0.339 ± 0.025
0.298TrpGln: 0.298 ± 0.024
0.458TrpArg: 0.458 ± 0.025
0.569TrpSer: 0.569 ± 0.034
0.463TrpThr: 0.463 ± 0.032
0.646TrpVal: 0.646 ± 0.034
0.167TrpTrp: 0.167 ± 0.017
0.372TrpTyr: 0.372 ± 0.024
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.257TyrAla: 2.257 ± 0.06
0.316TyrCys: 0.316 ± 0.023
2.042TyrAsp: 2.042 ± 0.055
2.649TyrGlu: 2.649 ± 0.077
2.161TyrPhe: 2.161 ± 0.059
2.746TyrGly: 2.746 ± 0.073
0.704TyrHis: 0.704 ± 0.036
2.75TyrIle: 2.75 ± 0.067
1.931TyrLys: 1.931 ± 0.053
4.006TyrLeu: 4.006 ± 0.084
0.774TyrMet: 0.774 ± 0.033
1.478TyrAsn: 1.478 ± 0.056
1.515TyrPro: 1.515 ± 0.055
0.873TyrGln: 0.873 ± 0.035
2.087TyrArg: 2.087 ± 0.067
2.561TyrSer: 2.561 ± 0.064
1.85TyrThr: 1.85 ± 0.065
2.52TyrVal: 2.52 ± 0.065
0.43TyrTrp: 0.43 ± 0.03
1.639TyrTyr: 1.639 ± 0.059
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1897 proteins (604815 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski