Amino acid dipeptide frequency for Oribacterium sinus F0268

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
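As an illustration, the counting could be sketched as follows. This is a minimal sketch, not the database's actual pipeline: the function name `dipeptide_frequencies` is hypothetical, and it assumes that overlapping adjacent pairs are counted within each protein (never across protein boundaries) and that any non-standard letter is mapped to X (Xaa).

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein only,
    and every non-standard residue is treated as 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


if __name__ == "__main__":
    # Toy input, not taken from the proteome.
    print(dipeptide_frequencies(["MKKLA", "MKA"]))
```

One-letter codes are used here for brevity; the table uses the three-letter names (e.g. AlaAla corresponds to "AA").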

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
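The arithmetic behind the uniform expectation, and a comparison against it, can be sketched as below. The helper names are hypothetical, and the uniform baseline is the one stated in the text; whether the page colours cells against exactly this baseline is an assumption.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Expected frequency (per mille) of each dipeptide if all
    alphabet_size**2 ordered pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the expected frequency."""
    if observed_per_mille > expected_per_mille:
        return "over"    # shown in red in the table
    if observed_per_mille < expected_per_mille:
        return "under"   # shown in blue in the table
    return "as expected"


if __name__ == "__main__":
    expected = expected_uniform_per_mille(20)   # 1000 / 400 = 2.5
    print(expected)
    print(classify(11.16, expected))   # LeuLeu value from the table
    print(classify(0.104, expected))   # TrpTrp value from the table
```

With 21 symbols (including Xaa) the same calculation gives 1000 / 441, roughly 2.27‰.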

All values are presented in per mille (‰); to obtain fractions, multiply them by 10⁻³.
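For readers who prefer fractions or percentages, the unit conversion is a single multiplication. The sketch below uses hypothetical helper names and one value from the table (AlaAla, 5.286‰) as an example.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


if __name__ == "__main__":
    ala_ala = 5.286  # AlaAla frequency in per mille, from the table
    print(per_mille_to_fraction(ala_ala))  # approximately 0.005286
    print(per_mille_to_percent(ala_ala))   # approximately 0.5286
```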

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
5.286AlaAla: 5.286 ± 0.127
0.995AlaCys: 0.995 ± 0.04
3.196AlaAsp: 3.196 ± 0.072
5.761AlaGlu: 5.761 ± 0.098
3.74AlaPhe: 3.74 ± 0.083
5.453AlaGly: 5.453 ± 0.095
1.055AlaHis: 1.055 ± 0.042
4.944AlaIle: 4.944 ± 0.077
5.974AlaLys: 5.974 ± 0.093
8.312AlaLeu: 8.312 ± 0.117
2.377AlaMet: 2.377 ± 0.06
2.518AlaAsn: 2.518 ± 0.062
1.93AlaPro: 1.93 ± 0.053
1.862AlaGln: 1.862 ± 0.062
2.583AlaArg: 2.583 ± 0.055
4.105AlaSer: 4.105 ± 0.081
3.002AlaThr: 3.002 ± 0.081
5.233AlaVal: 5.233 ± 0.093
0.625AlaTrp: 0.625 ± 0.032
2.755AlaTyr: 2.755 ± 0.057
0.0AlaXaa: 0.0 ± 0.0
Cys
0.747CysAla: 0.747 ± 0.029
0.195CysCys: 0.195 ± 0.016
0.593CysAsp: 0.593 ± 0.027
0.67CysGlu: 0.67 ± 0.03
0.752CysPhe: 0.752 ± 0.034
1.062CysGly: 1.062 ± 0.042
0.302CysHis: 0.302 ± 0.022
1.013CysIle: 1.013 ± 0.037
0.934CysLys: 0.934 ± 0.033
1.209CysLeu: 1.209 ± 0.035
0.334CysMet: 0.334 ± 0.023
0.418CysAsn: 0.418 ± 0.024
0.59CysPro: 0.59 ± 0.036
0.314CysGln: 0.314 ± 0.019
0.446CysArg: 0.446 ± 0.024
0.799CysSer: 0.799 ± 0.03
0.664CysThr: 0.664 ± 0.03
0.683CysVal: 0.683 ± 0.038
0.066CysTrp: 0.066 ± 0.008
0.498CysTyr: 0.498 ± 0.028
0.0CysXaa: 0.0 ± 0.0
Asp
3.646AspAla: 3.646 ± 0.066
0.749AspCys: 0.749 ± 0.033
1.974AspAsp: 1.974 ± 0.062
3.565AspGlu: 3.565 ± 0.076
3.151AspPhe: 3.151 ± 0.069
3.767AspGly: 3.767 ± 0.074
0.77AspHis: 0.77 ± 0.035
3.701AspIle: 3.701 ± 0.071
3.449AspLys: 3.449 ± 0.07
5.049AspLeu: 5.049 ± 0.079
1.416AspMet: 1.416 ± 0.046
1.83AspAsn: 1.83 ± 0.056
1.682AspPro: 1.682 ± 0.048
1.334AspGln: 1.334 ± 0.043
2.23AspArg: 2.23 ± 0.058
3.447AspSer: 3.447 ± 0.074
2.399AspThr: 2.399 ± 0.057
3.022AspVal: 3.022 ± 0.072
0.516AspTrp: 0.516 ± 0.026
2.447AspTyr: 2.447 ± 0.062
0.0AspXaa: 0.0 ± 0.0
Glu
6.479GluAla: 6.479 ± 0.102
0.759GluCys: 0.759 ± 0.032
4.899GluAsp: 4.899 ± 0.092
11.206GluGlu: 11.206 ± 0.191
2.436GluPhe: 2.436 ± 0.059
5.871GluGly: 5.871 ± 0.101
1.331GluHis: 1.331 ± 0.037
5.544GluIle: 5.544 ± 0.079
9.421GluLys: 9.421 ± 0.143
6.847GluLeu: 6.847 ± 0.096
2.285GluMet: 2.285 ± 0.055
4.684GluAsn: 4.684 ± 0.076
1.628GluPro: 1.628 ± 0.052
3.028GluGln: 3.028 ± 0.068
3.874GluArg: 3.874 ± 0.081
4.386GluSer: 4.386 ± 0.083
3.008GluThr: 3.008 ± 0.071
4.559GluVal: 4.559 ± 0.087
0.661GluTrp: 0.661 ± 0.029
2.754GluTyr: 2.754 ± 0.066
0.0GluXaa: 0.0 ± 0.0
Phe
3.188PheAla: 3.188 ± 0.077
0.713PheCys: 0.713 ± 0.032
2.125PheAsp: 2.125 ± 0.055
2.139PheGlu: 2.139 ± 0.05
3.016PhePhe: 3.016 ± 0.093
3.021PheGly: 3.021 ± 0.069
0.998PheHis: 0.998 ± 0.034
2.766PheIle: 2.766 ± 0.076
2.152PheLys: 2.152 ± 0.049
5.818PheLeu: 5.818 ± 0.126
1.307PheMet: 1.307 ± 0.045
1.434PheAsn: 1.434 ± 0.048
2.072PhePro: 2.072 ± 0.055
2.044PheGln: 2.044 ± 0.06
2.081PheArg: 2.081 ± 0.058
4.634PheSer: 4.634 ± 0.097
2.433PheThr: 2.433 ± 0.052
2.835PheVal: 2.835 ± 0.066
0.431PheTrp: 0.431 ± 0.023
2.08PheTyr: 2.08 ± 0.055
0.0PheXaa: 0.0 ± 0.0
Gly
4.873GlyAla: 4.873 ± 0.092
0.828GlyCys: 0.828 ± 0.037
3.394GlyAsp: 3.394 ± 0.064
6.27GlyGlu: 6.27 ± 0.105
3.303GlyPhe: 3.303 ± 0.059
5.006GlyGly: 5.006 ± 0.114
1.107GlyHis: 1.107 ± 0.034
6.114GlyIle: 6.114 ± 0.104
6.901GlyLys: 6.901 ± 0.098
6.025GlyLeu: 6.025 ± 0.113
2.239GlyMet: 2.239 ± 0.053
3.321GlyAsn: 3.321 ± 0.076
1.245GlyPro: 1.245 ± 0.039
2.247GlyGln: 2.247 ± 0.069
3.205GlyArg: 3.205 ± 0.064
4.296GlySer: 4.296 ± 0.082
3.548GlyThr: 3.548 ± 0.079
4.558GlyVal: 4.558 ± 0.073
0.635GlyTrp: 0.635 ± 0.037
2.782GlyTyr: 2.782 ± 0.061
0.0GlyXaa: 0.0 ± 0.0
His
1.097HisAla: 1.097 ± 0.04
0.347HisCys: 0.347 ± 0.024
0.758HisAsp: 0.758 ± 0.034
1.016HisGlu: 1.016 ± 0.049
1.205HisPhe: 1.205 ± 0.044
1.291HisGly: 1.291 ± 0.042
0.436HisHis: 0.436 ± 0.036
1.226HisIle: 1.226 ± 0.049
1.085HisLys: 1.085 ± 0.039
1.63HisLeu: 1.63 ± 0.044
0.355HisMet: 0.355 ± 0.019
0.62HisAsn: 0.62 ± 0.029
0.835HisPro: 0.835 ± 0.032
0.536HisGln: 0.536 ± 0.027
0.827HisArg: 0.827 ± 0.036
1.236HisSer: 1.236 ± 0.049
0.725HisThr: 0.725 ± 0.037
0.965HisVal: 0.965 ± 0.037
0.172HisTrp: 0.172 ± 0.016
0.77HisTyr: 0.77 ± 0.034
0.0HisXaa: 0.0 ± 0.0
Ile
5.117IleAla: 5.117 ± 0.096
0.992IleCys: 0.992 ± 0.04
3.255IleAsp: 3.255 ± 0.072
4.296IleGlu: 4.296 ± 0.073
3.489IlePhe: 3.489 ± 0.076
4.612IleGly: 4.612 ± 0.094
1.35IleHis: 1.35 ± 0.042
4.06IleIle: 4.06 ± 0.089
3.633IleLys: 3.633 ± 0.074
8.076IleLeu: 8.076 ± 0.111
1.687IleMet: 1.687 ± 0.05
2.333IleAsn: 2.333 ± 0.062
3.373IlePro: 3.373 ± 0.073
2.441IleGln: 2.441 ± 0.057
3.229IleArg: 3.229 ± 0.074
5.255IleSer: 5.255 ± 0.084
3.229IleThr: 3.229 ± 0.063
4.046IleVal: 4.046 ± 0.079
0.453IleTrp: 0.453 ± 0.022
2.44IleTyr: 2.44 ± 0.061
0.001IleXaa: 0.001 ± 0.001
Lys
6.189LysAla: 6.189 ± 0.109
0.512LysCys: 0.512 ± 0.025
4.792LysAsp: 4.792 ± 0.091
9.263LysGlu: 9.263 ± 0.14
1.879LysPhe: 1.879 ± 0.047
5.25LysGly: 5.25 ± 0.073
1.093LysHis: 1.093 ± 0.039
4.944LysIle: 4.944 ± 0.076
7.564LysLys: 7.564 ± 0.105
6.07LysLeu: 6.07 ± 0.083
2.334LysMet: 2.334 ± 0.053
4.027LysAsn: 4.027 ± 0.073
1.997LysPro: 1.997 ± 0.057
2.592LysGln: 2.592 ± 0.058
3.537LysArg: 3.537 ± 0.074
4.09LysSer: 4.09 ± 0.074
3.758LysThr: 3.758 ± 0.079
4.257LysVal: 4.257 ± 0.083
0.604LysTrp: 0.604 ± 0.029
2.48LysTyr: 2.48 ± 0.062
0.001LysXaa: 0.001 ± 0.001
Leu
6.996LeuAla: 6.996 ± 0.105
1.635LeuCys: 1.635 ± 0.044
4.648LeuAsp: 4.648 ± 0.084
7.863LeuGlu: 7.863 ± 0.12
5.33LeuPhe: 5.33 ± 0.116
6.716LeuGly: 6.716 ± 0.121
1.824LeuHis: 1.824 ± 0.052
5.48LeuIle: 5.48 ± 0.101
6.589LeuLys: 6.589 ± 0.098
11.16LeuLeu: 11.16 ± 0.189
2.61LeuMet: 2.61 ± 0.062
3.346LeuAsn: 3.346 ± 0.065
4.263LeuPro: 4.263 ± 0.081
3.884LeuGln: 3.884 ± 0.079
4.251LeuArg: 4.251 ± 0.09
9.472LeuSer: 9.472 ± 0.134
4.135LeuThr: 4.135 ± 0.075
5.265LeuVal: 5.265 ± 0.088
0.858LeuTrp: 0.858 ± 0.036
3.9LeuTyr: 3.9 ± 0.069
0.001LeuXaa: 0.001 ± 0.001
Met
2.334MetAla: 2.334 ± 0.057
0.216MetCys: 0.216 ± 0.019
1.756MetAsp: 1.756 ± 0.048
2.873MetGlu: 2.873 ± 0.068
0.76MetPhe: 0.76 ± 0.033
2.168MetGly: 2.168 ± 0.059
0.46MetHis: 0.46 ± 0.027
1.712MetIle: 1.712 ± 0.05
2.372MetLys: 2.372 ± 0.053
2.611MetLeu: 2.611 ± 0.062
0.88MetMet: 0.88 ± 0.032
1.357MetAsn: 1.357 ± 0.049
1.02MetPro: 1.02 ± 0.031
1.231MetGln: 1.231 ± 0.041
1.192MetArg: 1.192 ± 0.04
1.501MetSer: 1.501 ± 0.04
1.218MetThr: 1.218 ± 0.042
1.75MetVal: 1.75 ± 0.054
0.176MetTrp: 0.176 ± 0.017
0.639MetTyr: 0.639 ± 0.027
0.0MetXaa: 0.0 ± 0.0
Asn
3.278AsnAla: 3.278 ± 0.062
0.511AsnCys: 0.511 ± 0.026
1.711AsnAsp: 1.711 ± 0.053
2.409AsnGlu: 2.409 ± 0.067
1.852AsnPhe: 1.852 ± 0.048
3.215AsnGly: 3.215 ± 0.084
0.722AsnHis: 0.722 ± 0.03
2.923AsnIle: 2.923 ± 0.069
2.673AsnLys: 2.673 ± 0.064
3.966AsnLeu: 3.966 ± 0.079
1.173AsnMet: 1.173 ± 0.04
1.61AsnAsn: 1.61 ± 0.061
2.237AsnPro: 2.237 ± 0.061
1.547AsnGln: 1.547 ± 0.042
1.911AsnArg: 1.911 ± 0.048
2.469AsnSer: 2.469 ± 0.072
2.266AsnThr: 2.266 ± 0.057
2.498AsnVal: 2.498 ± 0.055
0.399AsnTrp: 0.399 ± 0.023
1.591AsnTyr: 1.591 ± 0.047
0.0AsnXaa: 0.0 ± 0.0
Pro
2.138ProAla: 2.138 ± 0.057
0.369ProCys: 0.369 ± 0.024
1.783ProAsp: 1.783 ± 0.046
3.413ProGlu: 3.413 ± 0.081
1.77ProPhe: 1.77 ± 0.047
2.333ProGly: 2.333 ± 0.059
0.575ProHis: 0.575 ± 0.027
2.238ProIle: 2.238 ± 0.059
2.687ProLys: 2.687 ± 0.072
3.352ProLeu: 3.352 ± 0.068
0.931ProMet: 0.931 ± 0.035
1.345ProAsn: 1.345 ± 0.043
0.682ProPro: 0.682 ± 0.037
0.86ProGln: 0.86 ± 0.035
1.048ProArg: 1.048 ± 0.042
2.12ProSer: 2.12 ± 0.052
1.404ProThr: 1.404 ± 0.044
2.521ProVal: 2.521 ± 0.066
0.293ProTrp: 0.293 ± 0.023
1.6ProTyr: 1.6 ± 0.05
0.0ProXaa: 0.0 ± 0.0
Gln
2.369GlnAla: 2.369 ± 0.057
0.364GlnCys: 0.364 ± 0.021
1.864GlnAsp: 1.864 ± 0.053
3.725GlnGlu: 3.725 ± 0.082
1.244GlnPhe: 1.244 ± 0.037
2.647GlnGly: 2.647 ± 0.064
0.521GlnHis: 0.521 ± 0.029
2.204GlnIle: 2.204 ± 0.057
3.193GlnLys: 3.193 ± 0.064
2.418GlnLeu: 2.418 ± 0.061
0.907GlnMet: 0.907 ± 0.033
1.801GlnAsn: 1.801 ± 0.049
0.718GlnPro: 0.718 ± 0.037
1.013GlnGln: 1.013 ± 0.036
1.564GlnArg: 1.564 ± 0.045
2.17GlnSer: 2.17 ± 0.057
1.186GlnThr: 1.186 ± 0.044
1.885GlnVal: 1.885 ± 0.049
0.365GlnTrp: 0.365 ± 0.024
1.469GlnTyr: 1.469 ± 0.045
0.001GlnXaa: 0.001 ± 0.001
Arg
2.647ArgAla: 2.647 ± 0.058
0.453ArgCys: 0.453 ± 0.026
2.104ArgAsp: 2.104 ± 0.058
4.146ArgGlu: 4.146 ± 0.089
2.042ArgPhe: 2.042 ± 0.056
2.814ArgGly: 2.814 ± 0.061
0.622ArgHis: 0.622 ± 0.03
3.447ArgIle: 3.447 ± 0.065
3.966ArgLys: 3.966 ± 0.07
3.936ArgLeu: 3.936 ± 0.072
1.331ArgMet: 1.331 ± 0.044
1.956ArgAsn: 1.956 ± 0.053
1.196ArgPro: 1.196 ± 0.041
1.419ArgGln: 1.419 ± 0.05
2.203ArgArg: 2.203 ± 0.066
2.264ArgSer: 2.264 ± 0.059
1.714ArgThr: 1.714 ± 0.048
2.498ArgVal: 2.498 ± 0.051
0.315ArgTrp: 0.315 ± 0.021
1.86ArgTyr: 1.86 ± 0.054
0.0ArgXaa: 0.0 ± 0.0
Ser
4.318SerAla: 4.318 ± 0.081
0.791SerCys: 0.791 ± 0.032
2.892SerAsp: 2.892 ± 0.063
4.333SerGlu: 4.333 ± 0.084
4.1SerPhe: 4.1 ± 0.08
5.274SerGly: 5.274 ± 0.1
1.205SerHis: 1.205 ± 0.045
4.912SerIle: 4.912 ± 0.093
4.085SerLys: 4.085 ± 0.079
7.962SerLeu: 7.962 ± 0.139
1.956SerMet: 1.956 ± 0.046
2.366SerAsn: 2.366 ± 0.074
2.275SerPro: 2.275 ± 0.056
2.039SerGln: 2.039 ± 0.046
2.604SerArg: 2.604 ± 0.068
4.669SerSer: 4.669 ± 0.11
3.12SerThr: 3.12 ± 0.073
4.203SerVal: 4.203 ± 0.083
0.534SerTrp: 0.534 ± 0.024
3.07SerTyr: 3.07 ± 0.062
0.0SerXaa: 0.0 ± 0.0
Thr
3.754ThrAla: 3.754 ± 0.087
0.404ThrCys: 0.404 ± 0.026
2.435ThrAsp: 2.435 ± 0.059
4.101ThrGlu: 4.101 ± 0.072
1.592ThrPhe: 1.592 ± 0.048
4.001ThrGly: 4.001 ± 0.076
0.697ThrHis: 0.697 ± 0.03
3.128ThrIle: 3.128 ± 0.07
3.155ThrLys: 3.155 ± 0.069
4.419ThrLeu: 4.419 ± 0.078
1.217ThrMet: 1.217 ± 0.04
1.528ThrAsn: 1.528 ± 0.052
1.716ThrPro: 1.716 ± 0.056
1.19ThrGln: 1.19 ± 0.045
1.52ThrArg: 1.52 ± 0.046
2.237ThrSer: 2.237 ± 0.052
2.135ThrThr: 2.135 ± 0.064
3.784ThrVal: 3.784 ± 0.073
0.399ThrTrp: 0.399 ± 0.024
1.33ThrTyr: 1.33 ± 0.041
0.0ThrXaa: 0.0 ± 0.0
Val
4.209ValAla: 4.209 ± 0.083
0.767ValCys: 0.767 ± 0.034
3.422ValAsp: 3.422 ± 0.068
5.363ValGlu: 5.363 ± 0.08
2.971ValPhe: 2.971 ± 0.068
3.986ValGly: 3.986 ± 0.081
1.022ValHis: 1.022 ± 0.038
3.942ValIle: 3.942 ± 0.078
4.342ValLys: 4.342 ± 0.077
6.698ValLeu: 6.698 ± 0.094
1.614ValMet: 1.614 ± 0.049
2.474ValAsn: 2.474 ± 0.056
2.305ValPro: 2.305 ± 0.058
2.009ValGln: 2.009 ± 0.048
2.332ValArg: 2.332 ± 0.056
4.418ValSer: 4.418 ± 0.081
2.561ValThr: 2.561 ± 0.07
4.076ValVal: 4.076 ± 0.093
0.457ValTrp: 0.457 ± 0.026
2.211ValTyr: 2.211 ± 0.059
0.001ValXaa: 0.001 ± 0.001
Trp
0.45TrpAla: 0.45 ± 0.024
0.09TrpCys: 0.09 ± 0.013
0.491TrpAsp: 0.491 ± 0.027
0.727TrpGlu: 0.727 ± 0.034
0.351TrpPhe: 0.351 ± 0.022
0.514TrpGly: 0.514 ± 0.027
0.143TrpHis: 0.143 ± 0.013
0.689TrpIle: 0.689 ± 0.03
0.845TrpLys: 0.845 ± 0.033
0.684TrpLeu: 0.684 ± 0.03
0.287TrpMet: 0.287 ± 0.018
0.449TrpAsn: 0.449 ± 0.027
0.14TrpPro: 0.14 ± 0.015
0.423TrpGln: 0.423 ± 0.027
0.347TrpArg: 0.347 ± 0.025
0.455TrpSer: 0.455 ± 0.027
0.359TrpThr: 0.359 ± 0.024
0.44TrpVal: 0.44 ± 0.024
0.104TrpTrp: 0.104 ± 0.011
0.383TrpTyr: 0.383 ± 0.026
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.589TyrAla: 2.589 ± 0.061
0.601TyrCys: 0.601 ± 0.028
2.055TyrAsp: 2.055 ± 0.059
2.521TyrGlu: 2.521 ± 0.062
2.302TyrPhe: 2.302 ± 0.057
2.895TyrGly: 2.895 ± 0.064
0.898TyrHis: 0.898 ± 0.038
2.381TyrIle: 2.381 ± 0.053
2.212TyrLys: 2.212 ± 0.063
3.983TyrLeu: 3.983 ± 0.08
0.985TyrMet: 0.985 ± 0.036
1.542TyrAsn: 1.542 ± 0.045
1.555TyrPro: 1.555 ± 0.046
1.615TyrGln: 1.615 ± 0.048
1.948TyrArg: 1.948 ± 0.053
2.689TyrSer: 2.689 ± 0.068
1.875TyrThr: 1.875 ± 0.056
2.1TyrVal: 2.1 ± 0.054
0.31TyrTrp: 0.31 ± 0.022
1.78TyrTyr: 1.78 ± 0.066
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.001XaaCys: 0.001 ± 0.001
0.0XaaAsp: 0.0 ± 0.0
0.001XaaGlu: 0.001 ± 0.001
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.001XaaLys: 0.001 ± 0.001
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.001XaaPro: 0.001 ± 0.001
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.001XaaVal: 0.001 ± 0.001
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2638 proteins (777533 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
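A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under stated assumptions, not the database's implementation: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the reported error is taken to be the standard deviation across replicates (the page does not say which spread measure is used). The function names and the `seed` parameter are hypothetical.

```python
import random
from collections import Counter
from statistics import mean, stdev


def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_dipeptide(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in
    per mille, resampling whole proteins with replacement."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)


if __name__ == "__main__":
    # Toy sequences, not taken from the proteome.
    toy = ["MKKLLA", "MAKKA", "MLLLK", "MKA"]
    print(bootstrap_dipeptide(toy, "KK"))
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the spread reflects variation between proteins.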

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski