Amino acid dipepetide frequency for Simiduia agarivorans (strain DSM 21679 / JCM 13881 / BCRC 17597 / SA1)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.749AlaAla: 11.749 ± 0.144
1.273AlaCys: 1.273 ± 0.036
6.585AlaAsp: 6.585 ± 0.084
6.774AlaGlu: 6.774 ± 0.084
3.802AlaPhe: 3.802 ± 0.051
9.184AlaGly: 9.184 ± 0.117
2.26AlaHis: 2.26 ± 0.04
5.721AlaIle: 5.721 ± 0.075
3.912AlaLys: 3.912 ± 0.071
12.882AlaLeu: 12.882 ± 0.129
2.892AlaMet: 2.892 ± 0.053
3.885AlaAsn: 3.885 ± 0.058
4.51AlaPro: 4.51 ± 0.073
4.943AlaGln: 4.943 ± 0.066
6.489AlaArg: 6.489 ± 0.077
6.272AlaSer: 6.272 ± 0.088
5.045AlaThr: 5.045 ± 0.069
7.022AlaVal: 7.022 ± 0.092
1.347AlaTrp: 1.347 ± 0.035
2.554AlaTyr: 2.554 ± 0.045
0.0AlaXaa: 0.0 ± 0.0
Cys
1.042CysAla: 1.042 ± 0.031
0.174CysCys: 0.174 ± 0.012
0.592CysAsp: 0.592 ± 0.018
0.608CysGlu: 0.608 ± 0.023
0.422CysPhe: 0.422 ± 0.019
0.941CysGly: 0.941 ± 0.036
0.297CysHis: 0.297 ± 0.02
0.499CysIle: 0.499 ± 0.024
0.343CysLys: 0.343 ± 0.017
1.173CysLeu: 1.173 ± 0.033
0.188CysMet: 0.188 ± 0.013
0.349CysAsn: 0.349 ± 0.019
0.488CysPro: 0.488 ± 0.018
0.403CysGln: 0.403 ± 0.02
0.536CysArg: 0.536 ± 0.02
0.626CysSer: 0.626 ± 0.02
0.497CysThr: 0.497 ± 0.022
0.713CysVal: 0.713 ± 0.024
0.159CysTrp: 0.159 ± 0.011
0.297CysTyr: 0.297 ± 0.018
0.0CysXaa: 0.0 ± 0.0
Asp
6.359AspAla: 6.359 ± 0.069
0.6AspCys: 0.6 ± 0.022
3.205AspAsp: 3.205 ± 0.066
3.381AspGlu: 3.381 ± 0.057
2.337AspPhe: 2.337 ± 0.043
4.447AspGly: 4.447 ± 0.08
1.196AspHis: 1.196 ± 0.032
3.22AspIle: 3.22 ± 0.052
2.416AspLys: 2.416 ± 0.052
5.661AspLeu: 5.661 ± 0.066
1.378AspMet: 1.378 ± 0.036
2.139AspAsn: 2.139 ± 0.045
2.618AspPro: 2.618 ± 0.045
2.753AspGln: 2.753 ± 0.051
2.977AspArg: 2.977 ± 0.051
3.166AspSer: 3.166 ± 0.055
3.013AspThr: 3.013 ± 0.052
3.883AspVal: 3.883 ± 0.053
1.012AspTrp: 1.012 ± 0.03
2.076AspTyr: 2.076 ± 0.047
0.0AspXaa: 0.0 ± 0.0
Glu
6.075GluAla: 6.075 ± 0.094
0.449GluCys: 0.449 ± 0.02
2.717GluAsp: 2.717 ± 0.048
2.857GluGlu: 2.857 ± 0.058
2.026GluPhe: 2.026 ± 0.038
3.717GluGly: 3.717 ± 0.057
1.426GluHis: 1.426 ± 0.039
3.003GluIle: 3.003 ± 0.055
2.688GluLys: 2.688 ± 0.05
6.098GluLeu: 6.098 ± 0.074
1.365GluMet: 1.365 ± 0.028
1.965GluAsn: 1.965 ± 0.042
2.537GluPro: 2.537 ± 0.048
3.712GluGln: 3.712 ± 0.058
3.812GluArg: 3.812 ± 0.062
3.313GluSer: 3.313 ± 0.048
2.914GluThr: 2.914 ± 0.053
3.644GluVal: 3.644 ± 0.06
0.789GluTrp: 0.789 ± 0.024
1.451GluTyr: 1.451 ± 0.034
0.0GluXaa: 0.0 ± 0.0
Phe
3.763PheAla: 3.763 ± 0.057
0.457PheCys: 0.457 ± 0.021
2.7PheAsp: 2.7 ± 0.049
2.297PheGlu: 2.297 ± 0.041
1.541PhePhe: 1.541 ± 0.042
3.203PheGly: 3.203 ± 0.053
0.825PheHis: 0.825 ± 0.027
1.923PheIle: 1.923 ± 0.036
1.386PheLys: 1.386 ± 0.031
3.309PheLeu: 3.309 ± 0.059
0.895PheMet: 0.895 ± 0.025
1.71PheAsn: 1.71 ± 0.037
1.392PhePro: 1.392 ± 0.036
1.21PheGln: 1.21 ± 0.03
1.81PheArg: 1.81 ± 0.036
2.889PheSer: 2.889 ± 0.053
2.299PheThr: 2.299 ± 0.046
2.688PheVal: 2.688 ± 0.044
0.581PheTrp: 0.581 ± 0.024
1.27PheTyr: 1.27 ± 0.032
0.0PheXaa: 0.0 ± 0.0
Gly
7.04GlyAla: 7.04 ± 0.109
0.883GlyCys: 0.883 ± 0.026
4.365GlyAsp: 4.365 ± 0.072
4.527GlyGlu: 4.527 ± 0.067
3.531GlyPhe: 3.531 ± 0.047
5.712GlyGly: 5.712 ± 0.109
1.777GlyHis: 1.777 ± 0.038
4.205GlyIle: 4.205 ± 0.065
3.491GlyLys: 3.491 ± 0.059
8.171GlyLeu: 8.171 ± 0.082
2.066GlyMet: 2.066 ± 0.04
2.828GlyAsn: 2.828 ± 0.07
2.268GlyPro: 2.268 ± 0.044
3.269GlyGln: 3.269 ± 0.052
3.985GlyArg: 3.985 ± 0.056
4.439GlySer: 4.439 ± 0.091
3.698GlyThr: 3.698 ± 0.068
5.844GlyVal: 5.844 ± 0.082
1.277GlyTrp: 1.277 ± 0.032
2.517GlyTyr: 2.517 ± 0.045
0.0GlyXaa: 0.0 ± 0.0
His
2.178HisAla: 2.178 ± 0.037
0.35HisCys: 0.35 ± 0.017
0.988HisAsp: 0.988 ± 0.03
1.002HisGlu: 1.002 ± 0.029
1.036HisPhe: 1.036 ± 0.029
1.567HisGly: 1.567 ± 0.037
0.687HisHis: 0.687 ± 0.027
1.161HisIle: 1.161 ± 0.032
0.905HisLys: 0.905 ± 0.03
2.426HisLeu: 2.426 ± 0.05
0.56HisMet: 0.56 ± 0.022
0.774HisAsn: 0.774 ± 0.027
1.367HisPro: 1.367 ± 0.035
1.236HisGln: 1.236 ± 0.032
1.371HisArg: 1.371 ± 0.037
1.253HisSer: 1.253 ± 0.035
1.185HisThr: 1.185 ± 0.035
1.174HisVal: 1.174 ± 0.03
0.568HisTrp: 0.568 ± 0.02
0.917HisTyr: 0.917 ± 0.027
0.0HisXaa: 0.0 ± 0.0
Ile
5.945IleAla: 5.945 ± 0.074
0.52IleCys: 0.52 ± 0.019
3.563IleAsp: 3.563 ± 0.061
3.376IleGlu: 3.376 ± 0.052
1.595IlePhe: 1.595 ± 0.037
3.904IleGly: 3.904 ± 0.06
1.097IleHis: 1.097 ± 0.032
2.288IleIle: 2.288 ± 0.048
2.204IleLys: 2.204 ± 0.046
3.895IleLeu: 3.895 ± 0.068
0.961IleMet: 0.961 ± 0.032
2.408IleAsn: 2.408 ± 0.054
2.189IlePro: 2.189 ± 0.039
1.817IleGln: 1.817 ± 0.037
2.777IleArg: 2.777 ± 0.04
3.195IleSer: 3.195 ± 0.058
3.063IleThr: 3.063 ± 0.052
3.281IleVal: 3.281 ± 0.056
0.56IleTrp: 0.56 ± 0.022
1.405IleTyr: 1.405 ± 0.031
0.0IleXaa: 0.0 ± 0.0
Lys
4.439LysAla: 4.439 ± 0.084
0.236LysCys: 0.236 ± 0.017
2.059LysAsp: 2.059 ± 0.045
1.846LysGlu: 1.846 ± 0.048
1.121LysPhe: 1.121 ± 0.032
2.713LysGly: 2.713 ± 0.057
0.929LysHis: 0.929 ± 0.03
2.002LysIle: 2.002 ± 0.04
1.758LysLys: 1.758 ± 0.047
3.996LysLeu: 3.996 ± 0.063
0.954LysMet: 0.954 ± 0.031
1.419LysAsn: 1.419 ± 0.038
2.244LysPro: 2.244 ± 0.046
1.911LysGln: 1.911 ± 0.041
2.315LysArg: 2.315 ± 0.046
2.245LysSer: 2.245 ± 0.044
2.223LysThr: 2.223 ± 0.051
2.818LysVal: 2.818 ± 0.052
0.36LysTrp: 0.36 ± 0.016
0.888LysTyr: 0.888 ± 0.029
0.0LysXaa: 0.0 ± 0.0
Leu
13.562LeuAla: 13.562 ± 0.141
1.249LeuCys: 1.249 ± 0.029
6.099LeuAsp: 6.099 ± 0.074
5.586LeuGlu: 5.586 ± 0.079
3.941LeuPhe: 3.941 ± 0.063
7.632LeuGly: 7.632 ± 0.098
2.211LeuHis: 2.211 ± 0.046
5.192LeuIle: 5.192 ± 0.07
4.208LeuLys: 4.208 ± 0.065
11.789LeuLeu: 11.789 ± 0.146
2.591LeuMet: 2.591 ± 0.048
3.986LeuAsn: 3.986 ± 0.054
5.462LeuPro: 5.462 ± 0.078
4.318LeuGln: 4.318 ± 0.07
5.697LeuArg: 5.697 ± 0.079
6.935LeuSer: 6.935 ± 0.095
5.965LeuThr: 5.965 ± 0.081
7.976LeuVal: 7.976 ± 0.098
1.51LeuTrp: 1.51 ± 0.043
2.567LeuTyr: 2.567 ± 0.048
0.0LeuXaa: 0.0 ± 0.0
Met
2.978MetAla: 2.978 ± 0.05
0.191MetCys: 0.191 ± 0.014
1.278MetAsp: 1.278 ± 0.032
1.191MetGlu: 1.191 ± 0.031
0.662MetPhe: 0.662 ± 0.025
1.795MetGly: 1.795 ± 0.043
0.543MetHis: 0.543 ± 0.023
1.128MetIle: 1.128 ± 0.032
1.066MetLys: 1.066 ± 0.028
2.416MetLeu: 2.416 ± 0.054
0.538MetMet: 0.538 ± 0.021
0.951MetAsn: 0.951 ± 0.028
1.307MetPro: 1.307 ± 0.033
1.083MetGln: 1.083 ± 0.028
1.297MetArg: 1.297 ± 0.034
1.571MetSer: 1.571 ± 0.037
1.445MetThr: 1.445 ± 0.034
1.649MetVal: 1.649 ± 0.037
0.202MetTrp: 0.202 ± 0.011
0.413MetTyr: 0.413 ± 0.017
0.0MetXaa: 0.0 ± 0.0
Asn
3.814AsnAla: 3.814 ± 0.068
0.358AsnCys: 0.358 ± 0.019
1.928AsnAsp: 1.928 ± 0.041
1.725AsnGlu: 1.725 ± 0.034
1.311AsnPhe: 1.311 ± 0.036
3.06AsnGly: 3.06 ± 0.076
0.822AsnHis: 0.822 ± 0.026
1.79AsnIle: 1.79 ± 0.037
1.37AsnLys: 1.37 ± 0.042
3.747AsnLeu: 3.747 ± 0.061
0.766AsnMet: 0.766 ± 0.024
1.502AsnAsn: 1.502 ± 0.045
2.276AsnPro: 2.276 ± 0.038
1.78AsnGln: 1.78 ± 0.041
2.134AsnArg: 2.134 ± 0.039
1.938AsnSer: 1.938 ± 0.038
2.069AsnThr: 2.069 ± 0.045
2.18AsnVal: 2.18 ± 0.049
0.571AsnTrp: 0.571 ± 0.02
1.223AsnTyr: 1.223 ± 0.04
0.0AsnXaa: 0.0 ± 0.0
Pro
5.602ProAla: 5.602 ± 0.072
0.345ProCys: 0.345 ± 0.014
3.256ProAsp: 3.256 ± 0.055
3.512ProGlu: 3.512 ± 0.058
1.69ProPhe: 1.69 ± 0.039
3.592ProGly: 3.592 ± 0.06
0.989ProHis: 0.989 ± 0.031
1.91ProIle: 1.91 ± 0.037
1.535ProLys: 1.535 ± 0.044
4.904ProLeu: 4.904 ± 0.072
1.046ProMet: 1.046 ± 0.029
1.378ProAsn: 1.378 ± 0.033
1.534ProPro: 1.534 ± 0.04
1.707ProGln: 1.707 ± 0.038
1.896ProArg: 1.896 ± 0.039
2.367ProSer: 2.367 ± 0.047
2.386ProThr: 2.386 ± 0.083
4.046ProVal: 4.046 ± 0.052
0.657ProTrp: 0.657 ± 0.023
1.15ProTyr: 1.15 ± 0.033
0.0ProXaa: 0.0 ± 0.0
Gln
5.698GlnAla: 5.698 ± 0.085
0.449GlnCys: 0.449 ± 0.02
1.991GlnAsp: 1.991 ± 0.038
1.923GlnGlu: 1.923 ± 0.041
1.722GlnPhe: 1.722 ± 0.038
3.094GlnGly: 3.094 ± 0.053
1.151GlnHis: 1.151 ± 0.027
2.045GlnIle: 2.045 ± 0.04
1.561GlnLys: 1.561 ± 0.035
5.858GlnLeu: 5.858 ± 0.091
1.127GlnMet: 1.127 ± 0.027
1.236GlnAsn: 1.236 ± 0.033
2.219GlnPro: 2.219 ± 0.047
2.871GlnGln: 2.871 ± 0.061
2.788GlnArg: 2.788 ± 0.053
2.758GlnSer: 2.758 ± 0.051
2.236GlnThr: 2.236 ± 0.047
3.397GlnVal: 3.397 ± 0.061
0.987GlnTrp: 0.987 ± 0.031
1.155GlnTyr: 1.155 ± 0.036
0.001GlnXaa: 0.001 ± 0.001
Arg
5.776ArgAla: 5.776 ± 0.079
0.558ArgCys: 0.558 ± 0.023
3.14ArgAsp: 3.14 ± 0.055
3.553ArgGlu: 3.553 ± 0.054
2.607ArgPhe: 2.607 ± 0.044
3.403ArgGly: 3.403 ± 0.053
1.396ArgHis: 1.396 ± 0.039
3.106ArgIle: 3.106 ± 0.055
2.141ArgLys: 2.141 ± 0.042
6.54ArgLeu: 6.54 ± 0.087
1.435ArgMet: 1.435 ± 0.038
1.892ArgAsn: 1.892 ± 0.035
2.18ArgPro: 2.18 ± 0.043
2.812ArgGln: 2.812 ± 0.056
3.212ArgArg: 3.212 ± 0.063
2.79ArgSer: 2.79 ± 0.048
2.508ArgThr: 2.508 ± 0.04
3.839ArgVal: 3.839 ± 0.061
1.114ArgTrp: 1.114 ± 0.03
2.012ArgTyr: 2.012 ± 0.041
0.0ArgXaa: 0.0 ± 0.0
Ser
6.439SerAla: 6.439 ± 0.093
0.552SerCys: 0.552 ± 0.022
3.453SerAsp: 3.453 ± 0.049
3.32SerGlu: 3.32 ± 0.052
2.377SerPhe: 2.377 ± 0.05
5.132SerGly: 5.132 ± 0.093
1.436SerHis: 1.436 ± 0.033
2.744SerIle: 2.744 ± 0.05
1.836SerLys: 1.836 ± 0.04
6.877SerLeu: 6.877 ± 0.08
1.361SerMet: 1.361 ± 0.035
1.954SerAsn: 1.954 ± 0.045
2.692SerPro: 2.692 ± 0.053
2.669SerGln: 2.669 ± 0.056
3.258SerArg: 3.258 ± 0.055
3.304SerSer: 3.304 ± 0.075
2.794SerThr: 2.794 ± 0.054
4.349SerVal: 4.349 ± 0.079
0.877SerTrp: 0.877 ± 0.027
1.711SerTyr: 1.711 ± 0.038
0.0SerXaa: 0.0 ± 0.0
Thr
5.158ThrAla: 5.158 ± 0.074
0.42ThrCys: 0.42 ± 0.02
3.209ThrAsp: 3.209 ± 0.056
2.963ThrGlu: 2.963 ± 0.054
1.815ThrPhe: 1.815 ± 0.036
4.806ThrGly: 4.806 ± 0.076
1.194ThrHis: 1.194 ± 0.035
2.424ThrIle: 2.424 ± 0.05
1.485ThrLys: 1.485 ± 0.032
6.402ThrLeu: 6.402 ± 0.084
0.883ThrMet: 0.883 ± 0.024
1.726ThrAsn: 1.726 ± 0.038
3.279ThrPro: 3.279 ± 0.088
2.372ThrGln: 2.372 ± 0.041
2.913ThrArg: 2.913 ± 0.052
2.748ThrSer: 2.748 ± 0.057
2.575ThrThr: 2.575 ± 0.055
3.621ThrVal: 3.621 ± 0.061
0.625ThrTrp: 0.625 ± 0.021
1.381ThrTyr: 1.381 ± 0.038
0.0ThrXaa: 0.0 ± 0.0
Val
7.745ValAla: 7.745 ± 0.086
0.754ValCys: 0.754 ± 0.027
4.413ValAsp: 4.413 ± 0.064
4.133ValGlu: 4.133 ± 0.059
2.751ValPhe: 2.751 ± 0.047
4.829ValGly: 4.829 ± 0.061
1.368ValHis: 1.368 ± 0.036
3.834ValIle: 3.834 ± 0.054
2.788ValLys: 2.788 ± 0.053
6.991ValLeu: 6.991 ± 0.084
1.775ValMet: 1.775 ± 0.035
2.795ValAsn: 2.795 ± 0.057
2.998ValPro: 2.998 ± 0.049
2.715ValGln: 2.715 ± 0.049
3.625ValArg: 3.625 ± 0.056
4.733ValSer: 4.733 ± 0.07
3.926ValThr: 3.926 ± 0.061
5.257ValVal: 5.257 ± 0.07
0.905ValTrp: 0.905 ± 0.025
1.941ValTyr: 1.941 ± 0.038
0.0ValXaa: 0.0 ± 0.0
Trp
1.23TrpAla: 1.23 ± 0.033
0.157TrpCys: 0.157 ± 0.011
0.732TrpAsp: 0.732 ± 0.026
0.591TrpGlu: 0.591 ± 0.02
0.629TrpPhe: 0.629 ± 0.023
0.955TrpGly: 0.955 ± 0.029
0.401TrpHis: 0.401 ± 0.018
0.683TrpIle: 0.683 ± 0.024
0.36TrpLys: 0.36 ± 0.018
2.246TrpLeu: 2.246 ± 0.059
0.369TrpMet: 0.369 ± 0.017
0.473TrpAsn: 0.473 ± 0.022
0.655TrpPro: 0.655 ± 0.02
1.066TrpGln: 1.066 ± 0.032
1.086TrpArg: 1.086 ± 0.028
0.831TrpSer: 0.831 ± 0.028
0.642TrpThr: 0.642 ± 0.02
1.086TrpVal: 1.086 ± 0.029
0.287TrpTrp: 0.287 ± 0.014
0.403TrpTyr: 0.403 ± 0.017
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.508TyrAla: 2.508 ± 0.047
0.37TyrCys: 0.37 ± 0.017
1.564TyrAsp: 1.564 ± 0.034
1.396TyrGlu: 1.396 ± 0.034
1.229TyrPhe: 1.229 ± 0.03
2.205TyrGly: 2.205 ± 0.043
0.699TyrHis: 0.699 ± 0.022
1.163TyrIle: 1.163 ± 0.03
1.021TyrLys: 1.021 ± 0.03
3.093TyrLeu: 3.093 ± 0.05
0.554TyrMet: 0.554 ± 0.022
0.995TyrAsn: 0.995 ± 0.034
1.287TyrPro: 1.287 ± 0.031
1.622TyrGln: 1.622 ± 0.04
2.039TyrArg: 2.039 ± 0.044
1.764TyrSer: 1.764 ± 0.043
1.517TyrThr: 1.517 ± 0.04
1.809TyrVal: 1.809 ± 0.038
0.494TyrTrp: 0.494 ± 0.02
0.948TyrTyr: 0.948 ± 0.033
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.001XaaPro: 0.001 ± 0.001
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3812 proteins (1294979 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski