Amino acid dipepetide frequency for Smithella sp. D17

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.922AlaAla: 6.922 ± 0.159
0.871AlaCys: 0.871 ± 0.051
4.131AlaAsp: 4.131 ± 0.106
4.653AlaGlu: 4.653 ± 0.126
3.17AlaPhe: 3.17 ± 0.091
6.222AlaGly: 6.222 ± 0.143
1.311AlaHis: 1.311 ± 0.058
6.021AlaIle: 6.021 ± 0.137
4.845AlaLys: 4.845 ± 0.123
7.666AlaLeu: 7.666 ± 0.162
2.22AlaMet: 2.22 ± 0.092
2.858AlaAsn: 2.858 ± 0.102
2.169AlaPro: 2.169 ± 0.066
2.6AlaGln: 2.6 ± 0.082
3.903AlaArg: 3.903 ± 0.106
4.229AlaSer: 4.229 ± 0.109
3.379AlaThr: 3.379 ± 0.111
5.404AlaVal: 5.404 ± 0.111
0.695AlaTrp: 0.695 ± 0.046
2.353AlaTyr: 2.353 ± 0.09
0.0AlaXaa: 0.0 ± 0.0
Cys
0.855CysAla: 0.855 ± 0.053
0.166CysCys: 0.166 ± 0.023
0.54CysAsp: 0.54 ± 0.038
0.556CysGlu: 0.556 ± 0.043
0.483CysPhe: 0.483 ± 0.035
1.148CysGly: 1.148 ± 0.065
0.337CysHis: 0.337 ± 0.03
0.711CysIle: 0.711 ± 0.044
0.635CysLys: 0.635 ± 0.049
1.211CysLeu: 1.211 ± 0.067
0.236CysMet: 0.236 ± 0.026
0.432CysAsn: 0.432 ± 0.039
0.643CysPro: 0.643 ± 0.039
0.309CysGln: 0.309 ± 0.03
0.649CysArg: 0.649 ± 0.041
0.741CysSer: 0.741 ± 0.051
0.508CysThr: 0.508 ± 0.045
0.706CysVal: 0.706 ± 0.04
0.119CysTrp: 0.119 ± 0.017
0.396CysTyr: 0.396 ± 0.031
0.0CysXaa: 0.0 ± 0.0
Asp
3.789AspAla: 3.789 ± 0.1
0.597AspCys: 0.597 ± 0.044
2.712AspAsp: 2.712 ± 0.08
3.656AspGlu: 3.656 ± 0.111
2.78AspPhe: 2.78 ± 0.085
3.257AspGly: 3.257 ± 0.101
0.934AspHis: 0.934 ± 0.053
4.938AspIle: 4.938 ± 0.116
3.944AspLys: 3.944 ± 0.114
5.331AspLeu: 5.331 ± 0.113
1.458AspMet: 1.458 ± 0.068
2.256AspAsn: 2.256 ± 0.078
1.979AspPro: 1.979 ± 0.075
1.477AspGln: 1.477 ± 0.061
2.381AspArg: 2.381 ± 0.081
2.785AspSer: 2.785 ± 0.103
2.313AspThr: 2.313 ± 0.077
3.749AspVal: 3.749 ± 0.103
0.573AspTrp: 0.573 ± 0.034
2.117AspTyr: 2.117 ± 0.081
0.0AspXaa: 0.0 ± 0.0
Glu
4.807GluAla: 4.807 ± 0.125
0.453GluCys: 0.453 ± 0.034
3.151GluAsp: 3.151 ± 0.109
4.767GluGlu: 4.767 ± 0.127
2.4GluPhe: 2.4 ± 0.082
3.857GluGly: 3.857 ± 0.122
1.04GluHis: 1.04 ± 0.052
5.858GluIle: 5.858 ± 0.13
6.637GluLys: 6.637 ± 0.154
5.882GluLeu: 5.882 ± 0.123
1.884GluMet: 1.884 ± 0.062
3.569GluAsn: 3.569 ± 0.101
1.781GluPro: 1.781 ± 0.069
1.941GluGln: 1.941 ± 0.076
3.16GluArg: 3.16 ± 0.104
3.401GluSer: 3.401 ± 0.102
3.423GluThr: 3.423 ± 0.098
3.963GluVal: 3.963 ± 0.121
0.554GluTrp: 0.554 ± 0.038
1.968GluTyr: 1.968 ± 0.073
0.0GluXaa: 0.0 ± 0.0
Phe
3.17PheAla: 3.17 ± 0.091
0.662PheCys: 0.662 ± 0.04
2.505PheAsp: 2.505 ± 0.073
2.37PheGlu: 2.37 ± 0.081
2.562PhePhe: 2.562 ± 0.105
3.051PheGly: 3.051 ± 0.091
0.871PheHis: 0.871 ± 0.048
3.453PheIle: 3.453 ± 0.102
2.592PheLys: 2.592 ± 0.081
4.533PheLeu: 4.533 ± 0.147
1.094PheMet: 1.094 ± 0.051
1.954PheAsn: 1.954 ± 0.065
1.77PhePro: 1.77 ± 0.07
1.395PheGln: 1.395 ± 0.066
2.001PheArg: 2.001 ± 0.067
3.298PheSer: 3.298 ± 0.1
2.41PheThr: 2.41 ± 0.082
2.883PheVal: 2.883 ± 0.102
0.521PheTrp: 0.521 ± 0.042
1.688PheTyr: 1.688 ± 0.077
0.0PheXaa: 0.0 ± 0.0
Gly
4.75GlyAla: 4.75 ± 0.122
0.996GlyCys: 0.996 ± 0.058
3.382GlyAsp: 3.382 ± 0.108
3.977GlyGlu: 3.977 ± 0.1
3.084GlyPhe: 3.084 ± 0.085
4.788GlyGly: 4.788 ± 0.15
1.344GlyHis: 1.344 ± 0.058
5.961GlyIle: 5.961 ± 0.12
5.66GlyLys: 5.66 ± 0.141
6.042GlyLeu: 6.042 ± 0.134
2.177GlyMet: 2.177 ± 0.064
2.972GlyAsn: 2.972 ± 0.091
1.846GlyPro: 1.846 ± 0.088
2.15GlyGln: 2.15 ± 0.084
3.415GlyArg: 3.415 ± 0.102
3.971GlySer: 3.971 ± 0.12
3.879GlyThr: 3.879 ± 0.137
4.579GlyVal: 4.579 ± 0.126
0.793GlyTrp: 0.793 ± 0.051
2.372GlyTyr: 2.372 ± 0.079
0.0GlyXaa: 0.0 ± 0.0
His
1.251HisAla: 1.251 ± 0.061
0.271HisCys: 0.271 ± 0.03
0.942HisAsp: 0.942 ± 0.06
0.939HisGlu: 0.939 ± 0.046
0.909HisPhe: 0.909 ± 0.055
1.317HisGly: 1.317 ± 0.052
0.513HisHis: 0.513 ± 0.041
1.327HisIle: 1.327 ± 0.061
1.11HisLys: 1.11 ± 0.061
1.794HisLeu: 1.794 ± 0.067
0.407HisMet: 0.407 ± 0.038
0.708HisAsn: 0.708 ± 0.048
1.145HisPro: 1.145 ± 0.058
0.63HisGln: 0.63 ± 0.039
0.972HisArg: 0.972 ± 0.047
1.048HisSer: 1.048 ± 0.055
0.85HisThr: 0.85 ± 0.05
1.124HisVal: 1.124 ± 0.057
0.22HisTrp: 0.22 ± 0.023
0.681HisTyr: 0.681 ± 0.039
0.0HisXaa: 0.0 ± 0.0
Ile
6.414IleAla: 6.414 ± 0.131
0.934IleCys: 0.934 ± 0.046
4.582IleAsp: 4.582 ± 0.119
5.133IleGlu: 5.133 ± 0.134
3.969IlePhe: 3.969 ± 0.137
5.106IleGly: 5.106 ± 0.132
1.344IleHis: 1.344 ± 0.055
7.106IleIle: 7.106 ± 0.185
6.27IleLys: 6.27 ± 0.15
7.299IleLeu: 7.299 ± 0.159
1.919IleMet: 1.919 ± 0.082
4.104IleAsn: 4.104 ± 0.123
3.436IlePro: 3.436 ± 0.09
2.085IleGln: 2.085 ± 0.073
3.814IleArg: 3.814 ± 0.1
5.586IleSer: 5.586 ± 0.137
4.406IleThr: 4.406 ± 0.11
5.391IleVal: 5.391 ± 0.127
0.665IleTrp: 0.665 ± 0.043
2.611IleTyr: 2.611 ± 0.093
0.0IleXaa: 0.0 ± 0.0
Lys
5.277LysAla: 5.277 ± 0.131
0.66LysCys: 0.66 ± 0.047
3.977LysAsp: 3.977 ± 0.122
5.945LysGlu: 5.945 ± 0.154
2.367LysPhe: 2.367 ± 0.086
4.381LysGly: 4.381 ± 0.123
1.021LysHis: 1.021 ± 0.061
6.987LysIle: 6.987 ± 0.148
7.546LysLys: 7.546 ± 0.168
6.083LysLeu: 6.083 ± 0.145
2.326LysMet: 2.326 ± 0.093
4.123LysAsn: 4.123 ± 0.103
2.554LysPro: 2.554 ± 0.091
2.413LysGln: 2.413 ± 0.078
3.415LysArg: 3.415 ± 0.096
4.34LysSer: 4.34 ± 0.112
4.365LysThr: 4.365 ± 0.113
4.555LysVal: 4.555 ± 0.11
0.654LysTrp: 0.654 ± 0.039
2.755LysTyr: 2.755 ± 0.093
0.0LysXaa: 0.0 ± 0.0
Leu
7.568LeuAla: 7.568 ± 0.15
1.091LeuCys: 1.091 ± 0.055
4.954LeuAsp: 4.954 ± 0.108
5.863LeuGlu: 5.863 ± 0.138
4.289LeuPhe: 4.289 ± 0.117
5.877LeuGly: 5.877 ± 0.143
1.743LeuHis: 1.743 ± 0.066
7.419LeuIle: 7.419 ± 0.171
7.641LeuLys: 7.641 ± 0.164
8.765LeuLeu: 8.765 ± 0.17
2.454LeuMet: 2.454 ± 0.085
4.373LeuAsn: 4.373 ± 0.12
4.197LeuPro: 4.197 ± 0.11
3.195LeuGln: 3.195 ± 0.096
4.71LeuArg: 4.71 ± 0.124
6.336LeuSer: 6.336 ± 0.129
5.106LeuThr: 5.106 ± 0.107
5.418LeuVal: 5.418 ± 0.131
0.936LeuTrp: 0.936 ± 0.053
2.785LeuTyr: 2.785 ± 0.095
0.0LeuXaa: 0.0 ± 0.0
Met
2.402MetAla: 2.402 ± 0.08
0.198MetCys: 0.198 ± 0.023
1.498MetAsp: 1.498 ± 0.063
1.938MetGlu: 1.938 ± 0.079
0.936MetPhe: 0.936 ± 0.055
1.971MetGly: 1.971 ± 0.075
0.461MetHis: 0.461 ± 0.043
2.041MetIle: 2.041 ± 0.085
2.389MetLys: 2.389 ± 0.082
2.343MetLeu: 2.343 ± 0.078
0.741MetMet: 0.741 ± 0.047
1.281MetAsn: 1.281 ± 0.059
1.298MetPro: 1.298 ± 0.059
0.86MetGln: 0.86 ± 0.05
1.265MetArg: 1.265 ± 0.062
1.585MetSer: 1.585 ± 0.071
1.452MetThr: 1.452 ± 0.057
1.702MetVal: 1.702 ± 0.063
0.179MetTrp: 0.179 ± 0.021
0.562MetTyr: 0.562 ± 0.038
0.0MetXaa: 0.0 ± 0.0
Asn
3.17AsnAla: 3.17 ± 0.099
0.483AsnCys: 0.483 ± 0.035
2.077AsnAsp: 2.077 ± 0.079
2.788AsnGlu: 2.788 ± 0.085
1.935AsnPhe: 1.935 ± 0.078
2.82AsnGly: 2.82 ± 0.095
0.736AsnHis: 0.736 ± 0.046
4.506AsnIle: 4.506 ± 0.131
3.358AsnLys: 3.358 ± 0.107
4.454AsnLeu: 4.454 ± 0.104
1.2AsnMet: 1.2 ± 0.056
2.074AsnAsn: 2.074 ± 0.081
2.34AsnPro: 2.34 ± 0.099
1.444AsnGln: 1.444 ± 0.066
2.115AsnArg: 2.115 ± 0.081
2.291AsnSer: 2.291 ± 0.079
1.984AsnThr: 1.984 ± 0.074
3.103AsnVal: 3.103 ± 0.092
0.44AsnTrp: 0.44 ± 0.034
1.778AsnTyr: 1.778 ± 0.083
0.0AsnXaa: 0.0 ± 0.0
Pro
3.192ProAla: 3.192 ± 0.117
0.38ProCys: 0.38 ± 0.03
2.818ProAsp: 2.818 ± 0.097
3.198ProGlu: 3.198 ± 0.087
1.884ProPhe: 1.884 ± 0.068
2.698ProGly: 2.698 ± 0.092
0.738ProHis: 0.738 ± 0.042
2.12ProIle: 2.12 ± 0.086
2.109ProLys: 2.109 ± 0.076
3.789ProLeu: 3.789 ± 0.093
0.874ProMet: 0.874 ± 0.044
1.203ProAsn: 1.203 ± 0.063
1.439ProPro: 1.439 ± 0.067
1.528ProGln: 1.528 ± 0.059
1.569ProArg: 1.569 ± 0.073
2.218ProSer: 2.218 ± 0.077
1.707ProThr: 1.707 ± 0.07
3.312ProVal: 3.312 ± 0.088
0.413ProTrp: 0.413 ± 0.03
1.39ProTyr: 1.39 ± 0.055
0.0ProXaa: 0.0 ± 0.0
Gln
2.454GlnAla: 2.454 ± 0.091
0.339GlnCys: 0.339 ± 0.036
1.485GlnAsp: 1.485 ± 0.067
2.158GlnGlu: 2.158 ± 0.082
1.186GlnPhe: 1.186 ± 0.051
2.014GlnGly: 2.014 ± 0.071
0.518GlnHis: 0.518 ± 0.041
2.641GlnIle: 2.641 ± 0.091
2.951GlnLys: 2.951 ± 0.087
2.796GlnLeu: 2.796 ± 0.104
0.95GlnMet: 0.95 ± 0.05
1.574GlnAsn: 1.574 ± 0.063
1.01GlnPro: 1.01 ± 0.056
1.197GlnGln: 1.197 ± 0.065
1.702GlnArg: 1.702 ± 0.06
1.895GlnSer: 1.895 ± 0.075
1.726GlnThr: 1.726 ± 0.079
1.925GlnVal: 1.925 ± 0.069
0.391GlnTrp: 0.391 ± 0.036
1.091GlnTyr: 1.091 ± 0.058
0.0GlnXaa: 0.0 ± 0.0
Arg
3.062ArgAla: 3.062 ± 0.097
0.551ArgCys: 0.551 ± 0.041
2.552ArgAsp: 2.552 ± 0.086
3.548ArgGlu: 3.548 ± 0.099
2.177ArgPhe: 2.177 ± 0.08
2.97ArgGly: 2.97 ± 0.092
0.999ArgHis: 0.999 ± 0.049
3.952ArgIle: 3.952 ± 0.105
3.895ArgLys: 3.895 ± 0.091
4.829ArgLeu: 4.829 ± 0.129
1.493ArgMet: 1.493 ± 0.056
2.313ArgAsn: 2.313 ± 0.071
1.705ArgPro: 1.705 ± 0.071
1.887ArgGln: 1.887 ± 0.081
2.72ArgArg: 2.72 ± 0.102
2.617ArgSer: 2.617 ± 0.08
2.172ArgThr: 2.172 ± 0.067
2.907ArgVal: 2.907 ± 0.092
0.546ArgTrp: 0.546 ± 0.042
1.748ArgTyr: 1.748 ± 0.077
0.0ArgXaa: 0.0 ± 0.0
Ser
4.435SerAla: 4.435 ± 0.109
0.793SerCys: 0.793 ± 0.048
3.062SerAsp: 3.062 ± 0.098
3.434SerGlu: 3.434 ± 0.1
3.206SerPhe: 3.206 ± 0.101
5.065SerGly: 5.065 ± 0.14
1.159SerHis: 1.159 ± 0.051
4.609SerIle: 4.609 ± 0.125
3.534SerLys: 3.534 ± 0.103
6.333SerLeu: 6.333 ± 0.153
1.591SerMet: 1.591 ± 0.06
2.31SerAsn: 2.31 ± 0.082
2.386SerPro: 2.386 ± 0.075
1.805SerGln: 1.805 ± 0.072
3.168SerArg: 3.168 ± 0.097
3.977SerSer: 3.977 ± 0.114
2.872SerThr: 2.872 ± 0.096
3.979SerVal: 3.979 ± 0.109
0.703SerTrp: 0.703 ± 0.045
2.02SerTyr: 2.02 ± 0.075
0.0SerXaa: 0.0 ± 0.0
Thr
4.14ThrAla: 4.14 ± 0.116
0.51ThrCys: 0.51 ± 0.038
2.747ThrAsp: 2.747 ± 0.088
2.788ThrGlu: 2.788 ± 0.081
2.115ThrPhe: 2.115 ± 0.078
4.558ThrGly: 4.558 ± 0.129
0.969ThrHis: 0.969 ± 0.051
4.134ThrIle: 4.134 ± 0.123
3.135ThrLys: 3.135 ± 0.075
4.623ThrLeu: 4.623 ± 0.117
1.338ThrMet: 1.338 ± 0.061
2.123ThrAsn: 2.123 ± 0.082
2.318ThrPro: 2.318 ± 0.081
1.615ThrGln: 1.615 ± 0.065
2.109ThrArg: 2.109 ± 0.077
2.991ThrSer: 2.991 ± 0.094
2.747ThrThr: 2.747 ± 0.108
3.705ThrVal: 3.705 ± 0.135
0.475ThrTrp: 0.475 ± 0.035
1.629ThrTyr: 1.629 ± 0.072
0.0ThrXaa: 0.0 ± 0.0
Val
4.989ValAla: 4.989 ± 0.117
0.817ValCys: 0.817 ± 0.052
3.534ValAsp: 3.534 ± 0.104
4.172ValGlu: 4.172 ± 0.105
3.035ValPhe: 3.035 ± 0.1
4.001ValGly: 4.001 ± 0.101
1.137ValHis: 1.137 ± 0.053
5.347ValIle: 5.347 ± 0.128
4.587ValLys: 4.587 ± 0.127
6.222ValLeu: 6.222 ± 0.138
1.813ValMet: 1.813 ± 0.077
2.883ValAsn: 2.883 ± 0.105
2.717ValPro: 2.717 ± 0.094
1.925ValGln: 1.925 ± 0.073
3.124ValArg: 3.124 ± 0.09
4.43ValSer: 4.43 ± 0.117
3.485ValThr: 3.485 ± 0.123
4.623ValVal: 4.623 ± 0.134
0.668ValTrp: 0.668 ± 0.042
1.916ValTyr: 1.916 ± 0.075
0.0ValXaa: 0.0 ± 0.0
Trp
0.676TrpAla: 0.676 ± 0.047
0.103TrpCys: 0.103 ± 0.017
0.546TrpAsp: 0.546 ± 0.039
0.646TrpGlu: 0.646 ± 0.042
0.494TrpPhe: 0.494 ± 0.037
0.684TrpGly: 0.684 ± 0.045
0.242TrpHis: 0.242 ± 0.023
0.763TrpIle: 0.763 ± 0.048
0.689TrpLys: 0.689 ± 0.042
1.059TrpLeu: 1.059 ± 0.06
0.255TrpMet: 0.255 ± 0.028
0.51TrpAsn: 0.51 ± 0.043
0.41TrpPro: 0.41 ± 0.032
0.491TrpGln: 0.491 ± 0.036
0.521TrpArg: 0.521 ± 0.037
0.54TrpSer: 0.54 ± 0.042
0.47TrpThr: 0.47 ± 0.034
0.551TrpVal: 0.551 ± 0.043
0.155TrpTrp: 0.155 ± 0.021
0.32TrpTyr: 0.32 ± 0.032
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.299TyrAla: 2.299 ± 0.083
0.505TyrCys: 0.505 ± 0.04
1.83TyrAsp: 1.83 ± 0.081
1.968TyrGlu: 1.968 ± 0.085
1.805TyrPhe: 1.805 ± 0.083
2.353TyrGly: 2.353 ± 0.084
0.744TyrHis: 0.744 ± 0.045
2.28TyrIle: 2.28 ± 0.073
2.215TyrLys: 2.215 ± 0.08
3.665TyrLeu: 3.665 ± 0.096
0.67TyrMet: 0.67 ± 0.042
1.477TyrAsn: 1.477 ± 0.077
1.436TyrPro: 1.436 ± 0.063
1.056TyrGln: 1.056 ± 0.057
1.93TyrArg: 1.93 ± 0.077
2.161TyrSer: 2.161 ± 0.081
1.515TyrThr: 1.515 ± 0.076
1.849TyrVal: 1.849 ± 0.068
0.426TyrTrp: 0.426 ± 0.039
1.393TyrTyr: 1.393 ± 0.068
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1343 proteins (368400 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski