Amino acid dipepetide frequency for Candidatus Pandoraea novymonadis

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.998AlaAla: 7.998 ± 0.206
1.154AlaCys: 1.154 ± 0.071
4.117AlaAsp: 4.117 ± 0.122
4.718AlaGlu: 4.718 ± 0.14
3.441AlaPhe: 3.441 ± 0.112
6.13AlaGly: 6.13 ± 0.154
2.11AlaHis: 2.11 ± 0.079
5.968AlaIle: 5.968 ± 0.144
4.514AlaLys: 4.514 ± 0.142
9.914AlaLeu: 9.914 ± 0.175
2.517AlaMet: 2.517 ± 0.088
3.18AlaAsn: 3.18 ± 0.11
2.973AlaPro: 2.973 ± 0.099
3.529AlaGln: 3.529 ± 0.114
5.535AlaArg: 5.535 ± 0.165
4.944AlaSer: 4.944 ± 0.125
4.34AlaThr: 4.34 ± 0.112
6.208AlaVal: 6.208 ± 0.172
1.031AlaTrp: 1.031 ± 0.055
2.427AlaTyr: 2.427 ± 0.083
0.0AlaXaa: 0.0 ± 0.0
Cys
1.063CysAla: 1.063 ± 0.061
0.139CysCys: 0.139 ± 0.022
0.672CysAsp: 0.672 ± 0.044
0.73CysGlu: 0.73 ± 0.056
0.504CysPhe: 0.504 ± 0.038
1.063CysGly: 1.063 ± 0.063
0.304CysHis: 0.304 ± 0.029
0.711CysIle: 0.711 ± 0.053
0.333CysLys: 0.333 ± 0.035
1.066CysLeu: 1.066 ± 0.059
0.301CysMet: 0.301 ± 0.027
0.433CysAsn: 0.433 ± 0.039
0.575CysPro: 0.575 ± 0.052
0.394CysGln: 0.394 ± 0.036
0.617CysArg: 0.617 ± 0.047
0.734CysSer: 0.734 ± 0.047
0.604CysThr: 0.604 ± 0.042
0.905CysVal: 0.905 ± 0.055
0.136CysTrp: 0.136 ± 0.022
0.339CysTyr: 0.339 ± 0.033
0.0CysXaa: 0.0 ± 0.0
Asp
4.976AspAla: 4.976 ± 0.145
0.679AspCys: 0.679 ± 0.05
2.569AspAsp: 2.569 ± 0.099
3.134AspGlu: 3.134 ± 0.114
2.462AspPhe: 2.462 ± 0.083
3.777AspGly: 3.777 ± 0.12
1.086AspHis: 1.086 ± 0.064
3.777AspIle: 3.777 ± 0.12
2.281AspLys: 2.281 ± 0.093
5.245AspLeu: 5.245 ± 0.139
1.322AspMet: 1.322 ± 0.07
1.69AspAsn: 1.69 ± 0.075
2.243AspPro: 2.243 ± 0.084
1.661AspGln: 1.661 ± 0.076
2.782AspArg: 2.782 ± 0.118
2.944AspSer: 2.944 ± 0.106
2.708AspThr: 2.708 ± 0.095
4.236AspVal: 4.236 ± 0.126
0.83AspTrp: 0.83 ± 0.045
1.57AspTyr: 1.57 ± 0.076
0.0AspXaa: 0.0 ± 0.0
Glu
5.371GluAla: 5.371 ± 0.173
0.608GluCys: 0.608 ± 0.047
2.537GluAsp: 2.537 ± 0.091
3.063GluGlu: 3.063 ± 0.127
2.22GluPhe: 2.22 ± 0.104
3.629GluGly: 3.629 ± 0.103
1.331GluHis: 1.331 ± 0.066
4.634GluIle: 4.634 ± 0.124
3.648GluLys: 3.648 ± 0.112
5.59GluLeu: 5.59 ± 0.139
1.768GluMet: 1.768 ± 0.076
2.462GluAsn: 2.462 ± 0.096
1.758GluPro: 1.758 ± 0.082
2.32GluGln: 2.32 ± 0.09
4.149GluArg: 4.149 ± 0.138
3.118GluSer: 3.118 ± 0.1
3.031GluThr: 3.031 ± 0.115
4.414GluVal: 4.414 ± 0.137
0.611GluTrp: 0.611 ± 0.039
1.577GluTyr: 1.577 ± 0.076
0.0GluXaa: 0.0 ± 0.0
Phe
3.335PheAla: 3.335 ± 0.112
0.543PheCys: 0.543 ± 0.048
2.737PheAsp: 2.737 ± 0.094
2.508PheGlu: 2.508 ± 0.103
2.071PhePhe: 2.071 ± 0.103
3.357PheGly: 3.357 ± 0.099
0.966PheHis: 0.966 ± 0.053
2.692PheIle: 2.692 ± 0.125
1.551PheLys: 1.551 ± 0.073
4.243PheLeu: 4.243 ± 0.133
0.956PheMet: 0.956 ± 0.055
1.541PheAsn: 1.541 ± 0.081
1.535PhePro: 1.535 ± 0.072
1.215PheGln: 1.215 ± 0.064
2.155PheArg: 2.155 ± 0.079
3.429PheSer: 3.429 ± 0.121
2.026PheThr: 2.026 ± 0.077
2.899PheVal: 2.899 ± 0.108
0.511PheTrp: 0.511 ± 0.042
1.102PheTyr: 1.102 ± 0.066
0.0PheXaa: 0.0 ± 0.0
Gly
5.606GlyAla: 5.606 ± 0.133
1.044GlyCys: 1.044 ± 0.056
3.661GlyAsp: 3.661 ± 0.121
4.065GlyGlu: 4.065 ± 0.122
3.26GlyPhe: 3.26 ± 0.118
5.138GlyGly: 5.138 ± 0.132
1.978GlyHis: 1.978 ± 0.085
5.477GlyIle: 5.477 ± 0.141
4.065GlyLys: 4.065 ± 0.115
7.219GlyLeu: 7.219 ± 0.175
2.285GlyMet: 2.285 ± 0.103
2.556GlyAsn: 2.556 ± 0.102
2.007GlyPro: 2.007 ± 0.093
2.666GlyGln: 2.666 ± 0.102
4.33GlyArg: 4.33 ± 0.132
4.023GlySer: 4.023 ± 0.126
3.706GlyThr: 3.706 ± 0.108
5.522GlyVal: 5.522 ± 0.158
0.944GlyTrp: 0.944 ± 0.063
2.091GlyTyr: 2.091 ± 0.098
0.0GlyXaa: 0.0 ± 0.0
His
2.32HisAla: 2.32 ± 0.082
0.326HisCys: 0.326 ± 0.034
1.144HisAsp: 1.144 ± 0.068
1.192HisGlu: 1.192 ± 0.057
1.041HisPhe: 1.041 ± 0.058
2.029HisGly: 2.029 ± 0.078
0.83HisHis: 0.83 ± 0.065
1.748HisIle: 1.748 ± 0.077
0.882HisLys: 0.882 ± 0.054
2.605HisLeu: 2.605 ± 0.084
0.624HisMet: 0.624 ± 0.045
0.889HisAsn: 0.889 ± 0.063
1.444HisPro: 1.444 ± 0.081
0.927HisGln: 0.927 ± 0.05
1.393HisArg: 1.393 ± 0.056
1.499HisSer: 1.499 ± 0.066
1.221HisThr: 1.221 ± 0.063
1.664HisVal: 1.664 ± 0.075
0.339HisTrp: 0.339 ± 0.033
0.85HisTyr: 0.85 ± 0.054
0.0HisXaa: 0.0 ± 0.0
Ile
6.708IleAla: 6.708 ± 0.175
0.798IleCys: 0.798 ± 0.052
4.763IleAsp: 4.763 ± 0.128
4.96IleGlu: 4.96 ± 0.149
2.669IlePhe: 2.669 ± 0.103
5.59IleGly: 5.59 ± 0.148
1.629IleHis: 1.629 ± 0.071
3.929IleIle: 3.929 ± 0.131
3.073IleLys: 3.073 ± 0.113
6.615IleLeu: 6.615 ± 0.159
1.415IleMet: 1.415 ± 0.072
2.653IleAsn: 2.653 ± 0.087
2.827IlePro: 2.827 ± 0.089
2.097IleGln: 2.097 ± 0.079
3.461IleArg: 3.461 ± 0.107
4.692IleSer: 4.692 ± 0.169
3.855IleThr: 3.855 ± 0.115
5.015IleVal: 5.015 ± 0.139
0.65IleTrp: 0.65 ± 0.047
1.813IleTyr: 1.813 ± 0.081
0.0IleXaa: 0.0 ± 0.0
Lys
3.82LysAla: 3.82 ± 0.139
0.414LysCys: 0.414 ± 0.037
2.133LysAsp: 2.133 ± 0.092
2.737LysGlu: 2.737 ± 0.104
1.755LysPhe: 1.755 ± 0.089
2.824LysGly: 2.824 ± 0.103
1.209LysHis: 1.209 ± 0.056
3.933LysIle: 3.933 ± 0.104
3.109LysLys: 3.109 ± 0.103
5.122LysLeu: 5.122 ± 0.125
1.441LysMet: 1.441 ± 0.075
2.194LysAsn: 2.194 ± 0.094
2.007LysPro: 2.007 ± 0.089
1.797LysGln: 1.797 ± 0.088
3.202LysArg: 3.202 ± 0.09
2.918LysSer: 2.918 ± 0.103
2.934LysThr: 2.934 ± 0.091
3.189LysVal: 3.189 ± 0.103
0.459LysTrp: 0.459 ± 0.04
1.276LysTyr: 1.276 ± 0.066
0.0LysXaa: 0.0 ± 0.0
Leu
9.982LeuAla: 9.982 ± 0.195
1.335LeuCys: 1.335 ± 0.067
5.671LeuAsp: 5.671 ± 0.138
5.784LeuGlu: 5.784 ± 0.16
3.916LeuPhe: 3.916 ± 0.132
7.342LeuGly: 7.342 ± 0.187
2.608LeuHis: 2.608 ± 0.103
6.799LeuIle: 6.799 ± 0.174
5.138LeuLys: 5.138 ± 0.139
10.903LeuLeu: 10.903 ± 0.289
2.85LeuMet: 2.85 ± 0.098
3.939LeuAsn: 3.939 ± 0.118
5.371LeuPro: 5.371 ± 0.141
3.609LeuGln: 3.609 ± 0.101
6.718LeuArg: 6.718 ± 0.162
7.765LeuSer: 7.765 ± 0.159
5.975LeuThr: 5.975 ± 0.147
6.65LeuVal: 6.65 ± 0.149
1.089LeuTrp: 1.089 ± 0.06
2.495LeuTyr: 2.495 ± 0.096
0.0LeuXaa: 0.0 ± 0.0
Met
2.333MetAla: 2.333 ± 0.087
0.281MetCys: 0.281 ± 0.031
1.147MetAsp: 1.147 ± 0.065
1.396MetGlu: 1.396 ± 0.075
0.931MetPhe: 0.931 ± 0.059
1.693MetGly: 1.693 ± 0.081
0.62MetHis: 0.62 ± 0.046
1.638MetIle: 1.638 ± 0.084
1.486MetLys: 1.486 ± 0.077
3.18MetLeu: 3.18 ± 0.107
0.83MetMet: 0.83 ± 0.054
1.115MetAsn: 1.115 ± 0.056
1.477MetPro: 1.477 ± 0.07
1.005MetGln: 1.005 ± 0.051
2.0MetArg: 2.0 ± 0.092
1.861MetSer: 1.861 ± 0.085
1.793MetThr: 1.793 ± 0.086
1.69MetVal: 1.69 ± 0.076
0.194MetTrp: 0.194 ± 0.028
0.562MetTyr: 0.562 ± 0.052
0.0MetXaa: 0.0 ± 0.0
Asn
3.228AsnAla: 3.228 ± 0.102
0.456AsnCys: 0.456 ± 0.036
1.8AsnAsp: 1.8 ± 0.079
2.013AsnGlu: 2.013 ± 0.091
1.865AsnPhe: 1.865 ± 0.072
2.727AsnGly: 2.727 ± 0.102
0.895AsnHis: 0.895 ± 0.057
2.689AsnIle: 2.689 ± 0.112
1.548AsnLys: 1.548 ± 0.081
3.871AsnLeu: 3.871 ± 0.119
0.863AsnMet: 0.863 ± 0.049
1.486AsnAsn: 1.486 ± 0.077
1.955AsnPro: 1.955 ± 0.077
1.318AsnGln: 1.318 ± 0.071
2.168AsnArg: 2.168 ± 0.077
2.136AsnSer: 2.136 ± 0.092
2.052AsnThr: 2.052 ± 0.083
2.815AsnVal: 2.815 ± 0.094
0.546AsnTrp: 0.546 ± 0.041
1.225AsnTyr: 1.225 ± 0.062
0.0AsnXaa: 0.0 ± 0.0
Pro
2.944ProAla: 2.944 ± 0.091
0.397ProCys: 0.397 ± 0.038
2.446ProAsp: 2.446 ± 0.085
2.815ProGlu: 2.815 ± 0.1
1.764ProPhe: 1.764 ± 0.074
2.976ProGly: 2.976 ± 0.091
1.066ProHis: 1.066 ± 0.061
2.869ProIle: 2.869 ± 0.095
1.884ProLys: 1.884 ± 0.074
4.327ProLeu: 4.327 ± 0.105
1.144ProMet: 1.144 ± 0.061
1.874ProAsn: 1.874 ± 0.077
1.389ProPro: 1.389 ± 0.067
1.509ProGln: 1.509 ± 0.072
2.052ProArg: 2.052 ± 0.085
2.689ProSer: 2.689 ± 0.088
2.197ProThr: 2.197 ± 0.092
3.38ProVal: 3.38 ± 0.106
0.511ProTrp: 0.511 ± 0.04
1.244ProTyr: 1.244 ± 0.072
0.0ProXaa: 0.0 ± 0.0
Gln
3.277GlnAla: 3.277 ± 0.118
0.41GlnCys: 0.41 ± 0.032
1.587GlnAsp: 1.587 ± 0.064
1.981GlnGlu: 1.981 ± 0.086
1.364GlnPhe: 1.364 ± 0.064
2.265GlnGly: 2.265 ± 0.083
0.944GlnHis: 0.944 ± 0.056
2.647GlnIle: 2.647 ± 0.091
2.02GlnLys: 2.02 ± 0.079
3.881GlnLeu: 3.881 ± 0.119
1.108GlnMet: 1.108 ± 0.063
1.234GlnAsn: 1.234 ± 0.063
1.328GlnPro: 1.328 ± 0.064
1.396GlnGln: 1.396 ± 0.076
2.301GlnArg: 2.301 ± 0.1
2.123GlnSer: 2.123 ± 0.083
1.894GlnThr: 1.894 ± 0.079
2.731GlnVal: 2.731 ± 0.104
0.501GlnTrp: 0.501 ± 0.039
1.024GlnTyr: 1.024 ± 0.066
0.0GlnXaa: 0.0 ± 0.0
Arg
5.028ArgAla: 5.028 ± 0.129
0.604ArgCys: 0.604 ± 0.046
3.458ArgAsp: 3.458 ± 0.108
3.978ArgGlu: 3.978 ± 0.134
2.769ArgPhe: 2.769 ± 0.1
3.923ArgGly: 3.923 ± 0.109
1.554ArgHis: 1.554 ± 0.066
4.353ArgIle: 4.353 ± 0.11
2.643ArgLys: 2.643 ± 0.089
6.563ArgLeu: 6.563 ± 0.171
1.729ArgMet: 1.729 ± 0.076
2.268ArgAsn: 2.268 ± 0.097
2.184ArgPro: 2.184 ± 0.082
2.394ArgGln: 2.394 ± 0.104
3.991ArgArg: 3.991 ± 0.129
3.167ArgSer: 3.167 ± 0.107
2.857ArgThr: 2.857 ± 0.096
4.537ArgVal: 4.537 ± 0.137
0.672ArgTrp: 0.672 ± 0.052
2.123ArgTyr: 2.123 ± 0.073
0.0ArgXaa: 0.0 ± 0.0
Ser
4.947SerAla: 4.947 ± 0.131
0.582SerCys: 0.582 ± 0.044
3.154SerAsp: 3.154 ± 0.093
3.606SerGlu: 3.606 ± 0.112
2.517SerPhe: 2.517 ± 0.116
4.96SerGly: 4.96 ± 0.127
1.545SerHis: 1.545 ± 0.07
4.301SerIle: 4.301 ± 0.115
2.74SerLys: 2.74 ± 0.104
6.641SerLeu: 6.641 ± 0.168
1.907SerMet: 1.907 ± 0.089
2.394SerAsn: 2.394 ± 0.108
2.43SerPro: 2.43 ± 0.11
2.117SerGln: 2.117 ± 0.085
3.723SerArg: 3.723 ± 0.109
4.133SerSer: 4.133 ± 0.148
3.506SerThr: 3.506 ± 0.121
4.618SerVal: 4.618 ± 0.14
0.685SerTrp: 0.685 ± 0.049
1.616SerTyr: 1.616 ± 0.074
0.0SerXaa: 0.0 ± 0.0
Thr
4.324ThrAla: 4.324 ± 0.11
0.511ThrCys: 0.511 ± 0.044
2.634ThrAsp: 2.634 ± 0.104
2.84ThrGlu: 2.84 ± 0.113
2.146ThrPhe: 2.146 ± 0.069
4.262ThrGly: 4.262 ± 0.119
1.441ThrHis: 1.441 ± 0.07
3.561ThrIle: 3.561 ± 0.107
2.268ThrLys: 2.268 ± 0.092
6.485ThrLeu: 6.485 ± 0.147
1.231ThrMet: 1.231 ± 0.06
1.887ThrAsn: 1.887 ± 0.071
2.798ThrPro: 2.798 ± 0.102
1.991ThrGln: 1.991 ± 0.087
3.344ThrArg: 3.344 ± 0.109
3.17ThrSer: 3.17 ± 0.103
3.005ThrThr: 3.005 ± 0.12
3.823ThrVal: 3.823 ± 0.115
0.511ThrTrp: 0.511 ± 0.044
1.27ThrTyr: 1.27 ± 0.072
0.0ThrXaa: 0.0 ± 0.0
Val
6.308ValAla: 6.308 ± 0.173
0.818ValCys: 0.818 ± 0.054
3.894ValAsp: 3.894 ± 0.119
4.188ValGlu: 4.188 ± 0.138
2.789ValPhe: 2.789 ± 0.097
5.186ValGly: 5.186 ± 0.147
1.738ValHis: 1.738 ± 0.082
5.047ValIle: 5.047 ± 0.142
3.6ValLys: 3.6 ± 0.123
7.927ValLeu: 7.927 ± 0.177
1.971ValMet: 1.971 ± 0.083
2.595ValAsn: 2.595 ± 0.096
3.5ValPro: 3.5 ± 0.12
2.417ValGln: 2.417 ± 0.095
4.181ValArg: 4.181 ± 0.129
4.511ValSer: 4.511 ± 0.108
3.936ValThr: 3.936 ± 0.123
5.468ValVal: 5.468 ± 0.176
0.785ValTrp: 0.785 ± 0.044
1.642ValTyr: 1.642 ± 0.069
0.0ValXaa: 0.0 ± 0.0
Trp
0.737TrpAla: 0.737 ± 0.048
0.187TrpCys: 0.187 ± 0.026
0.439TrpAsp: 0.439 ± 0.04
0.562TrpGlu: 0.562 ± 0.044
0.53TrpPhe: 0.53 ± 0.04
0.598TrpGly: 0.598 ± 0.043
0.362TrpHis: 0.362 ± 0.044
0.976TrpIle: 0.976 ± 0.056
0.559TrpLys: 0.559 ± 0.042
1.684TrpLeu: 1.684 ± 0.097
0.362TrpMet: 0.362 ± 0.04
0.365TrpAsn: 0.365 ± 0.039
0.462TrpPro: 0.462 ± 0.043
0.585TrpGln: 0.585 ± 0.042
0.83TrpArg: 0.83 ± 0.056
0.556TrpSer: 0.556 ± 0.044
0.472TrpThr: 0.472 ± 0.038
0.795TrpVal: 0.795 ± 0.061
0.132TrpTrp: 0.132 ± 0.02
0.313TrpTyr: 0.313 ± 0.037
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.443TyrAla: 2.443 ± 0.082
0.339TyrCys: 0.339 ± 0.035
1.412TyrAsp: 1.412 ± 0.073
1.567TyrGlu: 1.567 ± 0.071
1.357TyrPhe: 1.357 ± 0.066
2.171TyrGly: 2.171 ± 0.079
0.753TyrHis: 0.753 ± 0.055
1.464TyrIle: 1.464 ± 0.07
1.128TyrLys: 1.128 ± 0.071
2.86TyrLeu: 2.86 ± 0.098
0.608TyrMet: 0.608 ± 0.048
0.902TyrAsn: 0.902 ± 0.053
1.215TyrPro: 1.215 ± 0.073
1.066TyrGln: 1.066 ± 0.065
1.861TyrArg: 1.861 ± 0.086
1.709TyrSer: 1.709 ± 0.078
1.389TyrThr: 1.389 ± 0.057
1.942TyrVal: 1.942 ± 0.082
0.372TyrTrp: 0.372 ± 0.037
0.847TyrTyr: 0.847 ± 0.059
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 967 proteins (309465 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski