Amino acid dipepetide frequency for Buchnera aphidicola (Sarucallis kahawaluokalani)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.59AlaAla: 2.59 ± 0.171
0.661AlaCys: 0.661 ± 0.082
1.355AlaAsp: 1.355 ± 0.099
1.801AlaGlu: 1.801 ± 0.124
1.578AlaPhe: 1.578 ± 0.118
2.478AlaGly: 2.478 ± 0.164
0.893AlaHis: 0.893 ± 0.076
5.403AlaIle: 5.403 ± 0.247
3.594AlaLys: 3.594 ± 0.171
4.495AlaLeu: 4.495 ± 0.212
1.227AlaMet: 1.227 ± 0.112
2.311AlaAsn: 2.311 ± 0.144
1.084AlaPro: 1.084 ± 0.101
1.578AlaGln: 1.578 ± 0.096
1.913AlaArg: 1.913 ± 0.136
2.271AlaSer: 2.271 ± 0.145
2.032AlaThr: 2.032 ± 0.144
2.239AlaVal: 2.239 ± 0.132
0.303AlaTrp: 0.303 ± 0.045
1.618AlaTyr: 1.618 ± 0.113
0.0AlaXaa: 0.0 ± 0.0
Cys
0.566CysAla: 0.566 ± 0.065
0.239CysCys: 0.239 ± 0.043
0.638CysAsp: 0.638 ± 0.067
0.47CysGlu: 0.47 ± 0.06
0.709CysPhe: 0.709 ± 0.077
1.004CysGly: 1.004 ± 0.093
0.335CysHis: 0.335 ± 0.047
2.016CysIle: 2.016 ± 0.129
1.132CysLys: 1.132 ± 0.091
1.275CysLeu: 1.275 ± 0.101
0.303CysMet: 0.303 ± 0.051
1.084CysAsn: 1.084 ± 0.105
0.422CysPro: 0.422 ± 0.064
0.478CysGln: 0.478 ± 0.073
0.367CysArg: 0.367 ± 0.054
0.94CysSer: 0.94 ± 0.078
0.693CysThr: 0.693 ± 0.078
0.582CysVal: 0.582 ± 0.072
0.128CysTrp: 0.128 ± 0.034
0.494CysTyr: 0.494 ± 0.063
0.0CysXaa: 0.0 ± 0.0
Asp
1.945AspAla: 1.945 ± 0.136
0.534AspCys: 0.534 ± 0.057
1.347AspAsp: 1.347 ± 0.106
1.594AspGlu: 1.594 ± 0.123
2.208AspPhe: 2.208 ± 0.138
1.976AspGly: 1.976 ± 0.144
1.028AspHis: 1.028 ± 0.098
6.638AspIle: 6.638 ± 0.243
2.845AspLys: 2.845 ± 0.173
3.873AspLeu: 3.873 ± 0.183
0.972AspMet: 0.972 ± 0.104
2.271AspAsn: 2.271 ± 0.129
1.219AspPro: 1.219 ± 0.112
1.658AspGln: 1.658 ± 0.108
1.355AspArg: 1.355 ± 0.114
1.905AspSer: 1.905 ± 0.1
2.208AspThr: 2.208 ± 0.148
2.646AspVal: 2.646 ± 0.148
0.406AspTrp: 0.406 ± 0.056
1.642AspTyr: 1.642 ± 0.123
0.0AspXaa: 0.0 ± 0.0
Glu
1.857GluAla: 1.857 ± 0.141
0.51GluCys: 0.51 ± 0.071
1.458GluAsp: 1.458 ± 0.117
1.889GluGlu: 1.889 ± 0.141
1.578GluPhe: 1.578 ± 0.111
1.729GluGly: 1.729 ± 0.112
0.964GluHis: 0.964 ± 0.088
5.794GluIle: 5.794 ± 0.241
5.355GluLys: 5.355 ± 0.243
4.224GluLeu: 4.224 ± 0.209
1.02GluMet: 1.02 ± 0.079
3.044GluAsn: 3.044 ± 0.164
0.972GluPro: 0.972 ± 0.092
1.45GluGln: 1.45 ± 0.106
1.57GluArg: 1.57 ± 0.115
2.391GluSer: 2.391 ± 0.142
1.682GluThr: 1.682 ± 0.13
2.367GluVal: 2.367 ± 0.148
0.319GluTrp: 0.319 ± 0.064
2.16GluTyr: 2.16 ± 0.139
0.0GluXaa: 0.0 ± 0.0
Phe
1.697PheAla: 1.697 ± 0.123
0.916PheCys: 0.916 ± 0.09
2.048PheAsp: 2.048 ± 0.115
1.594PheGlu: 1.594 ± 0.11
3.626PhePhe: 3.626 ± 0.209
2.893PheGly: 2.893 ± 0.158
1.243PheHis: 1.243 ± 0.104
5.403PheIle: 5.403 ± 0.289
3.61PheLys: 3.61 ± 0.171
5.475PheLeu: 5.475 ± 0.26
1.084PheMet: 1.084 ± 0.091
3.339PheAsn: 3.339 ± 0.16
1.809PhePro: 1.809 ± 0.115
1.945PheGln: 1.945 ± 0.132
1.554PheArg: 1.554 ± 0.131
3.961PheSer: 3.961 ± 0.18
2.239PheThr: 2.239 ± 0.154
1.976PheVal: 1.976 ± 0.151
0.558PheTrp: 0.558 ± 0.066
2.486PheTyr: 2.486 ± 0.156
0.0PheXaa: 0.0 ± 0.0
Gly
2.582GlyAla: 2.582 ± 0.159
1.068GlyCys: 1.068 ± 0.094
2.287GlyAsp: 2.287 ± 0.142
2.16GlyGlu: 2.16 ± 0.153
2.455GlyPhe: 2.455 ± 0.151
3.379GlyGly: 3.379 ± 0.212
1.267GlyHis: 1.267 ± 0.115
6.822GlyIle: 6.822 ± 0.252
4.527GlyLys: 4.527 ± 0.235
4.415GlyLeu: 4.415 ± 0.204
1.546GlyMet: 1.546 ± 0.119
2.694GlyAsn: 2.694 ± 0.135
1.371GlyPro: 1.371 ± 0.094
1.586GlyGln: 1.586 ± 0.123
2.152GlyArg: 2.152 ± 0.185
3.132GlySer: 3.132 ± 0.166
2.996GlyThr: 2.996 ± 0.167
2.973GlyVal: 2.973 ± 0.167
0.494GlyTrp: 0.494 ± 0.069
2.176GlyTyr: 2.176 ± 0.128
0.0GlyXaa: 0.0 ± 0.0
His
1.546HisAla: 1.546 ± 0.105
0.343HisCys: 0.343 ± 0.051
1.132HisAsp: 1.132 ± 0.092
0.909HisGlu: 0.909 ± 0.068
1.02HisPhe: 1.02 ± 0.083
1.546HisGly: 1.546 ± 0.123
0.789HisHis: 0.789 ± 0.085
3.148HisIle: 3.148 ± 0.142
1.929HisLys: 1.929 ± 0.131
1.905HisLeu: 1.905 ± 0.13
0.526HisMet: 0.526 ± 0.061
1.705HisAsn: 1.705 ± 0.127
1.203HisPro: 1.203 ± 0.106
1.331HisGln: 1.331 ± 0.108
1.124HisArg: 1.124 ± 0.093
1.434HisSer: 1.434 ± 0.111
1.466HisThr: 1.466 ± 0.104
1.395HisVal: 1.395 ± 0.111
0.271HisTrp: 0.271 ± 0.053
0.845HisTyr: 0.845 ± 0.09
0.0HisXaa: 0.0 ± 0.0
Ile
5.547IleAla: 5.547 ± 0.266
1.586IleCys: 1.586 ± 0.118
5.913IleAsp: 5.913 ± 0.219
5.658IleGlu: 5.658 ± 0.232
7.077IlePhe: 7.077 ± 0.299
6.543IleGly: 6.543 ± 0.24
3.61IleHis: 3.61 ± 0.172
15.198IleIle: 15.198 ± 0.464
11.763IleLys: 11.763 ± 0.395
13.699IleLeu: 13.699 ± 0.37
2.63IleMet: 2.63 ± 0.156
9.643IleAsn: 9.643 ± 0.295
4.686IlePro: 4.686 ± 0.189
6.32IleGln: 6.32 ± 0.255
3.738IleArg: 3.738 ± 0.159
9.085IleSer: 9.085 ± 0.294
6.806IleThr: 6.806 ± 0.226
5.387IleVal: 5.387 ± 0.198
1.012IleTrp: 1.012 ± 0.108
4.861IleTyr: 4.861 ± 0.241
0.0IleXaa: 0.0 ± 0.0
Lys
2.064LysAla: 2.064 ± 0.14
1.012LysCys: 1.012 ± 0.103
3.22LysAsp: 3.22 ± 0.158
3.977LysGlu: 3.977 ± 0.224
4.439LysPhe: 4.439 ± 0.201
2.869LysGly: 2.869 ± 0.177
1.937LysHis: 1.937 ± 0.121
15.827LysIle: 15.827 ± 0.447
17.285LysLys: 17.285 ± 0.588
7.77LysLeu: 7.77 ± 0.25
2.319LysMet: 2.319 ± 0.133
12.185LysAsn: 12.185 ± 0.351
1.825LysPro: 1.825 ± 0.121
2.757LysGln: 2.757 ± 0.145
2.71LysArg: 2.71 ± 0.153
4.758LysSer: 4.758 ± 0.199
3.849LysThr: 3.849 ± 0.185
3.395LysVal: 3.395 ± 0.22
0.653LysTrp: 0.653 ± 0.061
5.563LysTyr: 5.563 ± 0.233
0.0LysXaa: 0.0 ± 0.0
Leu
3.467LeuAla: 3.467 ± 0.187
1.291LeuCys: 1.291 ± 0.096
3.889LeuAsp: 3.889 ± 0.202
4.399LeuGlu: 4.399 ± 0.194
4.63LeuPhe: 4.63 ± 0.259
5.124LeuGly: 5.124 ± 0.224
2.861LeuHis: 2.861 ± 0.156
9.213LeuIle: 9.213 ± 0.311
9.46LeuLys: 9.46 ± 0.292
9.173LeuLeu: 9.173 ± 0.378
2.239LeuMet: 2.239 ± 0.162
6.901LeuAsn: 6.901 ± 0.216
3.004LeuPro: 3.004 ± 0.141
3.969LeuGln: 3.969 ± 0.186
3.251LeuArg: 3.251 ± 0.159
7.603LeuSer: 7.603 ± 0.248
4.479LeuThr: 4.479 ± 0.168
3.618LeuVal: 3.618 ± 0.173
0.661LeuTrp: 0.661 ± 0.078
4.288LeuTyr: 4.288 ± 0.192
0.0LeuXaa: 0.0 ± 0.0
Met
0.972MetAla: 0.972 ± 0.086
0.255MetCys: 0.255 ± 0.04
0.781MetAsp: 0.781 ± 0.082
0.829MetGlu: 0.829 ± 0.078
1.044MetPhe: 1.044 ± 0.088
1.307MetGly: 1.307 ± 0.106
0.877MetHis: 0.877 ± 0.09
2.765MetIle: 2.765 ± 0.175
2.654MetLys: 2.654 ± 0.14
2.574MetLeu: 2.574 ± 0.146
0.566MetMet: 0.566 ± 0.082
1.737MetAsn: 1.737 ± 0.113
0.797MetPro: 0.797 ± 0.093
1.243MetGln: 1.243 ± 0.088
0.741MetArg: 0.741 ± 0.077
1.267MetSer: 1.267 ± 0.104
0.877MetThr: 0.877 ± 0.079
1.219MetVal: 1.219 ± 0.1
0.096MetTrp: 0.096 ± 0.027
0.924MetTyr: 0.924 ± 0.085
0.0MetXaa: 0.0 ± 0.0
Asn
2.279AsnAla: 2.279 ± 0.135
0.964AsnCys: 0.964 ± 0.098
2.51AsnAsp: 2.51 ± 0.152
2.359AsnGlu: 2.359 ± 0.135
4.264AsnPhe: 4.264 ± 0.199
2.566AsnGly: 2.566 ± 0.155
1.841AsnHis: 1.841 ± 0.133
14.034AsnIle: 14.034 ± 0.415
7.348AsnLys: 7.348 ± 0.238
6.144AsnLeu: 6.144 ± 0.23
1.841AsnMet: 1.841 ± 0.126
6.662AsnAsn: 6.662 ± 0.286
2.048AsnPro: 2.048 ± 0.145
3.307AsnGln: 3.307 ± 0.158
2.16AsnArg: 2.16 ± 0.147
3.395AsnSer: 3.395 ± 0.166
4.503AsnThr: 4.503 ± 0.195
3.475AsnVal: 3.475 ± 0.167
0.725AsnTrp: 0.725 ± 0.091
3.244AsnTyr: 3.244 ± 0.155
0.0AsnXaa: 0.0 ± 0.0
Pro
1.211ProAla: 1.211 ± 0.101
0.383ProCys: 0.383 ± 0.059
1.148ProAsp: 1.148 ± 0.094
1.825ProGlu: 1.825 ± 0.107
1.243ProPhe: 1.243 ± 0.088
2.152ProGly: 2.152 ± 0.144
0.717ProHis: 0.717 ± 0.07
3.889ProIle: 3.889 ± 0.175
3.028ProLys: 3.028 ± 0.158
2.208ProLeu: 2.208 ± 0.156
0.749ProMet: 0.749 ± 0.074
2.12ProAsn: 2.12 ± 0.137
0.781ProPro: 0.781 ± 0.082
0.693ProGln: 0.693 ± 0.075
0.789ProArg: 0.789 ± 0.068
1.61ProSer: 1.61 ± 0.104
1.586ProThr: 1.586 ± 0.107
1.809ProVal: 1.809 ± 0.114
0.311ProTrp: 0.311 ± 0.058
1.403ProTyr: 1.403 ± 0.107
0.0ProXaa: 0.0 ± 0.0
Gln
1.737GlnAla: 1.737 ± 0.104
0.438GlnCys: 0.438 ± 0.058
1.642GlnAsp: 1.642 ± 0.108
2.064GlnGlu: 2.064 ± 0.154
1.674GlnPhe: 1.674 ± 0.121
1.697GlnGly: 1.697 ± 0.119
0.964GlnHis: 0.964 ± 0.071
3.905GlnIle: 3.905 ± 0.198
5.076GlnLys: 5.076 ± 0.189
3.706GlnLeu: 3.706 ± 0.171
0.749GlnMet: 0.749 ± 0.069
2.71GlnAsn: 2.71 ± 0.15
1.02GlnPro: 1.02 ± 0.09
1.331GlnGln: 1.331 ± 0.131
1.108GlnArg: 1.108 ± 0.106
2.327GlnSer: 2.327 ± 0.139
1.785GlnThr: 1.785 ± 0.122
1.841GlnVal: 1.841 ± 0.118
0.375GlnTrp: 0.375 ± 0.054
1.945GlnTyr: 1.945 ± 0.118
0.0GlnXaa: 0.0 ± 0.0
Arg
1.634ArgAla: 1.634 ± 0.111
0.47ArgCys: 0.47 ± 0.07
1.315ArgAsp: 1.315 ± 0.107
1.53ArgGlu: 1.53 ± 0.122
1.586ArgPhe: 1.586 ± 0.106
1.769ArgGly: 1.769 ± 0.146
0.598ArgHis: 0.598 ± 0.075
4.136ArgIle: 4.136 ± 0.195
3.22ArgLys: 3.22 ± 0.166
2.917ArgLeu: 2.917 ± 0.159
0.948ArgMet: 0.948 ± 0.088
2.654ArgAsn: 2.654 ± 0.151
1.132ArgPro: 1.132 ± 0.104
1.028ArgGln: 1.028 ± 0.086
1.458ArgArg: 1.458 ± 0.119
2.104ArgSer: 2.104 ± 0.143
1.642ArgThr: 1.642 ± 0.122
1.634ArgVal: 1.634 ± 0.144
0.335ArgTrp: 0.335 ± 0.051
1.45ArgTyr: 1.45 ± 0.11
0.0ArgXaa: 0.0 ± 0.0
Ser
3.18SerAla: 3.18 ± 0.171
0.996SerCys: 0.996 ± 0.086
2.773SerAsp: 2.773 ± 0.165
2.805SerGlu: 2.805 ± 0.169
2.909SerPhe: 2.909 ± 0.18
4.535SerGly: 4.535 ± 0.206
1.411SerHis: 1.411 ± 0.119
7.475SerIle: 7.475 ± 0.247
5.387SerLys: 5.387 ± 0.209
5.371SerLeu: 5.371 ± 0.24
1.594SerMet: 1.594 ± 0.097
4.2SerAsn: 4.2 ± 0.188
1.403SerPro: 1.403 ± 0.1
1.769SerGln: 1.769 ± 0.096
2.279SerArg: 2.279 ± 0.144
3.921SerSer: 3.921 ± 0.211
3.06SerThr: 3.06 ± 0.161
3.307SerVal: 3.307 ± 0.158
0.709SerTrp: 0.709 ± 0.07
2.311SerTyr: 2.311 ± 0.123
0.0SerXaa: 0.0 ± 0.0
Thr
2.391ThrAla: 2.391 ± 0.139
0.669ThrCys: 0.669 ± 0.078
2.016ThrAsp: 2.016 ± 0.139
2.287ThrGlu: 2.287 ± 0.144
2.343ThrPhe: 2.343 ± 0.147
3.53ThrGly: 3.53 ± 0.177
1.148ThrHis: 1.148 ± 0.094
6.415ThrIle: 6.415 ± 0.239
4.064ThrLys: 4.064 ± 0.192
4.821ThrLeu: 4.821 ± 0.217
0.988ThrMet: 0.988 ± 0.087
3.204ThrAsn: 3.204 ± 0.156
1.881ThrPro: 1.881 ± 0.13
1.594ThrGln: 1.594 ± 0.1
1.634ThrArg: 1.634 ± 0.112
2.877ThrSer: 2.877 ± 0.189
2.941ThrThr: 2.941 ± 0.164
2.606ThrVal: 2.606 ± 0.139
0.47ThrTrp: 0.47 ± 0.073
1.562ThrTyr: 1.562 ± 0.109
0.0ThrXaa: 0.0 ± 0.0
Val
2.144ValAla: 2.144 ± 0.148
0.725ValCys: 0.725 ± 0.083
2.215ValAsp: 2.215 ± 0.147
2.096ValGlu: 2.096 ± 0.133
2.223ValPhe: 2.223 ± 0.126
2.606ValGly: 2.606 ± 0.181
1.387ValHis: 1.387 ± 0.127
6.224ValIle: 6.224 ± 0.238
3.738ValLys: 3.738 ± 0.203
4.909ValLeu: 4.909 ± 0.244
1.124ValMet: 1.124 ± 0.106
2.71ValAsn: 2.71 ± 0.152
1.53ValPro: 1.53 ± 0.106
1.65ValGln: 1.65 ± 0.129
1.865ValArg: 1.865 ± 0.123
3.251ValSer: 3.251 ± 0.173
2.048ValThr: 2.048 ± 0.136
2.582ValVal: 2.582 ± 0.179
0.327ValTrp: 0.327 ± 0.05
1.833ValTyr: 1.833 ± 0.139
0.0ValXaa: 0.0 ± 0.0
Trp
0.231TrpAla: 0.231 ± 0.042
0.143TrpCys: 0.143 ± 0.042
0.383TrpAsp: 0.383 ± 0.052
0.359TrpGlu: 0.359 ± 0.056
0.478TrpPhe: 0.478 ± 0.073
0.375TrpGly: 0.375 ± 0.062
0.199TrpHis: 0.199 ± 0.039
1.092TrpIle: 1.092 ± 0.109
0.845TrpLys: 0.845 ± 0.081
0.845TrpLeu: 0.845 ± 0.075
0.303TrpMet: 0.303 ± 0.047
0.932TrpAsn: 0.932 ± 0.098
0.255TrpPro: 0.255 ± 0.047
0.215TrpGln: 0.215 ± 0.036
0.343TrpArg: 0.343 ± 0.05
0.574TrpSer: 0.574 ± 0.068
0.223TrpThr: 0.223 ± 0.04
0.327TrpVal: 0.327 ± 0.05
0.072TrpTrp: 0.072 ± 0.022
0.438TrpTyr: 0.438 ± 0.062
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.785TyrAla: 1.785 ± 0.129
0.669TyrCys: 0.669 ± 0.073
2.2TyrAsp: 2.2 ± 0.134
1.729TyrGlu: 1.729 ± 0.117
2.375TyrPhe: 2.375 ± 0.164
2.0TyrGly: 2.0 ± 0.132
1.498TyrHis: 1.498 ± 0.112
5.371TyrIle: 5.371 ± 0.271
3.554TyrLys: 3.554 ± 0.199
3.722TyrLeu: 3.722 ± 0.172
0.789TyrMet: 0.789 ± 0.075
3.562TyrAsn: 3.562 ± 0.175
1.211TyrPro: 1.211 ± 0.091
2.192TyrGln: 2.192 ± 0.147
1.45TyrArg: 1.45 ± 0.103
2.59TyrSer: 2.59 ± 0.143
2.327TyrThr: 2.327 ± 0.126
1.809TyrVal: 1.809 ± 0.119
0.406TyrTrp: 0.406 ± 0.058
1.905TyrTyr: 1.905 ± 0.124
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 392 proteins (125482 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski