Amino acid dipeptide frequency for Psychromonas sp. PRT-SC03

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
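As an illustration of what such a calculation might look like, here is a minimal Python sketch. It is not the Proteome-pI implementation: the function name `dipeptide_permille`, the use of one-letter codes, the counting of overlapping pairs within each protein, and the mapping of every non-standard letter to X (Xaa) are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_permille(sequences):
    """Return dipeptide frequencies in per mille for a list of protein sequences."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not real proteome data.
freqs = dipeptide_permille(["MALLK", "ALLA"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

With real data, the sequences would come from the proteome FASTA file rather than a hard-coded list.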

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
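The arithmetic behind the expected value can be sketched in a few lines of Python. The `classify` helper is hypothetical: it assumes that a dipeptide counts as overrepresented simply when its observed frequency exceeds the uniform expectation, which may differ from the exact rule used to colour the table. The three observed values are taken from the table on this page.

```python
N_STANDARD = 20

# Number of ordered pairs of standard amino acids.
n_pairs = N_STANDARD ** 2                  # 400
n_pairs_with_xaa = (N_STANDARD + 1) ** 2   # 441 when Xaa is included

# Uniform expectation, expressed in per mille like the table.
expected_permille = 1000.0 / n_pairs       # 2.5 per mille, i.e. 0.25%

def classify(observed_permille, expected=expected_permille):
    """Assumed rule: compare the observed value with the uniform expectation."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"

# Observed values (per mille) copied from the table.
examples = {"LeuLeu": 12.96, "AlaAla": 4.723, "TrpTrp": 0.108}
labels = {pair: classify(value) for pair, value in examples.items()}
print(expected_permille, labels)
```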

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
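A short Python sketch of the unit conversion, using the LeuLeu value from the table. The helper names are made up for this example. The approximate count additionally assumes that overlapping pairs are counted within each protein, so that the number of dipeptides equals the number of amino acids minus the number of proteins; the page does not state this.

```python
def permille_to_fraction(value_permille):
    """Per mille to a plain fraction: multiply by 10^-3."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Per mille to percent: divide by 10."""
    return value_permille / 10.0

leu_leu = 12.96  # LeuLeu frequency in per mille, from the table
fraction = permille_to_fraction(leu_leu)
percent = permille_to_percent(leu_leu)

# Assumption: total dipeptides = amino acids - proteins (213425 - 787).
n_pairs_total = 213425 - 787
approx_count = fraction * n_pairs_total

print(f"fraction={fraction:.5f} percent={percent:.3f} count~{approx_count:.0f}")
```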

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.723 ± 0.171
AlaCys: 1.204 ± 0.073
AlaAsp: 4.02 ± 0.152
AlaGlu: 4.194 ± 0.159
AlaPhe: 3.374 ± 0.121
AlaGly: 5.154 ± 0.183
AlaHis: 1.921 ± 0.09
AlaIle: 6.358 ± 0.19
AlaLys: 5.257 ± 0.182
AlaLeu: 10.392 ± 0.232
AlaMet: 2.324 ± 0.115
AlaAsn: 3.303 ± 0.126
AlaPro: 2.554 ± 0.109
AlaGln: 4.395 ± 0.159
AlaArg: 3.064 ± 0.124
AlaSer: 5.318 ± 0.163
AlaThr: 4.414 ± 0.143
AlaVal: 4.339 ± 0.151
AlaTrp: 0.764 ± 0.063
AlaTyr: 2.385 ± 0.104
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.148 ± 0.071
CysCys: 0.183 ± 0.033
CysAsp: 0.698 ± 0.071
CysGlu: 0.661 ± 0.051
CysPhe: 0.623 ± 0.061
CysGly: 0.848 ± 0.06
CysHis: 0.384 ± 0.044
CysIle: 0.975 ± 0.076
CysLys: 0.679 ± 0.062
CysLeu: 1.312 ± 0.093
CysMet: 0.253 ± 0.034
CysAsn: 0.492 ± 0.054
CysPro: 0.487 ± 0.051
CysGln: 0.501 ± 0.048
CysArg: 0.412 ± 0.038
CysSer: 0.843 ± 0.06
CysThr: 0.562 ± 0.058
CysVal: 0.825 ± 0.068
CysTrp: 0.108 ± 0.022
CysTyr: 0.445 ± 0.043
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.756 ± 0.152
AspCys: 0.586 ± 0.054
AspAsp: 2.825 ± 0.13
AspGlu: 3.5 ± 0.161
AspPhe: 2.563 ± 0.12
AspGly: 3.06 ± 0.117
AspHis: 1.162 ± 0.075
AspIle: 4.756 ± 0.157
AspLys: 4.081 ± 0.148
AspLeu: 5.22 ± 0.177
AspMet: 1.335 ± 0.076
AspAsn: 2.764 ± 0.126
AspPro: 1.86 ± 0.093
AspGln: 1.485 ± 0.076
AspArg: 1.654 ± 0.103
AspSer: 2.994 ± 0.135
AspThr: 2.441 ± 0.102
AspVal: 3.725 ± 0.138
AspTrp: 0.604 ± 0.05
AspTyr: 1.673 ± 0.078
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.133 ± 0.181
GluCys: 0.572 ± 0.053
GluAsp: 2.839 ± 0.128
GluGlu: 2.971 ± 0.189
GluPhe: 2.249 ± 0.114
GluGly: 3.2 ± 0.137
GluHis: 1.574 ± 0.114
GluIle: 4.212 ± 0.176
GluLys: 4.573 ± 0.172
GluLeu: 6.171 ± 0.185
GluMet: 1.841 ± 0.091
GluAsn: 3.092 ± 0.122
GluPro: 1.513 ± 0.088
GluGln: 3.177 ± 0.15
GluArg: 2.507 ± 0.129
GluSer: 2.952 ± 0.141
GluThr: 2.694 ± 0.126
GluVal: 3.659 ± 0.158
GluTrp: 0.394 ± 0.046
GluTyr: 1.649 ± 0.09
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.275 ± 0.123
PheCys: 0.525 ± 0.053
PheAsp: 2.521 ± 0.126
PheGlu: 2.31 ± 0.126
PhePhe: 1.902 ± 0.118
PheGly: 2.493 ± 0.118
PheHis: 0.736 ± 0.071
PheIle: 3.744 ± 0.157
PheLys: 2.802 ± 0.104
PheLeu: 3.987 ± 0.168
PheMet: 1.139 ± 0.086
PheAsn: 2.474 ± 0.122
PhePro: 1.34 ± 0.088
PheGln: 1.101 ± 0.083
PheArg: 1.228 ± 0.09
PheSer: 4.067 ± 0.169
PheThr: 2.23 ± 0.107
PheVal: 2.596 ± 0.121
PheTrp: 0.426 ± 0.044
PheTyr: 1.42 ± 0.086
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.271 ± 0.182
GlyCys: 0.923 ± 0.071
GlyAsp: 3.294 ± 0.13
GlyGlu: 3.589 ± 0.15
GlyPhe: 2.783 ± 0.116
GlyGly: 4.217 ± 0.21
GlyHis: 1.453 ± 0.089
GlyIle: 5.009 ± 0.159
GlyLys: 4.334 ± 0.145
GlyLeu: 6.176 ± 0.2
GlyMet: 1.921 ± 0.101
GlyAsn: 2.493 ± 0.105
GlyPro: 1.335 ± 0.082
GlyGln: 2.315 ± 0.101
GlyArg: 2.746 ± 0.138
GlySer: 3.828 ± 0.126
GlyThr: 2.999 ± 0.135
GlyVal: 4.789 ± 0.166
GlyTrp: 0.665 ± 0.064
GlyTyr: 2.104 ± 0.098
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.766 ± 0.09
HisCys: 0.44 ± 0.047
HisAsp: 0.918 ± 0.071
HisGlu: 1.031 ± 0.09
HisPhe: 1.19 ± 0.073
HisGly: 1.279 ± 0.087
HisHis: 0.595 ± 0.057
HisIle: 1.72 ± 0.093
HisLys: 1.457 ± 0.087
HisLeu: 2.446 ± 0.119
HisMet: 0.431 ± 0.048
HisAsn: 0.862 ± 0.059
HisPro: 0.989 ± 0.067
HisGln: 1.223 ± 0.079
HisArg: 0.904 ± 0.063
HisSer: 1.429 ± 0.081
HisThr: 1.012 ± 0.07
HisVal: 1.274 ± 0.075
HisTrp: 0.281 ± 0.033
HisTyr: 1.11 ± 0.072
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.883 ± 0.215
IleCys: 1.003 ± 0.075
IleAsp: 5.009 ± 0.16
IleGlu: 5.163 ± 0.171
IlePhe: 2.975 ± 0.14
IleGly: 4.653 ± 0.15
IleHis: 1.307 ± 0.078
IleIle: 5.904 ± 0.176
IleLys: 5.416 ± 0.155
IleLeu: 7.169 ± 0.181
IleMet: 1.602 ± 0.101
IleAsn: 4.217 ± 0.167
IlePro: 2.755 ± 0.11
IleGln: 2.671 ± 0.115
IleArg: 2.971 ± 0.104
IleSer: 5.763 ± 0.156
IleThr: 4.156 ± 0.137
IleVal: 4.803 ± 0.179
IleTrp: 0.6 ± 0.054
IleTyr: 2.16 ± 0.106
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.046 ± 0.149
LysCys: 0.511 ± 0.056
LysAsp: 3.247 ± 0.116
LysGlu: 4.311 ± 0.129
LysPhe: 1.884 ± 0.098
LysGly: 4.17 ± 0.12
LysHis: 1.574 ± 0.1
LysIle: 5.304 ± 0.179
LysLys: 5.866 ± 0.189
LysLeu: 6.658 ± 0.192
LysMet: 1.944 ± 0.094
LysAsn: 3.903 ± 0.14
LysPro: 2.085 ± 0.114
LysGln: 3.374 ± 0.141
LysArg: 3.285 ± 0.145
LysSer: 4.409 ± 0.137
LysThr: 3.627 ± 0.152
LysVal: 4.315 ± 0.154
LysTrp: 0.525 ± 0.048
LysTyr: 1.963 ± 0.104
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.832 ± 0.199
LeuCys: 1.588 ± 0.09
LeuAsp: 6.021 ± 0.177
LeuGlu: 5.744 ± 0.193
LeuPhe: 4.77 ± 0.203
LeuGly: 6.541 ± 0.212
LeuHis: 2.291 ± 0.101
LeuIle: 7.459 ± 0.197
LeuLys: 7.0 ± 0.168
LeuLeu: 12.96 ± 0.371
LeuMet: 2.741 ± 0.106
LeuAsn: 5.571 ± 0.167
LeuPro: 4.203 ± 0.166
LeuGln: 5.14 ± 0.192
LeuArg: 4.358 ± 0.146
LeuSer: 8.996 ± 0.246
LeuThr: 5.819 ± 0.165
LeuVal: 6.101 ± 0.183
LeuTrp: 1.106 ± 0.087
LeuTyr: 2.793 ± 0.115
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.174 ± 0.101
MetCys: 0.276 ± 0.039
MetAsp: 1.064 ± 0.065
MetGlu: 0.895 ± 0.072
MetPhe: 1.003 ± 0.071
MetGly: 1.785 ± 0.098
MetHis: 0.689 ± 0.054
MetIle: 1.729 ± 0.093
MetLys: 1.546 ± 0.076
MetLeu: 2.966 ± 0.131
MetMet: 0.708 ± 0.07
MetAsn: 1.064 ± 0.068
MetPro: 1.237 ± 0.077
MetGln: 1.818 ± 0.097
MetArg: 1.378 ± 0.082
MetSer: 1.827 ± 0.087
MetThr: 1.284 ± 0.089
MetVal: 1.438 ± 0.096
MetTrp: 0.197 ± 0.031
MetTyr: 0.469 ± 0.038
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.236 ± 0.147
AsnCys: 0.454 ± 0.04
AsnAsp: 2.46 ± 0.129
AsnGlu: 2.877 ± 0.126
AsnPhe: 1.743 ± 0.103
AsnGly: 2.713 ± 0.119
AsnHis: 0.843 ± 0.066
AsnIle: 4.175 ± 0.132
AsnLys: 4.114 ± 0.137
AsnLeu: 4.545 ± 0.152
AsnMet: 1.223 ± 0.078
AsnAsn: 2.638 ± 0.147
AsnPro: 1.649 ± 0.093
AsnGln: 1.846 ± 0.103
AsnArg: 1.574 ± 0.087
AsnSer: 3.111 ± 0.116
AsnThr: 2.629 ± 0.118
AsnVal: 2.9 ± 0.126
AsnTrp: 0.539 ± 0.06
AsnTyr: 1.537 ± 0.096
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.61 ± 0.11
ProCys: 0.445 ± 0.046
ProAsp: 1.743 ± 0.088
ProGlu: 2.291 ± 0.105
ProPhe: 1.813 ± 0.094
ProGly: 1.757 ± 0.1
ProHis: 0.722 ± 0.06
ProIle: 2.549 ± 0.115
ProLys: 2.043 ± 0.089
ProLeu: 4.109 ± 0.143
ProMet: 0.759 ± 0.056
ProAsn: 1.415 ± 0.074
ProPro: 0.909 ± 0.061
ProGln: 1.387 ± 0.095
ProArg: 1.214 ± 0.08
ProSer: 2.137 ± 0.089
ProThr: 1.893 ± 0.093
ProVal: 2.315 ± 0.102
ProTrp: 0.459 ± 0.048
ProTyr: 1.139 ± 0.072
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.762 ± 0.14
GlnCys: 0.459 ± 0.051
GlnAsp: 2.104 ± 0.093
GlnGlu: 2.258 ± 0.107
GlnPhe: 1.542 ± 0.08
GlnGly: 3.313 ± 0.133
GlnHis: 1.082 ± 0.081
GlnIle: 3.172 ± 0.133
GlnLys: 3.542 ± 0.129
GlnLeu: 5.477 ± 0.175
GlnMet: 0.965 ± 0.059
GlnAsn: 2.034 ± 0.096
GlnPro: 1.232 ± 0.072
GlnGln: 2.98 ± 0.161
GlnArg: 2.212 ± 0.089
GlnSer: 2.844 ± 0.12
GlnThr: 2.244 ± 0.11
GlnVal: 2.825 ± 0.142
GlnTrp: 0.618 ± 0.056
GlnTyr: 1.588 ± 0.089
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.167 ± 0.143
ArgCys: 0.492 ± 0.049
ArgAsp: 1.963 ± 0.096
ArgGlu: 2.347 ± 0.116
ArgPhe: 1.991 ± 0.099
ArgGly: 2.376 ± 0.12
ArgHis: 0.989 ± 0.07
ArgIle: 3.35 ± 0.144
ArgLys: 2.441 ± 0.118
ArgLeu: 4.667 ± 0.156
ArgMet: 0.876 ± 0.069
ArgAsn: 1.823 ± 0.099
ArgPro: 1.42 ± 0.073
ArgGln: 1.86 ± 0.105
ArgArg: 2.123 ± 0.131
ArgSer: 2.577 ± 0.118
ArgThr: 1.818 ± 0.098
ArgVal: 2.872 ± 0.125
ArgTrp: 0.422 ± 0.045
ArgTyr: 1.546 ± 0.091
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.796 ± 0.192
SerCys: 0.792 ± 0.061
SerAsp: 3.406 ± 0.155
SerGlu: 3.566 ± 0.138
SerPhe: 2.994 ± 0.119
SerGly: 4.653 ± 0.161
SerHis: 1.415 ± 0.096
SerIle: 5.594 ± 0.177
SerLys: 3.978 ± 0.147
SerLeu: 7.417 ± 0.224
SerMet: 1.635 ± 0.094
SerAsn: 3.027 ± 0.121
SerPro: 2.483 ± 0.108
SerGln: 2.732 ± 0.112
SerArg: 2.61 ± 0.111
SerSer: 4.564 ± 0.15
SerThr: 3.678 ± 0.132
SerVal: 4.643 ± 0.17
SerTrp: 0.661 ± 0.044
SerTyr: 2.029 ± 0.096
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.533 ± 0.128
ThrCys: 0.483 ± 0.05
ThrAsp: 2.764 ± 0.116
ThrGlu: 2.699 ± 0.129
ThrPhe: 2.202 ± 0.094
ThrGly: 3.819 ± 0.137
ThrHis: 1.293 ± 0.071
ThrIle: 3.35 ± 0.138
ThrLys: 2.905 ± 0.125
ThrLeu: 6.939 ± 0.185
ThrMet: 1.115 ± 0.071
ThrAsn: 1.991 ± 0.092
ThrPro: 2.258 ± 0.116
ThrGln: 3.05 ± 0.126
ThrArg: 2.352 ± 0.1
ThrSer: 2.971 ± 0.142
ThrThr: 2.839 ± 0.134
ThrVal: 3.022 ± 0.132
ThrTrp: 0.52 ± 0.05
ThrTyr: 1.537 ± 0.095
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.299 ± 0.175
ValCys: 0.857 ± 0.066
ValAsp: 4.048 ± 0.168
ValGlu: 3.748 ± 0.152
ValPhe: 2.619 ± 0.117
ValGly: 3.959 ± 0.16
ValHis: 1.218 ± 0.076
ValIle: 4.934 ± 0.173
ValLys: 3.716 ± 0.145
ValLeu: 6.396 ± 0.186
ValMet: 1.799 ± 0.089
ValAsn: 3.008 ± 0.118
ValPro: 1.87 ± 0.099
ValGln: 2.521 ± 0.118
ValArg: 2.647 ± 0.119
ValSer: 4.423 ± 0.145
ValThr: 3.406 ± 0.123
ValVal: 4.147 ± 0.154
ValTrp: 0.45 ± 0.046
ValTyr: 1.706 ± 0.088
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.567 ± 0.046
TrpCys: 0.141 ± 0.028
TrpAsp: 0.609 ± 0.063
TrpGlu: 0.412 ± 0.045
TrpPhe: 0.497 ± 0.047
TrpGly: 0.647 ± 0.053
TrpHis: 0.258 ± 0.036
TrpIle: 0.693 ± 0.06
TrpLys: 0.534 ± 0.055
TrpLeu: 1.354 ± 0.095
TrpMet: 0.291 ± 0.035
TrpAsn: 0.319 ± 0.032
TrpPro: 0.356 ± 0.04
TrpGln: 0.764 ± 0.056
TrpArg: 0.497 ± 0.045
TrpSer: 0.539 ± 0.05
TrpThr: 0.38 ± 0.043
TrpVal: 0.618 ± 0.047
TrpTrp: 0.108 ± 0.02
TrpTyr: 0.253 ± 0.044
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.479 ± 0.112
TyrCys: 0.506 ± 0.053
TyrAsp: 1.49 ± 0.098
TyrGlu: 1.406 ± 0.082
TyrPhe: 1.598 ± 0.092
TyrGly: 1.607 ± 0.084
TyrHis: 0.853 ± 0.074
TyrIle: 2.193 ± 0.117
TyrLys: 1.715 ± 0.094
TyrLeu: 3.678 ± 0.131
TyrMet: 0.637 ± 0.055
TyrAsn: 1.317 ± 0.083
TyrPro: 1.171 ± 0.075
TyrGln: 1.916 ± 0.109
TyrArg: 1.387 ± 0.081
TyrSer: 2.08 ± 0.098
TyrThr: 1.518 ± 0.095
TyrVal: 1.565 ± 0.088
TyrTrp: 0.389 ± 0.05
TyrTyr: 1.092 ± 0.071
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 787 proteins (213425 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
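To make the idea of a protein-level bootstrap concrete, here is a hedged Python sketch. It assumes that each of the 100 replicates resamples whole proteins with replacement and that the reported error is the standard deviation of the frequency across replicates; the page does not spell out either detail. The function names and the toy sequences are invented for this example.

```python
import random
import statistics

def pair_permille(sequences, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Assumed procedure: resample proteins with replacement, take the spread."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)

# Toy proteome, not real data.
toy_proteome = ["MALLKALL", "MKKLLA", "MAAGLL", "MSTLLLK"]
error = bootstrap_error(toy_proteome, "LL")
print(f"LL: {pair_permille(toy_proteome, 'LL'):.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs within a protein together, so the estimate reflects variation between proteins.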

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski