Amino acid dipepetide frequency for Mycoplasma amphoriforme A39

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.684AlaAla: 3.684 ± 0.17
0.629AlaCys: 0.629 ± 0.047
2.509AlaAsp: 2.509 ± 0.114
2.578AlaGlu: 2.578 ± 0.106
2.717AlaPhe: 2.717 ± 0.107
2.957AlaGly: 2.957 ± 0.198
0.946AlaHis: 0.946 ± 0.063
5.968AlaIle: 5.968 ± 0.202
5.695AlaLys: 5.695 ± 0.184
5.455AlaLeu: 5.455 ± 0.167
1.189AlaMet: 1.189 ± 0.064
4.35AlaAsn: 4.35 ± 0.143
1.626AlaPro: 1.626 ± 0.079
2.109AlaGln: 2.109 ± 0.104
1.873AlaArg: 1.873 ± 0.095
3.844AlaSer: 3.844 ± 0.138
3.509AlaThr: 3.509 ± 0.15
3.495AlaVal: 3.495 ± 0.138
0.469AlaTrp: 0.469 ± 0.045
2.073AlaTyr: 2.073 ± 0.077
0.0AlaXaa: 0.0 ± 0.0
Cys
0.4CysAla: 0.4 ± 0.038
0.055CysCys: 0.055 ± 0.013
0.436CysAsp: 0.436 ± 0.047
0.473CysGlu: 0.473 ± 0.048
0.385CysPhe: 0.385 ± 0.04
0.426CysGly: 0.426 ± 0.034
0.196CysHis: 0.196 ± 0.025
0.516CysIle: 0.516 ± 0.05
0.542CysLys: 0.542 ± 0.053
0.724CysLeu: 0.724 ± 0.055
0.12CysMet: 0.12 ± 0.022
0.458CysAsn: 0.458 ± 0.042
0.207CysPro: 0.207 ± 0.03
0.331CysGln: 0.331 ± 0.034
0.258CysArg: 0.258 ± 0.03
0.564CysSer: 0.564 ± 0.051
0.305CysThr: 0.305 ± 0.032
0.462CysVal: 0.462 ± 0.046
0.062CysTrp: 0.062 ± 0.015
0.247CysTyr: 0.247 ± 0.029
0.0CysXaa: 0.0 ± 0.0
Asp
3.484AspAla: 3.484 ± 0.162
0.429AspCys: 0.429 ± 0.04
3.0AspAsp: 3.0 ± 0.169
3.786AspGlu: 3.786 ± 0.163
3.288AspPhe: 3.288 ± 0.119
2.637AspGly: 2.637 ± 0.172
1.437AspHis: 1.437 ± 0.086
3.215AspIle: 3.215 ± 0.13
3.92AspLys: 3.92 ± 0.141
6.313AspLeu: 6.313 ± 0.19
0.655AspMet: 0.655 ± 0.052
3.222AspAsn: 3.222 ± 0.134
1.986AspPro: 1.986 ± 0.086
3.0AspGln: 3.0 ± 0.181
1.622AspArg: 1.622 ± 0.083
3.019AspSer: 3.019 ± 0.14
2.066AspThr: 2.066 ± 0.116
3.124AspVal: 3.124 ± 0.127
0.575AspTrp: 0.575 ± 0.05
2.298AspTyr: 2.298 ± 0.098
0.0AspXaa: 0.0 ± 0.0
Glu
2.989GluAla: 2.989 ± 0.14
0.331GluCys: 0.331 ± 0.039
2.2GluAsp: 2.2 ± 0.111
2.891GluGlu: 2.891 ± 0.158
2.666GluPhe: 2.666 ± 0.094
1.982GluGly: 1.982 ± 0.097
1.051GluHis: 1.051 ± 0.072
6.208GluIle: 6.208 ± 0.185
5.808GluLys: 5.808 ± 0.197
6.03GluLeu: 6.03 ± 0.233
1.146GluMet: 1.146 ± 0.057
4.579GluAsn: 4.579 ± 0.166
1.877GluPro: 1.877 ± 0.197
3.08GluGln: 3.08 ± 0.148
2.16GluArg: 2.16 ± 0.139
3.029GluSer: 3.029 ± 0.15
3.32GluThr: 3.32 ± 0.222
3.386GluVal: 3.386 ± 0.156
0.567GluTrp: 0.567 ± 0.051
1.858GluTyr: 1.858 ± 0.083
0.0GluXaa: 0.0 ± 0.0
Phe
3.284PheAla: 3.284 ± 0.12
0.385PheCys: 0.385 ± 0.037
3.339PheAsp: 3.339 ± 0.136
2.888PheGlu: 2.888 ± 0.135
2.815PhePhe: 2.815 ± 0.162
2.786PheGly: 2.786 ± 0.115
0.96PheHis: 0.96 ± 0.063
3.746PheIle: 3.746 ± 0.143
4.273PheLys: 4.273 ± 0.127
5.215PheLeu: 5.215 ± 0.218
0.855PheMet: 0.855 ± 0.058
3.822PheAsn: 3.822 ± 0.157
1.244PhePro: 1.244 ± 0.065
1.862PheGln: 1.862 ± 0.089
1.611PheArg: 1.611 ± 0.084
3.688PheSer: 3.688 ± 0.128
2.346PheThr: 2.346 ± 0.101
3.266PheVal: 3.266 ± 0.12
0.607PheTrp: 0.607 ± 0.06
2.055PheTyr: 2.055 ± 0.108
0.0PheXaa: 0.0 ± 0.0
Gly
2.549GlyAla: 2.549 ± 0.126
0.342GlyCys: 0.342 ± 0.033
2.488GlyAsp: 2.488 ± 0.171
2.142GlyGlu: 2.142 ± 0.097
2.942GlyPhe: 2.942 ± 0.124
3.309GlyGly: 3.309 ± 0.219
0.938GlyHis: 0.938 ± 0.077
4.502GlyIle: 4.502 ± 0.185
3.906GlyLys: 3.906 ± 0.151
4.742GlyLeu: 4.742 ± 0.16
0.873GlyMet: 0.873 ± 0.071
2.971GlyAsn: 2.971 ± 0.153
1.138GlyPro: 1.138 ± 0.069
1.931GlyGln: 1.931 ± 0.217
1.655GlyArg: 1.655 ± 0.093
3.724GlySer: 3.724 ± 0.231
3.204GlyThr: 3.204 ± 0.163
3.149GlyVal: 3.149 ± 0.114
0.491GlyTrp: 0.491 ± 0.041
2.215GlyTyr: 2.215 ± 0.096
0.0GlyXaa: 0.0 ± 0.0
His
1.109HisAla: 1.109 ± 0.068
0.171HisCys: 0.171 ± 0.025
1.473HisAsp: 1.473 ± 0.094
1.386HisGlu: 1.386 ± 0.092
0.975HisPhe: 0.975 ± 0.056
1.084HisGly: 1.084 ± 0.076
0.698HisHis: 0.698 ± 0.058
1.007HisIle: 1.007 ± 0.063
1.327HisLys: 1.327 ± 0.07
2.237HisLeu: 2.237 ± 0.101
0.236HisMet: 0.236 ± 0.031
1.124HisAsn: 1.124 ± 0.061
0.709HisPro: 0.709 ± 0.052
1.066HisGln: 1.066 ± 0.071
0.724HisArg: 0.724 ± 0.059
1.149HisSer: 1.149 ± 0.07
0.829HisThr: 0.829 ± 0.055
1.215HisVal: 1.215 ± 0.067
0.175HisTrp: 0.175 ± 0.024
0.895HisTyr: 0.895 ± 0.066
0.0HisXaa: 0.0 ± 0.0
Ile
5.677IleAla: 5.677 ± 0.178
0.731IleCys: 0.731 ± 0.062
5.63IleAsp: 5.63 ± 0.192
4.855IleGlu: 4.855 ± 0.154
4.27IlePhe: 4.27 ± 0.191
4.36IleGly: 4.36 ± 0.144
1.418IleHis: 1.418 ± 0.077
6.961IleIle: 6.961 ± 0.217
7.681IleLys: 7.681 ± 0.206
6.775IleLeu: 6.775 ± 0.196
1.407IleMet: 1.407 ± 0.069
6.863IleAsn: 6.863 ± 0.151
3.117IlePro: 3.117 ± 0.117
2.804IleGln: 2.804 ± 0.113
2.582IleArg: 2.582 ± 0.11
5.841IleSer: 5.841 ± 0.154
4.688IleThr: 4.688 ± 0.16
5.572IleVal: 5.572 ± 0.168
0.702IleTrp: 0.702 ± 0.054
3.099IleTyr: 3.099 ± 0.145
0.0IleXaa: 0.0 ± 0.0
Lys
4.026LysAla: 4.026 ± 0.128
0.393LysCys: 0.393 ± 0.038
4.03LysAsp: 4.03 ± 0.134
5.513LysGlu: 5.513 ± 0.227
3.771LysPhe: 3.771 ± 0.151
2.913LysGly: 2.913 ± 0.137
1.629LysHis: 1.629 ± 0.098
9.43LysIle: 9.43 ± 0.231
10.605LysLys: 10.605 ± 0.267
7.979LysLeu: 7.979 ± 0.218
2.302LysMet: 2.302 ± 0.097
8.132LysAsn: 8.132 ± 0.2
3.317LysPro: 3.317 ± 0.228
4.615LysGln: 4.615 ± 0.17
3.146LysArg: 3.146 ± 0.12
4.75LysSer: 4.75 ± 0.135
6.75LysThr: 6.75 ± 0.163
4.684LysVal: 4.684 ± 0.166
0.724LysTrp: 0.724 ± 0.055
3.495LysTyr: 3.495 ± 0.137
0.0LysXaa: 0.0 ± 0.0
Leu
5.932LeuAla: 5.932 ± 0.145
0.72LeuCys: 0.72 ± 0.061
5.041LeuAsp: 5.041 ± 0.157
5.39LeuGlu: 5.39 ± 0.196
4.597LeuPhe: 4.597 ± 0.176
4.582LeuGly: 4.582 ± 0.174
1.666LeuHis: 1.666 ± 0.075
8.452LeuIle: 8.452 ± 0.231
9.867LeuLys: 9.867 ± 0.251
8.339LeuLeu: 8.339 ± 0.227
1.669LeuMet: 1.669 ± 0.084
6.884LeuAsn: 6.884 ± 0.149
3.669LeuPro: 3.669 ± 0.127
4.059LeuGln: 4.059 ± 0.178
3.015LeuArg: 3.015 ± 0.112
6.557LeuSer: 6.557 ± 0.155
5.866LeuThr: 5.866 ± 0.159
5.906LeuVal: 5.906 ± 0.15
0.855LeuTrp: 0.855 ± 0.064
2.822LeuTyr: 2.822 ± 0.099
0.0LeuXaa: 0.0 ± 0.0
Met
1.266MetAla: 1.266 ± 0.067
0.12MetCys: 0.12 ± 0.018
0.786MetAsp: 0.786 ± 0.054
0.833MetGlu: 0.833 ± 0.055
0.873MetPhe: 0.873 ± 0.056
0.796MetGly: 0.796 ± 0.064
0.415MetHis: 0.415 ± 0.042
1.698MetIle: 1.698 ± 0.093
1.615MetLys: 1.615 ± 0.078
1.757MetLeu: 1.757 ± 0.09
0.444MetMet: 0.444 ± 0.05
1.251MetAsn: 1.251 ± 0.067
0.647MetPro: 0.647 ± 0.074
0.815MetGln: 0.815 ± 0.06
0.753MetArg: 0.753 ± 0.051
1.011MetSer: 1.011 ± 0.066
0.873MetThr: 0.873 ± 0.057
1.116MetVal: 1.116 ± 0.069
0.135MetTrp: 0.135 ± 0.025
0.527MetTyr: 0.527 ± 0.045
0.0MetXaa: 0.0 ± 0.0
Asn
4.211AsnAla: 4.211 ± 0.149
0.527AsnCys: 0.527 ± 0.047
4.051AsnAsp: 4.051 ± 0.152
4.779AsnGlu: 4.779 ± 0.153
3.913AsnPhe: 3.913 ± 0.142
3.295AsnGly: 3.295 ± 0.138
1.953AsnHis: 1.953 ± 0.105
4.815AsnIle: 4.815 ± 0.153
6.255AsnLys: 6.255 ± 0.191
7.503AsnLeu: 7.503 ± 0.196
1.029AsnMet: 1.029 ± 0.064
5.772AsnAsn: 5.772 ± 0.186
3.32AsnPro: 3.32 ± 0.185
4.848AsnGln: 4.848 ± 0.176
2.477AsnArg: 2.477 ± 0.11
4.644AsnSer: 4.644 ± 0.174
2.993AsnThr: 2.993 ± 0.119
3.83AsnVal: 3.83 ± 0.108
0.858AsnTrp: 0.858 ± 0.062
3.117AsnTyr: 3.117 ± 0.147
0.0AsnXaa: 0.0 ± 0.0
Pro
1.706ProAla: 1.706 ± 0.153
0.171ProCys: 0.171 ± 0.025
1.757ProAsp: 1.757 ± 0.088
2.535ProGlu: 2.535 ± 0.23
1.669ProPhe: 1.669 ± 0.071
1.844ProGly: 1.844 ± 0.097
0.553ProHis: 0.553 ± 0.048
2.815ProIle: 2.815 ± 0.107
3.229ProLys: 3.229 ± 0.205
3.044ProLeu: 3.044 ± 0.111
0.629ProMet: 0.629 ± 0.065
2.538ProAsn: 2.538 ± 0.119
1.4ProPro: 1.4 ± 0.163
1.215ProGln: 1.215 ± 0.087
0.938ProArg: 0.938 ± 0.069
2.426ProSer: 2.426 ± 0.119
2.135ProThr: 2.135 ± 0.105
2.309ProVal: 2.309 ± 0.131
0.302ProTrp: 0.302 ± 0.034
1.306ProTyr: 1.306 ± 0.072
0.0ProXaa: 0.0 ± 0.0
Gln
2.695GlnAla: 2.695 ± 0.098
0.193GlnCys: 0.193 ± 0.028
1.946GlnAsp: 1.946 ± 0.102
2.593GlnGlu: 2.593 ± 0.149
1.927GlnPhe: 1.927 ± 0.093
1.807GlnGly: 1.807 ± 0.08
0.796GlnHis: 0.796 ± 0.052
4.273GlnIle: 4.273 ± 0.132
4.993GlnLys: 4.993 ± 0.182
4.299GlnLeu: 4.299 ± 0.162
0.847GlnMet: 0.847 ± 0.058
3.586GlnAsn: 3.586 ± 0.196
1.764GlnPro: 1.764 ± 0.201
2.058GlnGln: 2.058 ± 0.148
1.527GlnArg: 1.527 ± 0.093
2.844GlnSer: 2.844 ± 0.184
2.931GlnThr: 2.931 ± 0.11
2.222GlnVal: 2.222 ± 0.088
0.436GlnTrp: 0.436 ± 0.045
1.662GlnTyr: 1.662 ± 0.098
0.0GlnXaa: 0.0 ± 0.0
Arg
1.677ArgAla: 1.677 ± 0.08
0.269ArgCys: 0.269 ± 0.03
1.531ArgAsp: 1.531 ± 0.081
1.88ArgGlu: 1.88 ± 0.095
2.058ArgPhe: 2.058 ± 0.08
1.407ArgGly: 1.407 ± 0.086
0.64ArgHis: 0.64 ± 0.049
3.055ArgIle: 3.055 ± 0.117
2.88ArgLys: 2.88 ± 0.111
3.357ArgLeu: 3.357 ± 0.125
0.815ArgMet: 0.815 ± 0.058
2.284ArgAsn: 2.284 ± 0.099
1.175ArgPro: 1.175 ± 0.065
1.462ArgGln: 1.462 ± 0.092
1.2ArgArg: 1.2 ± 0.073
2.182ArgSer: 2.182 ± 0.101
1.866ArgThr: 1.866 ± 0.078
1.855ArgVal: 1.855 ± 0.086
0.367ArgTrp: 0.367 ± 0.039
1.353ArgTyr: 1.353 ± 0.062
0.0ArgXaa: 0.0 ± 0.0
Ser
3.619SerAla: 3.619 ± 0.153
0.404SerCys: 0.404 ± 0.043
3.2SerAsp: 3.2 ± 0.126
3.924SerGlu: 3.924 ± 0.144
3.444SerPhe: 3.444 ± 0.133
4.004SerGly: 4.004 ± 0.245
1.229SerHis: 1.229 ± 0.072
4.71SerIle: 4.71 ± 0.141
5.957SerLys: 5.957 ± 0.178
6.663SerLeu: 6.663 ± 0.156
0.946SerMet: 0.946 ± 0.056
4.324SerAsn: 4.324 ± 0.148
1.884SerPro: 1.884 ± 0.08
2.982SerGln: 2.982 ± 0.162
2.248SerArg: 2.248 ± 0.089
4.52SerSer: 4.52 ± 0.176
3.346SerThr: 3.346 ± 0.146
3.651SerVal: 3.651 ± 0.14
0.92SerTrp: 0.92 ± 0.078
2.491SerTyr: 2.491 ± 0.099
0.0SerXaa: 0.0 ± 0.0
Thr
2.895ThrAla: 2.895 ± 0.128
0.345ThrCys: 0.345 ± 0.037
2.858ThrAsp: 2.858 ± 0.177
2.633ThrGlu: 2.633 ± 0.115
2.644ThrPhe: 2.644 ± 0.099
3.404ThrGly: 3.404 ± 0.196
0.938ThrHis: 0.938 ± 0.06
5.131ThrIle: 5.131 ± 0.147
5.386ThrLys: 5.386 ± 0.142
4.666ThrLeu: 4.666 ± 0.124
0.818ThrMet: 0.818 ± 0.046
4.808ThrAsn: 4.808 ± 0.165
2.28ThrPro: 2.28 ± 0.1
2.058ThrGln: 2.058 ± 0.099
1.829ThrArg: 1.829 ± 0.079
3.549ThrSer: 3.549 ± 0.131
3.68ThrThr: 3.68 ± 0.132
3.226ThrVal: 3.226 ± 0.123
0.462ThrTrp: 0.462 ± 0.048
1.953ThrTyr: 1.953 ± 0.085
0.0ThrXaa: 0.0 ± 0.0
Val
4.04ValAla: 4.04 ± 0.139
0.458ValCys: 0.458 ± 0.038
3.64ValAsp: 3.64 ± 0.145
3.222ValGlu: 3.222 ± 0.16
3.117ValPhe: 3.117 ± 0.128
3.204ValGly: 3.204 ± 0.116
1.066ValHis: 1.066 ± 0.057
5.342ValIle: 5.342 ± 0.155
4.724ValLys: 4.724 ± 0.14
5.39ValLeu: 5.39 ± 0.151
0.996ValMet: 0.996 ± 0.058
4.393ValAsn: 4.393 ± 0.134
1.771ValPro: 1.771 ± 0.09
2.088ValGln: 2.088 ± 0.096
1.902ValArg: 1.902 ± 0.08
4.27ValSer: 4.27 ± 0.14
2.858ValThr: 2.858 ± 0.105
4.317ValVal: 4.317 ± 0.136
0.556ValTrp: 0.556 ± 0.056
2.062ValTyr: 2.062 ± 0.094
0.0ValXaa: 0.0 ± 0.0
Trp
0.364TrpAla: 0.364 ± 0.032
0.091TrpCys: 0.091 ± 0.016
0.546TrpAsp: 0.546 ± 0.053
0.342TrpGlu: 0.342 ± 0.041
0.716TrpPhe: 0.716 ± 0.06
0.335TrpGly: 0.335 ± 0.036
0.178TrpHis: 0.178 ± 0.029
0.931TrpIle: 0.931 ± 0.074
0.924TrpLys: 0.924 ± 0.065
0.996TrpLeu: 0.996 ± 0.066
0.247TrpMet: 0.247 ± 0.031
0.764TrpAsn: 0.764 ± 0.057
0.229TrpPro: 0.229 ± 0.029
0.462TrpGln: 0.462 ± 0.037
0.353TrpArg: 0.353 ± 0.036
0.786TrpSer: 0.786 ± 0.08
0.531TrpThr: 0.531 ± 0.045
0.491TrpVal: 0.491 ± 0.051
0.156TrpTrp: 0.156 ± 0.025
0.407TrpTyr: 0.407 ± 0.044
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.069TyrAla: 2.069 ± 0.085
0.404TyrCys: 0.404 ± 0.041
2.615TyrAsp: 2.615 ± 0.105
2.517TyrGlu: 2.517 ± 0.104
2.258TyrPhe: 2.258 ± 0.104
2.08TyrGly: 2.08 ± 0.074
0.88TyrHis: 0.88 ± 0.057
2.255TyrIle: 2.255 ± 0.11
2.618TyrLys: 2.618 ± 0.101
4.404TyrLeu: 4.404 ± 0.163
0.502TyrMet: 0.502 ± 0.049
2.066TyrAsn: 2.066 ± 0.102
1.16TyrPro: 1.16 ± 0.07
2.557TyrGln: 2.557 ± 0.117
1.44TyrArg: 1.44 ± 0.065
2.069TyrSer: 2.069 ± 0.109
1.469TyrThr: 1.469 ± 0.081
2.142TyrVal: 2.142 ± 0.087
0.429TyrTrp: 0.429 ± 0.04
1.571TyrTyr: 1.571 ± 0.096
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 715 proteins (274971 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski