Amino acid dipeptide frequency for Streptococcus sp. DD13

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
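As an illustration of what such a calculation might look like, here is a minimal Python sketch. It assumes one-letter amino acid codes, overlapping pairs counted within each protein only, and normalization by the total number of dipeptides; the function name and the toy sequences are hypothetical, and the database may count or normalize differently.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return overlapping dipeptide frequencies in per mille.

    Assumption: frequencies are normalized by the total number of
    dipeptides observed across all sequences.
    """
    counts = Counter()
    for seq in sequences:
        # Slide a window of length 2 along each protein; pairs never
        # span the boundary between two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy "proteome" of two short sequences.
toy_proteome = ["MAAL", "MLLA"]
freqs = dipeptide_frequencies(toy_proteome)
```

The toy input contains six dipeptides in total (MA, AA, AL, ML, LL, LA), each occurring once, so every observed pair gets the same share of the 1000 per mille.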

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
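The expected value and the over/under comparison can be sketched in a few lines of Python. The page does not state the exact rule used for the red and blue marking, so the plain comparison against the uniform expectation below is an assumption, and the function name is hypothetical; the three example values are taken from the table.

```python
# Uniform expectation over the 20 standard amino acids, in per mille.
N_STANDARD = 20
expected_permille = 1000.0 / N_STANDARD ** 2      # 2.5, i.e. 0.25 %
# With Xaa as a 21st residue there are 441 pairs instead of 400.
expected_with_xaa = 1000.0 / (N_STANDARD + 1) ** 2


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation.

    Assumption: a simple greater-than / less-than comparison.
    """
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Values (in per mille) copied from the table for this proteome.
examples = {"LeuLeu": 10.929, "AlaCys": 0.5, "CysCys": 0.039}
labels = {pair: classify(value) for pair, value in examples.items()}
```

Under this rule LeuLeu is overrepresented, while AlaCys and CysCys are underrepresented.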

All values are given in per mille (‰) and must therefore be multiplied by 10^-3 to obtain fractions.
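Because the text quotes the random expectation in percent while the table uses per mille, a small unit conversion helps when comparing the two. The sketch below is only illustrative; the helper names are hypothetical and the example value is the AlaAla entry from the table.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (10 per mille = 1 percent)."""
    return value / 10.0


ala_ala_permille = 6.031  # AlaAla entry from the table
ala_ala_fraction = permille_to_fraction(ala_ala_permille)
ala_ala_percent = permille_to_percent(ala_ala_permille)
```

With these helpers, the AlaAla value corresponds to a fraction of about 0.006 and roughly 0.6 percent, to be compared with the 0.25 percent expected for random pairing.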

For more information, see the sequence space article on Wikipedia.

Residue order (rows = first residue, columns = second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 6.031 ± 0.136
AlaCys: 0.5 ± 0.03
AlaAsp: 4.28 ± 0.111
AlaGlu: 5.026 ± 0.107
AlaPhe: 3.387 ± 0.089
AlaGly: 5.721 ± 0.115
AlaHis: 1.459 ± 0.051
AlaIle: 5.665 ± 0.11
AlaLys: 4.59 ± 0.098
AlaLeu: 7.827 ± 0.151
AlaMet: 1.837 ± 0.064
AlaAsn: 2.908 ± 0.094
AlaPro: 2.207 ± 0.063
AlaGln: 3.54 ± 0.097
AlaArg: 3.42 ± 0.084
AlaSer: 5.322 ± 0.237
AlaThr: 4.348 ± 0.092
AlaVal: 5.57 ± 0.11
AlaTrp: 0.581 ± 0.04
AlaTyr: 2.976 ± 0.075
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.308 ± 0.025
CysCys: 0.039 ± 0.008
CysAsp: 0.289 ± 0.027
CysGlu: 0.227 ± 0.023
CysPhe: 0.248 ± 0.024
CysGly: 0.504 ± 0.033
CysHis: 0.149 ± 0.021
CysIle: 0.258 ± 0.023
CysLys: 0.19 ± 0.021
CysLeu: 0.525 ± 0.037
CysMet: 0.097 ± 0.014
CysAsn: 0.186 ± 0.019
CysPro: 0.265 ± 0.023
CysGln: 0.31 ± 0.028
CysArg: 0.223 ± 0.025
CysSer: 0.322 ± 0.028
CysThr: 0.223 ± 0.022
CysVal: 0.304 ± 0.027
CysTrp: 0.045 ± 0.01
CysTyr: 0.213 ± 0.022
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.846 ± 0.095
AspCys: 0.277 ± 0.022
AspAsp: 2.373 ± 0.088
AspGlu: 3.995 ± 0.103
AspPhe: 3.106 ± 0.087
AspGly: 4.055 ± 0.104
AspHis: 1.246 ± 0.048
AspIle: 3.844 ± 0.092
AspLys: 3.214 ± 0.079
AspLeu: 6.124 ± 0.121
AspMet: 1.364 ± 0.052
AspAsn: 1.79 ± 0.069
AspPro: 1.955 ± 0.074
AspGln: 3.119 ± 0.088
AspArg: 2.647 ± 0.073
AspSer: 2.86 ± 0.075
AspThr: 2.635 ± 0.08
AspVal: 3.718 ± 0.088
AspTrp: 0.68 ± 0.039
AspTyr: 2.652 ± 0.08
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.952 ± 0.115
GluCys: 0.267 ± 0.027
GluAsp: 3.997 ± 0.088
GluGlu: 6.742 ± 0.143
GluPhe: 2.482 ± 0.072
GluGly: 4.251 ± 0.108
GluHis: 1.248 ± 0.059
GluIle: 5.212 ± 0.115
GluLys: 5.595 ± 0.117
GluLeu: 6.855 ± 0.15
GluMet: 1.926 ± 0.06
GluAsn: 3.348 ± 0.099
GluPro: 1.68 ± 0.061
GluGln: 3.152 ± 0.088
GluArg: 3.617 ± 0.093
GluSer: 3.619 ± 0.22
GluThr: 3.865 ± 0.089
GluVal: 5.057 ± 0.128
GluTrp: 0.616 ± 0.034
GluTyr: 1.846 ± 0.051
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.259 ± 0.085
PheCys: 0.289 ± 0.026
PheAsp: 2.714 ± 0.072
PheGlu: 3.04 ± 0.082
PhePhe: 2.112 ± 0.092
PheGly: 3.383 ± 0.089
PheHis: 0.94 ± 0.049
PheIle: 2.796 ± 0.088
PheLys: 2.166 ± 0.058
PheLeu: 4.842 ± 0.154
PheMet: 1.075 ± 0.045
PheAsn: 1.515 ± 0.056
PhePro: 1.608 ± 0.062
PheGln: 1.868 ± 0.072
PheArg: 1.753 ± 0.059
PheSer: 3.369 ± 0.093
PheThr: 2.484 ± 0.064
PheVal: 3.212 ± 0.089
PheTrp: 0.502 ± 0.035
PheTyr: 1.856 ± 0.061
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.77 ± 0.136
GlyCys: 0.37 ± 0.029
GlyAsp: 3.487 ± 0.093
GlyGlu: 4.016 ± 0.099
GlyPhe: 3.41 ± 0.081
GlyGly: 4.555 ± 0.118
GlyHis: 1.405 ± 0.055
GlyIle: 5.363 ± 0.11
GlyLys: 4.396 ± 0.107
GlyLeu: 7.302 ± 0.135
GlyMet: 1.862 ± 0.063
GlyAsn: 2.689 ± 0.083
GlyPro: 1.62 ± 0.065
GlyGln: 3.387 ± 0.098
GlyArg: 3.272 ± 0.092
GlySer: 3.916 ± 0.092
GlyThr: 3.873 ± 0.093
GlyVal: 5.105 ± 0.107
GlyTrp: 0.68 ± 0.043
GlyTyr: 2.705 ± 0.064
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.288 ± 0.057
HisCys: 0.134 ± 0.018
HisAsp: 0.885 ± 0.045
HisGlu: 1.166 ± 0.054
HisPhe: 1.207 ± 0.045
HisGly: 1.292 ± 0.051
HisHis: 0.653 ± 0.034
HisIle: 1.232 ± 0.051
HisLys: 0.885 ± 0.043
HisLeu: 2.352 ± 0.083
HisMet: 0.475 ± 0.028
HisAsn: 0.595 ± 0.037
HisPro: 1.112 ± 0.051
HisGln: 1.17 ± 0.051
HisArg: 1.044 ± 0.046
HisSer: 1.199 ± 0.056
HisThr: 1.058 ± 0.05
HisVal: 1.236 ± 0.049
HisTrp: 0.192 ± 0.022
HisTyr: 1.046 ± 0.056
HisXaa: 0.0 ± 0.0

Ile
IleAla: 6.012 ± 0.12
IleCys: 0.42 ± 0.031
IleAsp: 4.104 ± 0.111
IleGlu: 4.631 ± 0.122
IlePhe: 2.962 ± 0.095
IleGly: 4.789 ± 0.104
IleHis: 1.405 ± 0.063
IleIle: 4.111 ± 0.108
IleLys: 3.466 ± 0.092
IleLeu: 7.419 ± 0.159
IleMet: 1.337 ± 0.06
IleAsn: 2.428 ± 0.07
IlePro: 2.918 ± 0.075
IleGln: 3.21 ± 0.081
IleArg: 3.396 ± 0.088
IleSer: 4.441 ± 0.096
IleThr: 3.598 ± 0.11
IleVal: 4.863 ± 0.106
IleTrp: 0.591 ± 0.035
IleTyr: 2.304 ± 0.074
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.557 ± 0.112
LysCys: 0.153 ± 0.015
LysAsp: 3.58 ± 0.084
LysGlu: 5.967 ± 0.109
LysPhe: 1.529 ± 0.058
LysGly: 3.879 ± 0.082
LysHis: 1.035 ± 0.047
LysIle: 4.042 ± 0.1
LysLys: 4.638 ± 0.126
LysLeu: 4.762 ± 0.106
LysMet: 1.781 ± 0.064
LysAsn: 2.782 ± 0.083
LysPro: 1.819 ± 0.067
LysGln: 2.414 ± 0.069
LysArg: 3.259 ± 0.091
LysSer: 3.332 ± 0.083
LysThr: 3.476 ± 0.089
LysVal: 4.109 ± 0.102
LysTrp: 0.572 ± 0.037
LysTyr: 1.74 ± 0.077
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 8.98 ± 0.137
LeuCys: 0.469 ± 0.03
LeuAsp: 5.905 ± 0.121
LeuGlu: 7.192 ± 0.143
LeuPhe: 4.545 ± 0.133
LeuGly: 6.752 ± 0.121
LeuHis: 1.934 ± 0.072
LeuIle: 6.378 ± 0.151
LeuLys: 5.663 ± 0.112
LeuLeu: 10.929 ± 0.235
LeuMet: 2.381 ± 0.08
LeuAsn: 3.724 ± 0.084
LeuPro: 4.243 ± 0.105
LeuGln: 4.121 ± 0.099
LeuArg: 4.218 ± 0.109
LeuSer: 7.853 ± 0.168
LeuThr: 6.417 ± 0.119
LeuVal: 7.186 ± 0.125
LeuTrp: 0.761 ± 0.048
LeuTyr: 3.329 ± 0.077
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.972 ± 0.071
MetCys: 0.11 ± 0.014
MetAsp: 1.502 ± 0.054
MetGlu: 1.575 ± 0.065
MetPhe: 0.709 ± 0.043
MetGly: 1.75 ± 0.067
MetHis: 0.329 ± 0.023
MetIle: 1.837 ± 0.068
MetLys: 1.908 ± 0.057
MetLeu: 2.131 ± 0.075
MetMet: 0.717 ± 0.035
MetAsn: 1.172 ± 0.048
MetPro: 0.765 ± 0.044
MetGln: 0.87 ± 0.036
MetArg: 1.184 ± 0.057
MetSer: 1.556 ± 0.052
MetThr: 1.788 ± 0.057
MetVal: 1.666 ± 0.064
MetTrp: 0.159 ± 0.019
MetTyr: 0.548 ± 0.033
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.85 ± 0.082
AsnCys: 0.221 ± 0.024
AsnAsp: 1.98 ± 0.069
AsnGlu: 2.259 ± 0.068
AsnPhe: 1.761 ± 0.066
AsnGly: 3.133 ± 0.095
AsnHis: 0.971 ± 0.051
AsnIle: 2.68 ± 0.082
AsnLys: 2.149 ± 0.07
AsnLeu: 3.972 ± 0.094
AsnMet: 1.031 ± 0.052
AsnAsn: 1.445 ± 0.069
AsnPro: 2.112 ± 0.07
AsnGln: 2.193 ± 0.07
AsnArg: 2.102 ± 0.076
AsnSer: 2.058 ± 0.075
AsnThr: 1.848 ± 0.073
AsnVal: 2.373 ± 0.079
AsnTrp: 0.496 ± 0.033
AsnTyr: 1.567 ± 0.05
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.654 ± 0.08
ProCys: 0.138 ± 0.019
ProAsp: 2.151 ± 0.069
ProGlu: 2.829 ± 0.079
ProPhe: 1.829 ± 0.064
ProGly: 2.106 ± 0.07
ProHis: 0.802 ± 0.042
ProIle: 2.358 ± 0.066
ProLys: 1.852 ± 0.065
ProLeu: 3.265 ± 0.088
ProMet: 0.727 ± 0.039
ProAsn: 1.496 ± 0.05
ProPro: 0.603 ± 0.038
ProGln: 1.581 ± 0.064
ProArg: 1.254 ± 0.05
ProSer: 2.37 ± 0.074
ProThr: 2.143 ± 0.064
ProVal: 2.815 ± 0.073
ProTrp: 0.262 ± 0.024
ProTyr: 1.44 ± 0.052
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.268 ± 0.136
GlnCys: 0.124 ± 0.015
GlnAsp: 2.383 ± 0.073
GlnGlu: 4.173 ± 0.11
GlnPhe: 1.829 ± 0.068
GlnGly: 2.662 ± 0.077
GlnHis: 0.839 ± 0.044
GlnIle: 3.061 ± 0.079
GlnLys: 2.854 ± 0.071
GlnLeu: 4.865 ± 0.133
GlnMet: 1.091 ± 0.051
GlnAsn: 1.728 ± 0.061
GlnPro: 1.438 ± 0.069
GlnGln: 1.963 ± 0.078
GlnArg: 1.804 ± 0.061
GlnSer: 2.724 ± 0.089
GlnThr: 2.732 ± 0.085
GlnVal: 3.666 ± 0.085
GlnTrp: 0.355 ± 0.028
GlnTyr: 1.55 ± 0.06
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.918 ± 0.086
ArgCys: 0.182 ± 0.018
ArgAsp: 2.385 ± 0.081
ArgGlu: 3.391 ± 0.099
ArgPhe: 2.373 ± 0.065
ArgGly: 2.579 ± 0.084
ArgHis: 1.006 ± 0.051
ArgIle: 3.263 ± 0.094
ArgLys: 3.061 ± 0.078
ArgLeu: 4.877 ± 0.122
ArgMet: 1.261 ± 0.053
ArgAsn: 1.831 ± 0.065
ArgPro: 1.579 ± 0.065
ArgGln: 2.337 ± 0.076
ArgArg: 2.375 ± 0.076
ArgSer: 2.641 ± 0.079
ArgThr: 2.275 ± 0.066
ArgVal: 3.162 ± 0.094
ArgTrp: 0.349 ± 0.027
ArgTyr: 1.885 ± 0.061
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.16 ± 0.246
SerCys: 0.312 ± 0.029
SerAsp: 3.381 ± 0.089
SerGlu: 3.735 ± 0.244
SerPhe: 3.176 ± 0.09
SerGly: 4.578 ± 0.113
SerHis: 1.374 ± 0.062
SerIle: 4.177 ± 0.101
SerLys: 3.569 ± 0.073
SerLeu: 6.899 ± 0.135
SerMet: 1.554 ± 0.067
SerAsn: 2.641 ± 0.081
SerPro: 2.228 ± 0.076
SerGln: 3.404 ± 0.105
SerArg: 2.691 ± 0.075
SerSer: 4.524 ± 0.131
SerThr: 3.518 ± 0.114
SerVal: 4.379 ± 0.239
SerTrp: 0.628 ± 0.04
SerTyr: 2.641 ± 0.071
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.326 ± 0.089
ThrCys: 0.281 ± 0.026
ThrAsp: 3.251 ± 0.091
ThrGlu: 3.518 ± 0.076
ThrPhe: 2.598 ± 0.083
ThrGly: 4.361 ± 0.107
ThrHis: 1.029 ± 0.047
ThrIle: 4.528 ± 0.083
ThrLys: 2.92 ± 0.089
ThrLeu: 5.566 ± 0.123
ThrMet: 1.089 ± 0.049
ThrAsn: 2.329 ± 0.076
ThrPro: 2.393 ± 0.085
ThrGln: 1.957 ± 0.067
ThrArg: 2.118 ± 0.07
ThrSer: 3.879 ± 0.126
ThrThr: 3.224 ± 0.086
ThrVal: 4.609 ± 0.114
ThrTrp: 0.463 ± 0.035
ThrTyr: 2.236 ± 0.066
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.95 ± 0.138
ValCys: 0.393 ± 0.03
ValAsp: 4.243 ± 0.095
ValGlu: 5.003 ± 0.119
ValPhe: 3.216 ± 0.085
ValGly: 4.689 ± 0.114
ValHis: 1.279 ± 0.049
ValIle: 4.716 ± 0.105
ValLys: 3.962 ± 0.084
ValLeu: 7.246 ± 0.135
ValMet: 1.643 ± 0.061
ValAsn: 2.716 ± 0.08
ValPro: 2.377 ± 0.071
ValGln: 2.629 ± 0.078
ValArg: 3.117 ± 0.089
ValSer: 4.91 ± 0.234
ValThr: 4.648 ± 0.105
ValVal: 5.303 ± 0.12
ValTrp: 0.548 ± 0.033
ValTyr: 2.476 ± 0.063
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.554 ± 0.039
TrpCys: 0.05 ± 0.01
TrpAsp: 0.488 ± 0.028
TrpGlu: 0.554 ± 0.035
TrpPhe: 0.442 ± 0.032
TrpGly: 0.63 ± 0.041
TrpHis: 0.165 ± 0.019
TrpIle: 0.641 ± 0.037
TrpLys: 0.548 ± 0.043
TrpLeu: 1.056 ± 0.05
TrpMet: 0.273 ± 0.024
TrpAsn: 0.465 ± 0.028
TrpPro: 0.207 ± 0.023
TrpGln: 0.407 ± 0.028
TrpArg: 0.368 ± 0.028
TrpSer: 0.577 ± 0.032
TrpThr: 0.564 ± 0.036
TrpVal: 0.519 ± 0.03
TrpTrp: 0.099 ± 0.016
TrpTyr: 0.283 ± 0.027
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.544 ± 0.071
TyrCys: 0.198 ± 0.021
TyrAsp: 2.122 ± 0.063
TyrGlu: 2.288 ± 0.076
TyrPhe: 1.862 ± 0.074
TyrGly: 2.513 ± 0.072
TyrHis: 0.947 ± 0.042
TyrIle: 2.242 ± 0.062
TyrLys: 1.802 ± 0.069
TyrLeu: 4.166 ± 0.101
TyrMet: 0.721 ± 0.038
TyrAsn: 1.414 ± 0.062
TyrPro: 1.55 ± 0.063
TyrGln: 2.478 ± 0.082
TyrArg: 1.87 ± 0.065
TyrSer: 2.096 ± 0.073
TyrThr: 1.957 ± 0.072
TyrVal: 2.213 ± 0.067
TyrTrp: 0.308 ± 0.027
TyrTyr: 1.604 ± 0.061
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 1649 proteins (483867 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
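A protein-level bootstrap of this kind could look roughly like the following Python sketch. The page states only that bootstrapping with 100 resamples was done at the protein level; taking the sample standard deviation of the resampled frequencies as the error, the one-letter codes, the fixed seed, and all function names are assumptions made for illustration.

```python
import random
from statistics import stdev


def dipeptide_permille(sequences, pair):
    """Frequency of one dipeptide (per mille) over a list of proteins."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Estimate the error by resampling whole proteins with replacement.

    Assumption: the error is the standard deviation of the frequency
    across the resampled proteomes.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_permille(sample, pair))
    return stdev(estimates)


# Hypothetical toy proteome in one-letter code.
toy_proteome = ["MAALLA", "MLLAAK", "MKKLAA", "MAAKLL", "MLAKAA"]
aa_error = bootstrap_error(toy_proteome, "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a few proteins contribute a correspondingly larger spread.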

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski