Amino acid dipepetide frequency for Cardinium endosymbiont of Culicoides punctatus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.425AlaAla: 4.425 ± 0.179
1.001AlaCys: 1.001 ± 0.056
2.83AlaAsp: 2.83 ± 0.106
3.342AlaGlu: 3.342 ± 0.12
2.764AlaPhe: 2.764 ± 0.1
3.303AlaGly: 3.303 ± 0.131
1.767AlaHis: 1.767 ± 0.082
6.186AlaIle: 6.186 ± 0.143
4.723AlaLys: 4.723 ± 0.133
6.83AlaLeu: 6.83 ± 0.182
1.737AlaMet: 1.737 ± 0.082
2.814AlaAsn: 2.814 ± 0.107
1.78AlaPro: 1.78 ± 0.083
2.414AlaGln: 2.414 ± 0.097
2.266AlaArg: 2.266 ± 0.097
4.118AlaSer: 4.118 ± 0.125
3.791AlaThr: 3.791 ± 0.115
3.791AlaVal: 3.791 ± 0.125
0.449AlaTrp: 0.449 ± 0.043
2.741AlaTyr: 2.741 ± 0.107
0.0AlaXaa: 0.0 ± 0.0
Cys
0.667CysAla: 0.667 ± 0.048
0.261CysCys: 0.261 ± 0.031
0.515CysAsp: 0.515 ± 0.04
0.499CysGlu: 0.499 ± 0.041
0.862CysPhe: 0.862 ± 0.057
0.746CysGly: 0.746 ± 0.056
0.37CysHis: 0.37 ± 0.032
1.288CysIle: 1.288 ± 0.069
1.047CysLys: 1.047 ± 0.065
1.199CysLeu: 1.199 ± 0.063
0.406CysMet: 0.406 ± 0.039
0.72CysAsn: 0.72 ± 0.06
0.443CysPro: 0.443 ± 0.045
0.466CysGln: 0.466 ± 0.042
0.443CysArg: 0.443 ± 0.044
1.087CysSer: 1.087 ± 0.056
0.816CysThr: 0.816 ± 0.05
0.661CysVal: 0.661 ± 0.048
0.132CysTrp: 0.132 ± 0.023
0.598CysTyr: 0.598 ± 0.045
0.0CysXaa: 0.0 ± 0.0
Asp
3.081AspAla: 3.081 ± 0.11
0.72AspCys: 0.72 ± 0.054
1.975AspAsp: 1.975 ± 0.089
2.711AspGlu: 2.711 ± 0.115
2.507AspPhe: 2.507 ± 0.087
2.348AspGly: 2.348 ± 0.095
1.308AspHis: 1.308 ± 0.066
4.855AspIle: 4.855 ± 0.138
3.927AspLys: 3.927 ± 0.133
4.871AspLeu: 4.871 ± 0.122
1.11AspMet: 1.11 ± 0.062
2.437AspAsn: 2.437 ± 0.086
1.991AspPro: 1.991 ± 0.081
1.721AspGln: 1.721 ± 0.077
2.054AspArg: 2.054 ± 0.085
2.764AspSer: 2.764 ± 0.11
2.758AspThr: 2.758 ± 0.113
2.893AspVal: 2.893 ± 0.099
0.542AspTrp: 0.542 ± 0.048
2.285AspTyr: 2.285 ± 0.085
0.003AspXaa: 0.003 ± 0.003
Glu
3.996GluAla: 3.996 ± 0.117
0.651GluCys: 0.651 ± 0.046
2.91GluAsp: 2.91 ± 0.122
4.974GluGlu: 4.974 ± 0.157
1.688GluPhe: 1.688 ± 0.08
3.312GluGly: 3.312 ± 0.11
1.562GluHis: 1.562 ± 0.066
5.403GluIle: 5.403 ± 0.163
5.845GluLys: 5.845 ± 0.156
5.855GluLeu: 5.855 ± 0.144
1.413GluMet: 1.413 ± 0.058
3.091GluAsn: 3.091 ± 0.124
1.45GluPro: 1.45 ± 0.072
3.233GluGln: 3.233 ± 0.115
2.659GluArg: 2.659 ± 0.123
3.312GluSer: 3.312 ± 0.109
3.131GluThr: 3.131 ± 0.11
3.587GluVal: 3.587 ± 0.132
0.489GluTrp: 0.489 ± 0.04
1.83GluTyr: 1.83 ± 0.078
0.007GluXaa: 0.007 ± 0.005
Phe
2.315PheAla: 2.315 ± 0.099
0.71PheCys: 0.71 ± 0.046
2.11PheAsp: 2.11 ± 0.085
1.952PheGlu: 1.952 ± 0.081
2.668PhePhe: 2.668 ± 0.128
2.526PheGly: 2.526 ± 0.098
1.09PheHis: 1.09 ± 0.064
3.785PheIle: 3.785 ± 0.128
2.692PheLys: 2.692 ± 0.095
5.238PheLeu: 5.238 ± 0.159
0.978PheMet: 0.978 ± 0.062
2.057PheAsn: 2.057 ± 0.083
1.674PhePro: 1.674 ± 0.075
1.684PheGln: 1.684 ± 0.076
1.688PheArg: 1.688 ± 0.084
4.019PheSer: 4.019 ± 0.148
2.341PheThr: 2.341 ± 0.092
2.553PheVal: 2.553 ± 0.089
0.406PheTrp: 0.406 ± 0.038
1.995PheTyr: 1.995 ± 0.098
0.0PheXaa: 0.0 ± 0.0
Gly
3.524GlyAla: 3.524 ± 0.128
0.776GlyCys: 0.776 ± 0.053
2.493GlyAsp: 2.493 ± 0.11
2.734GlyGlu: 2.734 ± 0.108
2.444GlyPhe: 2.444 ± 0.092
3.464GlyGly: 3.464 ± 0.137
1.483GlyHis: 1.483 ± 0.096
5.697GlyIle: 5.697 ± 0.152
4.716GlyLys: 4.716 ± 0.128
5.228GlyLeu: 5.228 ± 0.165
1.668GlyMet: 1.668 ± 0.086
3.058GlyAsn: 3.058 ± 0.118
1.351GlyPro: 1.351 ± 0.063
1.631GlyGln: 1.631 ± 0.081
2.203GlyArg: 2.203 ± 0.088
3.491GlySer: 3.491 ± 0.122
3.061GlyThr: 3.061 ± 0.111
3.276GlyVal: 3.276 ± 0.124
0.598GlyTrp: 0.598 ± 0.049
2.906GlyTyr: 2.906 ± 0.116
0.0GlyXaa: 0.0 ± 0.0
His
1.998HisAla: 1.998 ± 0.083
0.396HisCys: 0.396 ± 0.038
0.918HisAsp: 0.918 ± 0.054
1.136HisGlu: 1.136 ± 0.061
1.301HisPhe: 1.301 ± 0.058
1.483HisGly: 1.483 ± 0.078
0.845HisHis: 0.845 ± 0.065
2.602HisIle: 2.602 ± 0.099
2.051HisLys: 2.051 ± 0.089
2.645HisLeu: 2.645 ± 0.104
0.515HisMet: 0.515 ± 0.045
1.476HisAsn: 1.476 ± 0.067
1.159HisPro: 1.159 ± 0.065
1.057HisGln: 1.057 ± 0.061
0.925HisArg: 0.925 ± 0.055
1.437HisSer: 1.437 ± 0.064
1.631HisThr: 1.631 ± 0.081
1.44HisVal: 1.44 ± 0.076
0.268HisTrp: 0.268 ± 0.026
1.314HisTyr: 1.314 ± 0.072
0.0HisXaa: 0.0 ± 0.0
Ile
6.767IleAla: 6.767 ± 0.175
1.189IleCys: 1.189 ± 0.075
4.974IleAsp: 4.974 ± 0.127
5.4IleGlu: 5.4 ± 0.12
3.679IlePhe: 3.679 ± 0.133
5.096IleGly: 5.096 ± 0.147
2.196IleHis: 2.196 ± 0.095
6.493IleIle: 6.493 ± 0.186
6.585IleLys: 6.585 ± 0.142
7.873IleLeu: 7.873 ± 0.171
1.628IleMet: 1.628 ± 0.083
4.178IleAsn: 4.178 ± 0.135
3.587IlePro: 3.587 ± 0.109
3.722IleGln: 3.722 ± 0.125
3.435IleArg: 3.435 ± 0.09
6.024IleSer: 6.024 ± 0.164
5.291IleThr: 5.291 ± 0.141
5.598IleVal: 5.598 ± 0.167
0.73IleTrp: 0.73 ± 0.055
3.005IleTyr: 3.005 ± 0.098
0.003IleXaa: 0.003 ± 0.003
Lys
4.287LysAla: 4.287 ± 0.145
0.667LysCys: 0.667 ± 0.049
4.224LysAsp: 4.224 ± 0.154
6.562LysGlu: 6.562 ± 0.153
2.394LysPhe: 2.394 ± 0.094
4.141LysGly: 4.141 ± 0.13
1.995LysHis: 1.995 ± 0.085
6.522LysIle: 6.522 ± 0.156
8.061LysLys: 8.061 ± 0.179
6.853LysLeu: 6.853 ± 0.177
1.797LysMet: 1.797 ± 0.067
5.079LysAsn: 5.079 ± 0.145
2.229LysPro: 2.229 ± 0.091
3.537LysGln: 3.537 ± 0.115
3.547LysArg: 3.547 ± 0.105
4.624LysSer: 4.624 ± 0.142
4.31LysThr: 4.31 ± 0.113
4.158LysVal: 4.158 ± 0.125
0.67LysTrp: 0.67 ± 0.052
2.985LysTyr: 2.985 ± 0.105
0.0LysXaa: 0.0 ± 0.0
Leu
6.546LeuAla: 6.546 ± 0.165
1.443LeuCys: 1.443 ± 0.076
5.063LeuAsp: 5.063 ± 0.126
6.592LeuGlu: 6.592 ± 0.166
5.043LeuPhe: 5.043 ± 0.16
5.426LeuGly: 5.426 ± 0.153
3.147LeuHis: 3.147 ± 0.153
7.361LeuIle: 7.361 ± 0.187
7.107LeuLys: 7.107 ± 0.148
12.48LeuLeu: 12.48 ± 0.316
2.031LeuMet: 2.031 ± 0.086
4.884LeuAsn: 4.884 ± 0.111
4.214LeuPro: 4.214 ± 0.136
4.234LeuGln: 4.234 ± 0.138
3.537LeuArg: 3.537 ± 0.118
7.794LeuSer: 7.794 ± 0.168
5.301LeuThr: 5.301 ± 0.127
5.489LeuVal: 5.489 ± 0.135
0.73LeuTrp: 0.73 ± 0.049
3.613LeuTyr: 3.613 ± 0.112
0.0LeuXaa: 0.0 ± 0.0
Met
1.638MetAla: 1.638 ± 0.076
0.258MetCys: 0.258 ± 0.032
1.288MetAsp: 1.288 ± 0.067
1.456MetGlu: 1.456 ± 0.069
0.845MetPhe: 0.845 ± 0.055
1.304MetGly: 1.304 ± 0.067
0.75MetHis: 0.75 ± 0.056
1.826MetIle: 1.826 ± 0.081
1.334MetLys: 1.334 ± 0.06
2.457MetLeu: 2.457 ± 0.095
0.4MetMet: 0.4 ± 0.035
1.03MetAsn: 1.03 ± 0.053
0.925MetPro: 0.925 ± 0.052
1.073MetGln: 1.073 ± 0.063
1.044MetArg: 1.044 ± 0.057
1.377MetSer: 1.377 ± 0.06
0.921MetThr: 0.921 ± 0.062
1.734MetVal: 1.734 ± 0.079
0.122MetTrp: 0.122 ± 0.019
0.842MetTyr: 0.842 ± 0.051
0.0MetXaa: 0.0 ± 0.0
Asn
3.233AsnAla: 3.233 ± 0.123
0.651AsnCys: 0.651 ± 0.053
2.48AsnAsp: 2.48 ± 0.091
2.797AsnGlu: 2.797 ± 0.121
2.021AsnPhe: 2.021 ± 0.086
2.91AsnGly: 2.91 ± 0.121
1.38AsnHis: 1.38 ± 0.067
4.706AsnIle: 4.706 ± 0.144
4.858AsnLys: 4.858 ± 0.175
4.759AsnLeu: 4.759 ± 0.129
1.162AsnMet: 1.162 ± 0.06
3.534AsnAsn: 3.534 ± 0.148
2.12AsnPro: 2.12 ± 0.089
2.49AsnGln: 2.49 ± 0.08
2.166AsnArg: 2.166 ± 0.09
2.698AsnSer: 2.698 ± 0.111
3.177AsnThr: 3.177 ± 0.125
2.876AsnVal: 2.876 ± 0.102
0.555AsnTrp: 0.555 ± 0.043
1.925AsnTyr: 1.925 ± 0.097
0.0AsnXaa: 0.0 ± 0.0
Pro
1.724ProAla: 1.724 ± 0.086
0.449ProCys: 0.449 ± 0.036
1.697ProAsp: 1.697 ± 0.074
2.223ProGlu: 2.223 ± 0.091
1.783ProPhe: 1.783 ± 0.081
1.688ProGly: 1.688 ± 0.093
0.829ProHis: 0.829 ± 0.053
3.705ProIle: 3.705 ± 0.105
2.48ProLys: 2.48 ± 0.097
4.164ProLeu: 4.164 ± 0.13
0.799ProMet: 0.799 ± 0.051
2.186ProAsn: 2.186 ± 0.094
1.04ProPro: 1.04 ± 0.066
1.03ProGln: 1.03 ± 0.058
1.106ProArg: 1.106 ± 0.057
2.464ProSer: 2.464 ± 0.091
2.127ProThr: 2.127 ± 0.079
2.097ProVal: 2.097 ± 0.084
0.376ProTrp: 0.376 ± 0.036
1.44ProTyr: 1.44 ± 0.075
0.0ProXaa: 0.0 ± 0.0
Gln
2.563GlnAla: 2.563 ± 0.099
0.357GlnCys: 0.357 ± 0.035
2.005GlnAsp: 2.005 ± 0.087
3.299GlnGlu: 3.299 ± 0.111
1.447GlnPhe: 1.447 ± 0.067
2.209GlnGly: 2.209 ± 0.094
1.08GlnHis: 1.08 ± 0.054
2.979GlnIle: 2.979 ± 0.104
3.487GlnLys: 3.487 ± 0.123
4.3GlnLeu: 4.3 ± 0.132
0.796GlnMet: 0.796 ± 0.049
2.044GlnAsn: 2.044 ± 0.086
1.262GlnPro: 1.262 ± 0.07
2.153GlnGln: 2.153 ± 0.107
1.849GlnArg: 1.849 ± 0.083
2.375GlnSer: 2.375 ± 0.092
1.902GlnThr: 1.902 ± 0.091
2.279GlnVal: 2.279 ± 0.092
0.343GlnTrp: 0.343 ± 0.034
1.509GlnTyr: 1.509 ± 0.071
0.003GlnXaa: 0.003 ± 0.003
Arg
2.312ArgAla: 2.312 ± 0.101
0.334ArgCys: 0.334 ± 0.036
1.833ArgAsp: 1.833 ± 0.09
2.252ArgGlu: 2.252 ± 0.11
1.965ArgPhe: 1.965 ± 0.082
2.064ArgGly: 2.064 ± 0.092
0.852ArgHis: 0.852 ± 0.057
3.2ArgIle: 3.2 ± 0.095
3.369ArgLys: 3.369 ± 0.104
3.953ArgLeu: 3.953 ± 0.104
0.961ArgMet: 0.961 ± 0.051
2.295ArgAsn: 2.295 ± 0.085
1.364ArgPro: 1.364 ± 0.067
1.519ArgGln: 1.519 ± 0.076
1.688ArgArg: 1.688 ± 0.079
2.517ArgSer: 2.517 ± 0.092
2.143ArgThr: 2.143 ± 0.084
2.256ArgVal: 2.256 ± 0.095
0.386ArgTrp: 0.386 ± 0.035
1.721ArgTyr: 1.721 ± 0.08
0.0ArgXaa: 0.0 ± 0.0
Ser
3.256SerAla: 3.256 ± 0.095
1.133SerCys: 1.133 ± 0.064
3.174SerAsp: 3.174 ± 0.116
3.23SerGlu: 3.23 ± 0.1
3.662SerPhe: 3.662 ± 0.117
4.039SerGly: 4.039 ± 0.114
1.42SerHis: 1.42 ± 0.059
6.483SerIle: 6.483 ± 0.144
4.766SerLys: 4.766 ± 0.127
6.863SerLeu: 6.863 ± 0.144
1.618SerMet: 1.618 ± 0.089
3.623SerAsn: 3.623 ± 0.125
2.143SerPro: 2.143 ± 0.099
1.995SerGln: 1.995 ± 0.076
2.457SerArg: 2.457 ± 0.094
4.881SerSer: 4.881 ± 0.18
3.844SerThr: 3.844 ± 0.115
3.623SerVal: 3.623 ± 0.105
0.585SerTrp: 0.585 ± 0.046
2.738SerTyr: 2.738 ± 0.105
0.0SerXaa: 0.0 ± 0.0
Thr
3.563ThrAla: 3.563 ± 0.098
0.68ThrCys: 0.68 ± 0.048
2.764ThrAsp: 2.764 ± 0.1
3.015ThrGlu: 3.015 ± 0.094
2.612ThrPhe: 2.612 ± 0.091
3.395ThrGly: 3.395 ± 0.109
1.546ThrHis: 1.546 ± 0.08
5.585ThrIle: 5.585 ± 0.149
3.884ThrLys: 3.884 ± 0.114
5.971ThrLeu: 5.971 ± 0.14
1.129ThrMet: 1.129 ± 0.056
2.556ThrAsn: 2.556 ± 0.079
3.028ThrPro: 3.028 ± 0.152
2.256ThrGln: 2.256 ± 0.106
1.773ThrArg: 1.773 ± 0.072
3.58ThrSer: 3.58 ± 0.103
3.927ThrThr: 3.927 ± 0.128
3.121ThrVal: 3.121 ± 0.111
0.505ThrTrp: 0.505 ± 0.042
2.186ThrTyr: 2.186 ± 0.092
0.003ThrXaa: 0.003 ± 0.003
Val
4.145ValAla: 4.145 ± 0.118
0.786ValCys: 0.786 ± 0.044
3.223ValAsp: 3.223 ± 0.116
3.596ValGlu: 3.596 ± 0.128
2.487ValPhe: 2.487 ± 0.097
3.425ValGly: 3.425 ± 0.122
1.552ValHis: 1.552 ± 0.074
4.683ValIle: 4.683 ± 0.122
3.983ValLys: 3.983 ± 0.115
5.68ValLeu: 5.68 ± 0.165
1.304ValMet: 1.304 ± 0.068
2.688ValAsn: 2.688 ± 0.107
1.915ValPro: 1.915 ± 0.077
2.061ValGln: 2.061 ± 0.087
2.13ValArg: 2.13 ± 0.065
4.009ValSer: 4.009 ± 0.107
3.514ValThr: 3.514 ± 0.114
4.019ValVal: 4.019 ± 0.142
0.522ValTrp: 0.522 ± 0.038
2.17ValTyr: 2.17 ± 0.098
0.0ValXaa: 0.0 ± 0.0
Trp
0.443TrpAla: 0.443 ± 0.042
0.178TrpCys: 0.178 ± 0.024
0.4TrpAsp: 0.4 ± 0.034
0.581TrpGlu: 0.581 ± 0.043
0.343TrpPhe: 0.343 ± 0.032
0.505TrpGly: 0.505 ± 0.041
0.291TrpHis: 0.291 ± 0.033
0.845TrpIle: 0.845 ± 0.062
0.687TrpLys: 0.687 ± 0.043
0.994TrpLeu: 0.994 ± 0.061
0.261TrpMet: 0.261 ± 0.028
0.443TrpAsn: 0.443 ± 0.046
0.248TrpPro: 0.248 ± 0.031
0.376TrpGln: 0.376 ± 0.038
0.304TrpArg: 0.304 ± 0.028
0.651TrpSer: 0.651 ± 0.049
0.499TrpThr: 0.499 ± 0.044
0.479TrpVal: 0.479 ± 0.041
0.135TrpTrp: 0.135 ± 0.026
0.357TrpTyr: 0.357 ± 0.037
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.589TyrAla: 2.589 ± 0.101
0.585TyrCys: 0.585 ± 0.042
1.985TyrAsp: 1.985 ± 0.1
2.048TyrGlu: 2.048 ± 0.086
1.962TyrPhe: 1.962 ± 0.084
2.417TyrGly: 2.417 ± 0.084
1.136TyrHis: 1.136 ± 0.053
3.246TyrIle: 3.246 ± 0.112
3.042TyrLys: 3.042 ± 0.116
3.814TyrLeu: 3.814 ± 0.111
0.908TyrMet: 0.908 ± 0.058
2.302TyrAsn: 2.302 ± 0.105
1.486TyrPro: 1.486 ± 0.076
1.582TyrGln: 1.582 ± 0.076
1.711TyrArg: 1.711 ± 0.071
2.312TyrSer: 2.312 ± 0.09
2.596TyrThr: 2.596 ± 0.08
1.965TyrVal: 1.965 ± 0.08
0.476TyrTrp: 0.476 ± 0.044
1.886TyrTyr: 1.886 ± 0.102
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.003XaaPhe: 0.003 ± 0.003
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.007XaaLeu: 0.007 ± 0.004
0.0XaaMet: 0.0 ± 0.0
0.003XaaAsn: 0.003 ± 0.003
0.0XaaPro: 0.0 ± 0.0
0.007XaaGln: 0.007 ± 0.004
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.086XaaXaa: 0.086 ± 0.061
Statistics based on 917 proteins (302800 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski