Amino acid dipeptide frequency for Megasphaera vaginalis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
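As a rough illustration of what such a calculation could look like, the sketch below counts overlapping adjacent residue pairs within each protein and pools them over all sequences. This counting scheme is an assumption, since the page does not describe the exact procedure; the function name `dipeptide_frequencies` and the toy sequences are hypothetical, and one-letter codes are used instead of the three-letter codes shown in the table.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille}, pooled over all sequences."""
    counts = Counter()
    for seq in sequences:
        # Assumption: any non-standard or unknown letter is mapped to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping adjacent pairs: positions (0,1), (1,2), ...
        # Pairs never span two different proteins.
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (not real proteome data): 6 pairs in total, 2 of them "AA".
freqs = dipeptide_frequencies(["MKAAL", "AAK"])
```

Pooling counts over the whole proteome, rather than averaging per-protein frequencies, gives longer proteins more weight; which of the two the database uses is not stated here.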

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly and uniformly in proteins, each one would therefore have a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
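The arithmetic behind the expected value can be sketched as follows. The comparison rule in `mark` (strictly above or below the uniform expectation) is an assumption, as the page does not state the exact threshold used for the colouring; the two example values are taken from the table (AlaAla and TrpTrp).

```python
N_STANDARD = 20

# Uniform expectation: every one of the 20 x 20 dipeptides equally likely.
expected_fraction = 1.0 / (N_STANDARD ** 2)      # 1/400 = 0.0025 = 0.25%
expected_permille = 1000.0 * expected_fraction   # 2.5 per mille

def mark(observed_permille, expected=expected_permille):
    """Classify a dipeptide relative to the uniform expectation (hypothetical rule)."""
    if observed_permille > expected:
        return "over"    # would be shown in red
    if observed_permille < expected:
        return "under"   # would be shown in blue
    return "as expected"

# Two values from the table, in per mille.
examples = {"AlaAla": 14.451, "TrpTrp": 0.094}
marks = {name: mark(value) for name, value in examples.items()}
```

With Xaa included, the same calculation would use 21 instead of 20, giving 1/441.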

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
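For example, the unit conversion can be written as below; the helper names are illustrative only, and the sample value is the AlaAla entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1 per mille = 0.1%)."""
    return value_permille * 0.1

ala_ala = 14.451                           # AlaAla, in per mille
fraction = permille_to_fraction(ala_ala)   # about 0.014451
percent = permille_to_percent(ala_ala)     # about 1.4451 %
```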

For more information see sequence space article on Wikipedia.

Rows give the first residue of the dipeptide and columns the second; each cell is the frequency ± error in ‰.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 14.451 ± 0.231 | 1.322 ± 0.049 | 5.663 ± 0.105 | 6.279 ± 0.123 | 3.762 ± 0.078 | 8.002 ± 0.142 | 1.572 ± 0.05 | 5.502 ± 0.102 | 4.719 ± 0.106 | 9.31 ± 0.135 | 2.851 ± 0.067 | 2.744 ± 0.092 | 2.798 ± 0.073 | 2.908 ± 0.07 | 3.808 ± 0.084 | 4.321 ± 0.09 | 4.135 ± 0.078 | 9.759 ± 0.171 | 0.843 ± 0.039 | 3.434 ± 0.071 | 0.0 ± 0.0
Cys | 1.113 ± 0.042 | 0.275 ± 0.024 | 0.703 ± 0.035 | 0.619 ± 0.029 | 0.488 ± 0.027 | 1.501 ± 0.054 | 0.373 ± 0.027 | 0.844 ± 0.033 | 0.557 ± 0.031 | 1.32 ± 0.053 | 0.313 ± 0.022 | 0.409 ± 0.023 | 0.637 ± 0.042 | 0.35 ± 0.022 | 1.147 ± 0.046 | 0.811 ± 0.038 | 0.741 ± 0.039 | 0.798 ± 0.04 | 0.122 ± 0.014 | 0.394 ± 0.025 | 0.0 ± 0.0
Asp | 4.841 ± 0.095 | 0.724 ± 0.031 | 3.002 ± 0.08 | 3.662 ± 0.077 | 2.322 ± 0.058 | 4.92 ± 0.099 | 1.037 ± 0.04 | 3.901 ± 0.072 | 3.229 ± 0.076 | 4.824 ± 0.095 | 1.687 ± 0.055 | 1.921 ± 0.064 | 1.995 ± 0.052 | 1.195 ± 0.046 | 2.744 ± 0.06 | 2.798 ± 0.063 | 3.119 ± 0.08 | 4.276 ± 0.076 | 0.595 ± 0.036 | 2.26 ± 0.068 | 0.0 ± 0.0
Glu | 5.914 ± 0.096 | 0.578 ± 0.031 | 2.97 ± 0.072 | 4.984 ± 0.102 | 1.986 ± 0.055 | 3.902 ± 0.072 | 1.138 ± 0.042 | 4.36 ± 0.1 | 4.991 ± 0.091 | 5.884 ± 0.099 | 2.0 ± 0.056 | 2.716 ± 0.069 | 1.784 ± 0.055 | 2.202 ± 0.063 | 3.994 ± 0.091 | 2.721 ± 0.056 | 3.633 ± 0.074 | 3.557 ± 0.082 | 0.595 ± 0.037 | 1.918 ± 0.053 | 0.0 ± 0.0
Phe | 3.685 ± 0.076 | 0.675 ± 0.033 | 2.271 ± 0.05 | 1.749 ± 0.055 | 2.001 ± 0.062 | 3.268 ± 0.085 | 0.852 ± 0.037 | 2.859 ± 0.077 | 1.548 ± 0.046 | 3.907 ± 0.093 | 1.092 ± 0.048 | 1.395 ± 0.043 | 1.455 ± 0.046 | 1.027 ± 0.042 | 1.868 ± 0.055 | 2.818 ± 0.074 | 2.458 ± 0.069 | 2.75 ± 0.069 | 0.432 ± 0.027 | 1.376 ± 0.056 | 0.0 ± 0.0
Gly | 7.005 ± 0.157 | 1.223 ± 0.044 | 4.003 ± 0.093 | 4.286 ± 0.082 | 3.008 ± 0.073 | 6.015 ± 0.123 | 1.553 ± 0.054 | 5.981 ± 0.12 | 5.371 ± 0.092 | 6.862 ± 0.114 | 2.289 ± 0.061 | 2.999 ± 0.108 | 1.852 ± 0.061 | 2.374 ± 0.059 | 4.089 ± 0.082 | 4.454 ± 0.099 | 4.944 ± 0.122 | 5.524 ± 0.105 | 0.715 ± 0.04 | 2.912 ± 0.068 | 0.0 ± 0.0
His | 1.566 ± 0.047 | 0.367 ± 0.026 | 1.154 ± 0.047 | 1.112 ± 0.043 | 0.861 ± 0.035 | 1.662 ± 0.055 | 0.573 ± 0.036 | 1.509 ± 0.054 | 0.914 ± 0.042 | 1.903 ± 0.056 | 0.564 ± 0.032 | 0.735 ± 0.036 | 0.95 ± 0.04 | 0.651 ± 0.033 | 1.163 ± 0.044 | 1.083 ± 0.043 | 1.133 ± 0.041 | 1.576 ± 0.055 | 0.271 ± 0.023 | 0.736 ± 0.033 | 0.0 ± 0.0
Ile | 6.489 ± 0.104 | 1.11 ± 0.046 | 4.013 ± 0.083 | 3.536 ± 0.08 | 2.447 ± 0.073 | 5.721 ± 0.118 | 1.375 ± 0.049 | 4.585 ± 0.102 | 2.979 ± 0.068 | 5.96 ± 0.113 | 1.779 ± 0.058 | 2.391 ± 0.063 | 2.899 ± 0.075 | 1.75 ± 0.055 | 3.703 ± 0.087 | 4.112 ± 0.082 | 3.93 ± 0.086 | 4.982 ± 0.092 | 0.56 ± 0.029 | 2.118 ± 0.062 | 0.0 ± 0.0
Lys | 5.152 ± 0.115 | 0.409 ± 0.028 | 3.209 ± 0.083 | 4.742 ± 0.098 | 1.43 ± 0.05 | 3.919 ± 0.088 | 0.916 ± 0.038 | 3.787 ± 0.078 | 4.422 ± 0.087 | 4.46 ± 0.085 | 1.773 ± 0.051 | 2.588 ± 0.075 | 1.655 ± 0.05 | 1.852 ± 0.061 | 3.077 ± 0.076 | 2.518 ± 0.064 | 3.481 ± 0.087 | 3.475 ± 0.079 | 0.468 ± 0.027 | 1.836 ± 0.059 | 0.0 ± 0.0
Leu | 9.678 ± 0.153 | 1.384 ± 0.047 | 4.946 ± 0.083 | 5.09 ± 0.103 | 4.009 ± 0.103 | 6.661 ± 0.113 | 2.231 ± 0.062 | 5.653 ± 0.105 | 4.567 ± 0.085 | 9.744 ± 0.154 | 2.269 ± 0.062 | 2.994 ± 0.077 | 4.605 ± 0.089 | 4.162 ± 0.098 | 5.387 ± 0.1 | 6.077 ± 0.105 | 6.077 ± 0.096 | 5.651 ± 0.114 | 0.836 ± 0.035 | 3.183 ± 0.083 | 0.0 ± 0.0
Met | 2.92 ± 0.064 | 0.216 ± 0.017 | 1.538 ± 0.053 | 1.931 ± 0.058 | 0.78 ± 0.033 | 1.948 ± 0.05 | 0.499 ± 0.027 | 1.808 ± 0.054 | 2.106 ± 0.057 | 2.386 ± 0.056 | 0.833 ± 0.038 | 1.396 ± 0.042 | 1.218 ± 0.032 | 0.929 ± 0.037 | 1.381 ± 0.045 | 1.512 ± 0.048 | 1.931 ± 0.056 | 1.604 ± 0.053 | 0.14 ± 0.014 | 0.789 ± 0.038 | 0.0 ± 0.0
Asn | 2.867 ± 0.085 | 0.49 ± 0.033 | 2.053 ± 0.062 | 2.096 ± 0.062 | 1.239 ± 0.043 | 3.163 ± 0.088 | 0.776 ± 0.041 | 2.598 ± 0.075 | 1.965 ± 0.067 | 3.113 ± 0.066 | 1.007 ± 0.037 | 1.344 ± 0.071 | 1.699 ± 0.05 | 1.092 ± 0.043 | 2.003 ± 0.062 | 1.816 ± 0.066 | 2.096 ± 0.087 | 2.59 ± 0.077 | 0.414 ± 0.024 | 1.34 ± 0.048 | 0.0 ± 0.0
Pro | 3.574 ± 0.088 | 0.447 ± 0.028 | 2.391 ± 0.066 | 2.74 ± 0.066 | 1.779 ± 0.053 | 2.442 ± 0.065 | 0.832 ± 0.035 | 2.106 ± 0.057 | 1.58 ± 0.05 | 3.904 ± 0.081 | 0.92 ± 0.035 | 1.147 ± 0.043 | 1.209 ± 0.046 | 1.594 ± 0.051 | 1.398 ± 0.049 | 1.953 ± 0.054 | 1.906 ± 0.045 | 3.326 ± 0.085 | 0.376 ± 0.03 | 1.569 ± 0.046 | 0.0 ± 0.0
Gln | 3.361 ± 0.074 | 0.359 ± 0.028 | 1.551 ± 0.052 | 2.359 ± 0.062 | 1.109 ± 0.043 | 2.413 ± 0.061 | 0.712 ± 0.033 | 2.299 ± 0.058 | 2.052 ± 0.052 | 3.122 ± 0.07 | 1.031 ± 0.038 | 1.118 ± 0.044 | 1.192 ± 0.046 | 1.635 ± 0.063 | 2.111 ± 0.065 | 1.76 ± 0.056 | 1.635 ± 0.057 | 2.118 ± 0.056 | 0.391 ± 0.028 | 1.25 ± 0.045 | 0.0 ± 0.0
Arg | 3.708 ± 0.07 | 0.756 ± 0.036 | 2.744 ± 0.069 | 3.747 ± 0.086 | 2.275 ± 0.049 | 3.098 ± 0.077 | 1.392 ± 0.05 | 3.759 ± 0.08 | 3.096 ± 0.068 | 5.578 ± 0.121 | 1.433 ± 0.051 | 2.036 ± 0.054 | 1.945 ± 0.063 | 2.713 ± 0.061 | 4.106 ± 0.103 | 2.873 ± 0.071 | 2.652 ± 0.064 | 3.121 ± 0.066 | 0.541 ± 0.032 | 2.067 ± 0.065 | 0.0 ± 0.0
Ser | 4.876 ± 0.095 | 0.759 ± 0.037 | 2.809 ± 0.073 | 2.88 ± 0.068 | 2.716 ± 0.059 | 4.979 ± 0.117 | 1.239 ± 0.045 | 3.165 ± 0.084 | 2.49 ± 0.07 | 5.987 ± 0.113 | 1.338 ± 0.044 | 1.674 ± 0.051 | 2.061 ± 0.053 | 1.691 ± 0.056 | 2.891 ± 0.073 | 2.847 ± 0.083 | 2.658 ± 0.07 | 4.261 ± 0.096 | 0.599 ± 0.033 | 1.977 ± 0.058 | 0.0 ± 0.0
Thr | 6.708 ± 0.116 | 0.643 ± 0.031 | 3.075 ± 0.066 | 3.437 ± 0.075 | 2.187 ± 0.054 | 4.876 ± 0.115 | 0.937 ± 0.037 | 3.785 ± 0.074 | 2.689 ± 0.06 | 5.434 ± 0.097 | 1.437 ± 0.046 | 1.857 ± 0.08 | 2.439 ± 0.062 | 1.477 ± 0.049 | 2.144 ± 0.061 | 2.515 ± 0.067 | 3.115 ± 0.096 | 5.409 ± 0.097 | 0.552 ± 0.031 | 2.153 ± 0.061 | 0.0 ± 0.0
Val | 6.643 ± 0.129 | 1.121 ± 0.047 | 4.015 ± 0.079 | 3.864 ± 0.088 | 3.148 ± 0.078 | 4.987 ± 0.115 | 1.395 ± 0.051 | 5.157 ± 0.099 | 3.729 ± 0.089 | 7.098 ± 0.118 | 2.103 ± 0.059 | 2.654 ± 0.067 | 3.175 ± 0.079 | 2.211 ± 0.064 | 3.933 ± 0.079 | 4.596 ± 0.092 | 4.692 ± 0.103 | 5.479 ± 0.129 | 0.634 ± 0.03 | 2.581 ± 0.07 | 0.0 ± 0.0
Trp | 0.672 ± 0.036 | 0.12 ± 0.014 | 0.538 ± 0.034 | 0.531 ± 0.03 | 0.467 ± 0.028 | 0.633 ± 0.031 | 0.248 ± 0.023 | 0.607 ± 0.033 | 0.534 ± 0.027 | 1.027 ± 0.043 | 0.255 ± 0.024 | 0.423 ± 0.024 | 0.31 ± 0.021 | 0.646 ± 0.035 | 0.659 ± 0.041 | 0.467 ± 0.026 | 0.441 ± 0.026 | 0.491 ± 0.025 | 0.094 ± 0.011 | 0.356 ± 0.028 | 0.0 ± 0.0
Tyr | 2.961 ± 0.076 | 0.537 ± 0.026 | 2.404 ± 0.067 | 2.184 ± 0.063 | 1.469 ± 0.047 | 3.347 ± 0.072 | 0.835 ± 0.038 | 2.175 ± 0.066 | 1.632 ± 0.053 | 3.382 ± 0.089 | 0.89 ± 0.035 | 1.218 ± 0.048 | 1.312 ± 0.04 | 1.118 ± 0.037 | 2.036 ± 0.062 | 1.848 ± 0.056 | 2.094 ± 0.054 | 2.465 ± 0.06 | 0.342 ± 0.025 | 1.402 ± 0.051 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 2196 proteins (657555 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
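One way such a protein-level bootstrap could be implemented is sketched below: whole proteins are resampled with replacement, the pooled frequency is recomputed for each replicate, and the spread of the replicates is reported. Using the standard deviation as the error measure is an assumption (the page does not say which statistic is reported), and the function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of `pair`
    over `n_boot` resamples of whole proteins."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        # Resample proteins (not individual residues) with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Toy input (not real proteome data).
mean_aa, err_aa = bootstrap_error(["MKAAL", "AAK", "AAAA"], "AA")
```

Resampling at the protein level keeps each protein's internal composition intact, so the error reflects variation between proteins rather than between individual positions.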

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski