Amino acid dipeptide frequency for Melissococcus plutonius (strain ATCC 35311 / CIP 104052 / LMG 20360 / NCIMB 702443)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
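The counting behind such a table can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by Proteome-pI: it assumes dipeptides are counted as overlapping pairs of adjacent residues within each protein, that sequences use one-letter codes, and that any non-standard letter is pooled as X (Xaa). The function name `dipeptide_permille` and the toy sequences are hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes

def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2; pairs never span two proteins.
        for a, b in zip(seq, seq[1:]):
            # Assumption: anything outside the 20 standard letters is pooled as X (Xaa).
            a = a if a in STANDARD else "X"
            b = b if b in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation if all 400 standard dipeptides were equally likely:
# 1000 / 400 = 2.5 per mille, i.e. 0.25%.
EXPECTED_PERMILLE = 1000.0 / len(STANDARD) ** 2

toy = ["MKKLLA", "MKLL"]  # made-up sequences, not taken from the proteome
freqs = dipeptide_permille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

By construction the frequencies of all observed dipeptides sum to 1000‰, which is a quick sanity check when applying the same idea to a full proteome.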

All values are given in per mille (‰); multiply by 10⁻³ to obtain fractions.
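As a small worked example of the unit conversion, the sketch below turns a few per mille values from the table into plain fractions and compares them with the uniform expectation of 1000/400 = 2.5‰. The helper names are hypothetical, and treating 2.5‰ as the over/under-representation threshold is an assumption based on the description above, not a confirmed detail of how the page assigns colours.

```python
EXPECTED_PERMILLE = 1000.0 / 400  # uniform expectation over 400 dipeptides

def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def representation(value, expected=EXPECTED_PERMILLE):
    """Label a per mille value relative to the assumed uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"

# Three values copied from the table (per mille).
examples = {"LeuLeu": 10.746, "TrpCys": 0.068, "AspPhe": 2.364}
for name, value in examples.items():
    print(name, permille_to_fraction(value), representation(value))
```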

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.345 ± 0.096
AlaCys: 0.545 ± 0.037
AlaAsp: 2.973 ± 0.075
AlaGlu: 3.848 ± 0.087
AlaPhe: 2.943 ± 0.078
AlaGly: 4.286 ± 0.092
AlaHis: 1.023 ± 0.051
AlaIle: 6.026 ± 0.11
AlaLys: 4.929 ± 0.103
AlaLeu: 6.16 ± 0.112
AlaMet: 1.835 ± 0.071
AlaAsn: 3.077 ± 0.086
AlaPro: 1.611 ± 0.054
AlaGln: 2.33 ± 0.073
AlaArg: 2.281 ± 0.077
AlaSer: 3.633 ± 0.086
AlaThr: 3.757 ± 0.093
AlaVal: 3.899 ± 0.079
AlaTrp: 0.508 ± 0.033
AlaTyr: 2.404 ± 0.061
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.389 ± 0.029
CysCys: 0.102 ± 0.013
CysAsp: 0.315 ± 0.027
CysGlu: 0.37 ± 0.024
CysPhe: 0.402 ± 0.027
CysGly: 0.556 ± 0.033
CysHis: 0.188 ± 0.022
CysIle: 0.605 ± 0.037
CysLys: 0.304 ± 0.026
CysLeu: 0.865 ± 0.043
CysMet: 0.176 ± 0.018
CysAsn: 0.296 ± 0.025
CysPro: 0.311 ± 0.024
CysGln: 0.323 ± 0.023
CysArg: 0.216 ± 0.02
CysSer: 0.51 ± 0.029
CysThr: 0.357 ± 0.024
CysVal: 0.414 ± 0.027
CysTrp: 0.083 ± 0.012
CysTyr: 0.359 ± 0.027
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.681 ± 0.073
AspCys: 0.321 ± 0.023
AspAsp: 2.144 ± 0.071
AspGlu: 3.825 ± 0.113
AspPhe: 2.364 ± 0.063
AspGly: 3.075 ± 0.083
AspHis: 1.117 ± 0.049
AspIle: 4.193 ± 0.094
AspLys: 3.633 ± 0.091
AspLeu: 4.561 ± 0.099
AspMet: 1.205 ± 0.05
AspAsn: 2.165 ± 0.071
AspPro: 1.77 ± 0.059
AspGln: 2.26 ± 0.063
AspArg: 1.85 ± 0.058
AspSer: 2.662 ± 0.07
AspThr: 2.808 ± 0.071
AspVal: 2.84 ± 0.072
AspTrp: 0.579 ± 0.035
AspTyr: 2.521 ± 0.076
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.314 ± 0.099
GluCys: 0.357 ± 0.025
GluAsp: 3.102 ± 0.085
GluGlu: 5.811 ± 0.119
GluPhe: 2.334 ± 0.069
GluGly: 3.258 ± 0.084
GluHis: 1.089 ± 0.047
GluIle: 6.079 ± 0.102
GluLys: 7.342 ± 0.113
GluLeu: 6.648 ± 0.142
GluMet: 2.081 ± 0.057
GluAsn: 4.447 ± 0.091
GluPro: 1.527 ± 0.053
GluGln: 3.544 ± 0.1
GluArg: 2.709 ± 0.077
GluSer: 2.804 ± 0.067
GluThr: 3.914 ± 0.089
GluVal: 3.948 ± 0.086
GluTrp: 0.628 ± 0.037
GluTyr: 1.996 ± 0.06
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.652 ± 0.061
PheCys: 0.397 ± 0.03
PheAsp: 2.503 ± 0.073
PheGlu: 2.406 ± 0.07
PhePhe: 2.633 ± 0.087
PheGly: 3.049 ± 0.075
PheHis: 0.913 ± 0.039
PheIle: 4.468 ± 0.11
PheLys: 2.762 ± 0.069
PheLeu: 4.878 ± 0.123
PheMet: 1.14 ± 0.043
PheAsn: 2.432 ± 0.073
PhePro: 1.567 ± 0.058
PheGln: 1.662 ± 0.056
PheArg: 1.271 ± 0.056
PheSer: 3.694 ± 0.092
PheThr: 2.724 ± 0.083
PheVal: 2.854 ± 0.081
PheTrp: 0.459 ± 0.03
PheTyr: 2.005 ± 0.071
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.647 ± 0.093
GlyCys: 0.52 ± 0.03
GlyAsp: 2.743 ± 0.072
GlyGlu: 3.87 ± 0.083
GlyPhe: 2.935 ± 0.072
GlyGly: 4.047 ± 0.128
GlyHis: 1.142 ± 0.047
GlyIle: 6.502 ± 0.113
GlyLys: 5.223 ± 0.101
GlyLeu: 5.642 ± 0.114
GlyMet: 1.844 ± 0.065
GlyAsn: 3.193 ± 0.088
GlyPro: 1.362 ± 0.055
GlyGln: 2.243 ± 0.057
GlyArg: 2.258 ± 0.076
GlySer: 3.487 ± 0.085
GlyThr: 3.872 ± 0.089
GlyVal: 3.667 ± 0.087
GlyTrp: 0.637 ± 0.036
GlyTyr: 2.666 ± 0.067
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.106 ± 0.052
HisCys: 0.18 ± 0.019
HisAsp: 0.825 ± 0.041
HisGlu: 1.125 ± 0.053
HisPhe: 1.104 ± 0.051
HisGly: 1.252 ± 0.057
HisHis: 0.556 ± 0.033
HisIle: 1.406 ± 0.056
HisLys: 1.116 ± 0.047
HisLeu: 2.208 ± 0.072
HisMet: 0.459 ± 0.032
HisAsn: 0.755 ± 0.041
HisPro: 1.044 ± 0.049
HisGln: 1.023 ± 0.047
HisArg: 0.751 ± 0.043
HisSer: 1.237 ± 0.05
HisThr: 1.129 ± 0.055
HisVal: 1.042 ± 0.037
HisTrp: 0.241 ± 0.021
HisTyr: 1.064 ± 0.052
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.982 ± 0.112
IleCys: 0.827 ± 0.045
IleAsp: 5.145 ± 0.088
IleGlu: 5.982 ± 0.118
IlePhe: 4.197 ± 0.118
IleGly: 6.094 ± 0.117
IleHis: 2.055 ± 0.063
IleIle: 7.593 ± 0.153
IleLys: 6.035 ± 0.112
IleLeu: 8.568 ± 0.169
IleMet: 1.905 ± 0.06
IleAsn: 4.472 ± 0.087
IlePro: 3.396 ± 0.082
IleGln: 4.005 ± 0.082
IleArg: 3.043 ± 0.08
IleSer: 5.542 ± 0.111
IleThr: 5.394 ± 0.097
IleVal: 5.586 ± 0.101
IleTrp: 0.674 ± 0.034
IleTyr: 3.436 ± 0.086
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.417 ± 0.101
LysCys: 0.307 ± 0.024
LysAsp: 3.967 ± 0.1
LysGlu: 7.337 ± 0.126
LysPhe: 2.148 ± 0.059
LysGly: 3.927 ± 0.09
LysHis: 1.283 ± 0.048
LysIle: 7.058 ± 0.124
LysLys: 8.409 ± 0.159
LysLeu: 6.155 ± 0.112
LysMet: 2.402 ± 0.064
LysAsn: 5.965 ± 0.12
LysPro: 2.03 ± 0.061
LysGln: 4.413 ± 0.11
LysArg: 3.074 ± 0.077
LysSer: 3.504 ± 0.079
LysThr: 4.648 ± 0.108
LysVal: 4.235 ± 0.088
LysTrp: 0.668 ± 0.033
LysTyr: 2.423 ± 0.072
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.227 ± 0.11
LeuCys: 0.651 ± 0.04
LeuAsp: 4.874 ± 0.106
LeuGlu: 5.901 ± 0.117
LeuPhe: 5.269 ± 0.141
LeuGly: 6.052 ± 0.123
LeuHis: 1.616 ± 0.053
LeuIle: 8.344 ± 0.15
LeuLys: 7.145 ± 0.117
LeuLeu: 10.746 ± 0.168
LeuMet: 2.495 ± 0.061
LeuAsn: 5.284 ± 0.094
LeuPro: 4.165 ± 0.092
LeuGln: 3.39 ± 0.089
LeuArg: 3.045 ± 0.079
LeuSer: 6.948 ± 0.141
LeuThr: 6.688 ± 0.1
LeuVal: 5.65 ± 0.119
LeuTrp: 0.702 ± 0.037
LeuTyr: 3.121 ± 0.079
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.829 ± 0.052
MetCys: 0.176 ± 0.02
MetAsp: 1.402 ± 0.047
MetGlu: 1.67 ± 0.06
MetPhe: 0.896 ± 0.042
MetGly: 1.596 ± 0.051
MetHis: 0.482 ± 0.03
MetIle: 2.558 ± 0.07
MetLys: 2.421 ± 0.064
MetLeu: 2.292 ± 0.069
MetMet: 0.753 ± 0.039
MetAsn: 1.818 ± 0.059
MetPro: 0.933 ± 0.045
MetGln: 1.03 ± 0.051
MetArg: 0.861 ± 0.042
MetSer: 1.379 ± 0.047
MetThr: 1.694 ± 0.056
MetVal: 1.766 ± 0.06
MetTrp: 0.123 ± 0.014
MetTyr: 0.711 ± 0.036
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.0 ± 0.078
AsnCys: 0.378 ± 0.029
AsnAsp: 2.736 ± 0.077
AsnGlu: 3.948 ± 0.093
AsnPhe: 2.482 ± 0.071
AsnGly: 3.421 ± 0.08
AsnHis: 1.47 ± 0.052
AsnIle: 4.311 ± 0.091
AsnLys: 4.166 ± 0.1
AsnLeu: 4.933 ± 0.091
AsnMet: 1.313 ± 0.054
AsnAsn: 3.326 ± 0.097
AsnPro: 2.013 ± 0.061
AsnGln: 3.267 ± 0.08
AsnArg: 2.026 ± 0.058
AsnSer: 2.85 ± 0.075
AsnThr: 2.835 ± 0.08
AsnVal: 2.99 ± 0.084
AsnTrp: 0.617 ± 0.035
AsnTyr: 2.704 ± 0.083
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.977 ± 0.062
ProCys: 0.194 ± 0.02
ProAsp: 1.704 ± 0.054
ProGlu: 2.497 ± 0.075
ProPhe: 1.766 ± 0.055
ProGly: 1.846 ± 0.068
ProHis: 0.563 ± 0.041
ProIle: 3.315 ± 0.086
ProLys: 2.582 ± 0.075
ProLeu: 3.206 ± 0.069
ProMet: 0.776 ± 0.036
ProAsn: 1.922 ± 0.06
ProPro: 0.62 ± 0.034
ProGln: 1.114 ± 0.045
ProArg: 0.928 ± 0.042
ProSer: 1.905 ± 0.065
ProThr: 2.258 ± 0.071
ProVal: 2.184 ± 0.067
ProTrp: 0.269 ± 0.021
ProTyr: 1.374 ± 0.05
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.193 ± 0.08
GlnCys: 0.195 ± 0.019
GlnAsp: 1.56 ± 0.052
GlnGlu: 3.057 ± 0.086
GlnPhe: 1.903 ± 0.065
GlnGly: 2.146 ± 0.059
GlnHis: 0.761 ± 0.041
GlnIle: 3.468 ± 0.084
GlnLys: 3.846 ± 0.107
GlnLeu: 5.72 ± 0.11
GlnMet: 1.163 ± 0.046
GlnAsn: 1.988 ± 0.068
GlnPro: 1.573 ± 0.057
GlnGln: 2.687 ± 0.101
GlnArg: 1.611 ± 0.062
GlnSer: 2.167 ± 0.063
GlnThr: 2.926 ± 0.078
GlnVal: 2.69 ± 0.073
GlnTrp: 0.408 ± 0.031
GlnTyr: 1.37 ± 0.041
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.854 ± 0.051
ArgCys: 0.25 ± 0.023
ArgAsp: 1.468 ± 0.057
ArgGlu: 2.463 ± 0.075
ArgPhe: 1.656 ± 0.052
ArgGly: 1.886 ± 0.068
ArgHis: 0.655 ± 0.034
ArgIle: 3.074 ± 0.072
ArgLys: 3.068 ± 0.082
ArgLeu: 3.831 ± 0.084
ArgMet: 1.119 ± 0.044
ArgAsn: 1.759 ± 0.056
ArgPro: 1.144 ± 0.051
ArgGln: 1.668 ± 0.05
ArgArg: 1.569 ± 0.059
ArgSer: 1.926 ± 0.061
ArgThr: 1.747 ± 0.059
ArgVal: 1.95 ± 0.067
ArgTrp: 0.336 ± 0.024
ArgTyr: 1.503 ± 0.055
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.423 ± 0.082
SerCys: 0.391 ± 0.026
SerAsp: 2.573 ± 0.082
SerGlu: 3.666 ± 0.087
SerPhe: 3.197 ± 0.092
SerGly: 3.941 ± 0.093
SerHis: 1.136 ± 0.048
SerIle: 5.421 ± 0.11
SerLys: 4.109 ± 0.104
SerLeu: 6.234 ± 0.127
SerMet: 1.567 ± 0.052
SerAsn: 2.971 ± 0.077
SerPro: 1.715 ± 0.062
SerGln: 2.269 ± 0.073
SerArg: 1.795 ± 0.061
SerSer: 3.707 ± 0.103
SerThr: 3.366 ± 0.088
SerVal: 3.548 ± 0.088
SerTrp: 0.613 ± 0.031
SerTyr: 2.453 ± 0.075
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.865 ± 0.091
ThrCys: 0.389 ± 0.028
ThrAsp: 2.969 ± 0.079
ThrGlu: 3.605 ± 0.088
ThrPhe: 2.918 ± 0.079
ThrGly: 4.051 ± 0.104
ThrHis: 1.188 ± 0.048
ThrIle: 6.445 ± 0.126
ThrLys: 4.449 ± 0.096
ThrLeu: 5.576 ± 0.092
ThrMet: 1.503 ± 0.051
ThrAsn: 3.5 ± 0.086
ThrPro: 2.341 ± 0.079
ThrGln: 2.024 ± 0.064
ThrArg: 1.78 ± 0.065
ThrSer: 3.495 ± 0.084
ThrThr: 3.753 ± 0.111
ThrVal: 3.656 ± 0.095
ThrTrp: 0.501 ± 0.03
ThrTyr: 2.313 ± 0.064
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.992 ± 0.093
ValCys: 0.537 ± 0.033
ValAsp: 3.322 ± 0.082
ValGlu: 3.611 ± 0.081
ValPhe: 2.664 ± 0.081
ValGly: 4.005 ± 0.107
ValHis: 1.222 ± 0.047
ValIle: 5.495 ± 0.113
ValLys: 3.925 ± 0.093
ValLeu: 5.441 ± 0.098
ValMet: 1.548 ± 0.062
ValAsn: 3.079 ± 0.068
ValPro: 2.214 ± 0.063
ValGln: 2.201 ± 0.066
ValArg: 1.939 ± 0.061
ValSer: 3.872 ± 0.095
ValThr: 3.787 ± 0.074
ValVal: 3.719 ± 0.092
ValTrp: 0.478 ± 0.033
ValTyr: 2.055 ± 0.075
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.402 ± 0.028
TrpCys: 0.068 ± 0.011
TrpAsp: 0.406 ± 0.031
TrpGlu: 0.596 ± 0.033
TrpPhe: 0.471 ± 0.029
TrpGly: 0.545 ± 0.035
TrpHis: 0.203 ± 0.022
TrpIle: 0.799 ± 0.036
TrpLys: 0.584 ± 0.032
TrpLeu: 1.146 ± 0.043
TrpMet: 0.29 ± 0.024
TrpAsn: 0.489 ± 0.029
TrpPro: 0.203 ± 0.021
TrpGln: 0.687 ± 0.037
TrpArg: 0.343 ± 0.025
TrpSer: 0.425 ± 0.03
TrpThr: 0.438 ± 0.03
TrpVal: 0.465 ± 0.031
TrpTrp: 0.121 ± 0.015
TrpTyr: 0.357 ± 0.027
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.117 ± 0.067
TyrCys: 0.357 ± 0.025
TyrAsp: 1.844 ± 0.065
TyrGlu: 2.349 ± 0.065
TyrPhe: 2.189 ± 0.066
TyrGly: 2.47 ± 0.065
TyrHis: 0.962 ± 0.046
TyrIle: 2.774 ± 0.072
TyrLys: 2.318 ± 0.066
TyrLeu: 4.5 ± 0.11
TyrMet: 0.814 ± 0.042
TyrAsn: 1.844 ± 0.055
TyrPro: 1.459 ± 0.052
TyrGln: 2.292 ± 0.069
TyrArg: 1.575 ± 0.052
TyrSer: 2.347 ± 0.067
TyrThr: 2.226 ± 0.063
TyrVal: 2.004 ± 0.055
TyrTrp: 0.393 ± 0.029
TyrTyr: 1.81 ± 0.066
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1876 proteins (527073 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
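Protein-level bootstrapping can be sketched as follows. This is a minimal illustration, not the Proteome-pI implementation: it assumes that whole proteins are resampled with replacement, that the dipeptide frequency is recomputed for each replicate, and that the reported error is the standard deviation across replicates. The function names, the fixed seed and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Standard deviation (per mille) of one dipeptide's frequency over
    n_boot resamplings of whole proteins with replacement."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]  # count once, reuse
    estimates = []
    for _ in range(n_boot):
        # Resample proteins, not individual dipeptides.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)

toy = ["MKKLLA", "MKLL", "MALLKE", "MKEEL"]  # made-up sequences
print(bootstrap_error(toy, "LL"))
```

Resampling at the protein level keeps the dipeptides of one protein together, so the error reflects variation between proteins rather than treating every dipeptide as an independent observation.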

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski