Amino acid dipepetide frequency for Neisseria meningitidis serogroup B (strain MC58)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.7AlaAla: 13.7 ± 0.271
1.125AlaCys: 1.125 ± 0.046
6.133AlaAsp: 6.133 ± 0.123
7.262AlaGlu: 7.262 ± 0.125
3.872AlaPhe: 3.872 ± 0.095
7.896AlaGly: 7.896 ± 0.184
1.915AlaHis: 1.915 ± 0.062
4.112AlaIle: 4.112 ± 0.11
5.456AlaLys: 5.456 ± 0.113
10.267AlaLeu: 10.267 ± 0.149
2.414AlaMet: 2.414 ± 0.075
3.218AlaAsn: 3.218 ± 0.08
3.517AlaPro: 3.517 ± 0.083
4.619AlaGln: 4.619 ± 0.102
4.928AlaArg: 4.928 ± 0.116
4.47AlaSer: 4.47 ± 0.099
3.75AlaThr: 3.75 ± 0.066
8.892AlaVal: 8.892 ± 0.165
1.037AlaTrp: 1.037 ± 0.044
2.833AlaTyr: 2.833 ± 0.074
0.0AlaXaa: 0.0 ± 0.0
Cys
1.003CysAla: 1.003 ± 0.043
0.185CysCys: 0.185 ± 0.02
0.441CysAsp: 0.441 ± 0.028
0.52CysGlu: 0.52 ± 0.029
0.434CysPhe: 0.434 ± 0.024
1.149CysGly: 1.149 ± 0.05
0.276CysHis: 0.276 ± 0.025
0.566CysIle: 0.566 ± 0.032
0.466CysLys: 0.466 ± 0.024
1.017CysLeu: 1.017 ± 0.047
0.213CysMet: 0.213 ± 0.02
0.319CysAsn: 0.319 ± 0.02
0.552CysPro: 0.552 ± 0.035
0.348CysGln: 0.348 ± 0.024
0.945CysArg: 0.945 ± 0.047
0.631CysSer: 0.631 ± 0.029
0.501CysThr: 0.501 ± 0.034
0.634CysVal: 0.634 ± 0.036
0.065CysTrp: 0.065 ± 0.011
0.267CysTyr: 0.267 ± 0.017
0.0CysXaa: 0.0 ± 0.0
Asp
5.002AspAla: 5.002 ± 0.094
0.484AspCys: 0.484 ± 0.031
2.593AspAsp: 2.593 ± 0.082
3.488AspGlu: 3.488 ± 0.093
2.505AspPhe: 2.505 ± 0.068
4.792AspGly: 4.792 ± 0.114
0.775AspHis: 0.775 ± 0.031
3.759AspIle: 3.759 ± 0.085
3.323AspLys: 3.323 ± 0.077
4.993AspLeu: 4.993 ± 0.105
1.387AspMet: 1.387 ± 0.05
2.227AspAsn: 2.227 ± 0.064
1.704AspPro: 1.704 ± 0.06
1.14AspGln: 1.14 ± 0.045
2.452AspArg: 2.452 ± 0.064
2.699AspSer: 2.699 ± 0.075
3.033AspThr: 3.033 ± 0.098
3.53AspVal: 3.53 ± 0.072
0.813AspTrp: 0.813 ± 0.042
1.975AspTyr: 1.975 ± 0.061
0.0AspXaa: 0.0 ± 0.0
Glu
6.389GluAla: 6.389 ± 0.12
0.576GluCys: 0.576 ± 0.035
2.699GluAsp: 2.699 ± 0.082
3.753GluGlu: 3.753 ± 0.111
2.044GluPhe: 2.044 ± 0.048
4.005GluGly: 4.005 ± 0.117
1.605GluHis: 1.605 ± 0.044
4.276GluIle: 4.276 ± 0.109
4.146GluLys: 4.146 ± 0.108
5.65GluLeu: 5.65 ± 0.129
1.68GluMet: 1.68 ± 0.06
3.198GluAsn: 3.198 ± 0.084
2.003GluPro: 2.003 ± 0.068
2.978GluGln: 2.978 ± 0.078
3.553GluArg: 3.553 ± 0.088
2.975GluSer: 2.975 ± 0.071
3.945GluThr: 3.945 ± 0.096
3.551GluVal: 3.551 ± 0.084
0.797GluTrp: 0.797 ± 0.045
1.749GluTyr: 1.749 ± 0.052
0.0GluXaa: 0.0 ± 0.0
Phe
4.208PheAla: 4.208 ± 0.076
0.59PheCys: 0.59 ± 0.026
2.803PheAsp: 2.803 ± 0.066
2.401PheGlu: 2.401 ± 0.066
1.855PhePhe: 1.855 ± 0.057
3.57PheGly: 3.57 ± 0.079
0.832PheHis: 0.832 ± 0.034
2.19PheIle: 2.19 ± 0.073
2.099PheLys: 2.099 ± 0.061
3.46PheLeu: 3.46 ± 0.099
0.912PheMet: 0.912 ± 0.034
1.667PheAsn: 1.667 ± 0.055
1.629PhePro: 1.629 ± 0.057
1.427PheGln: 1.427 ± 0.05
1.855PheArg: 1.855 ± 0.051
2.75PheSer: 2.75 ± 0.066
2.126PheThr: 2.126 ± 0.065
2.795PheVal: 2.795 ± 0.07
0.54PheTrp: 0.54 ± 0.041
1.329PheTyr: 1.329 ± 0.047
0.0PheXaa: 0.0 ± 0.0
Gly
6.463GlyAla: 6.463 ± 0.127
0.869GlyCys: 0.869 ± 0.038
3.453GlyAsp: 3.453 ± 0.082
4.655GlyGlu: 4.655 ± 0.088
3.601GlyPhe: 3.601 ± 0.086
6.576GlyGly: 6.576 ± 0.14
1.521GlyHis: 1.521 ± 0.053
5.689GlyIle: 5.689 ± 0.119
5.988GlyLys: 5.988 ± 0.124
7.484GlyLeu: 7.484 ± 0.137
2.164GlyMet: 2.164 ± 0.078
3.431GlyAsn: 3.431 ± 0.13
1.173GlyPro: 1.173 ± 0.047
2.699GlyGln: 2.699 ± 0.066
4.571GlyArg: 4.571 ± 0.103
4.693GlySer: 4.693 ± 0.154
3.824GlyThr: 3.824 ± 0.131
5.099GlyVal: 5.099 ± 0.1
1.091GlyTrp: 1.091 ± 0.045
2.617GlyTyr: 2.617 ± 0.078
0.0GlyXaa: 0.0 ± 0.0
His
1.804HisAla: 1.804 ± 0.062
0.291HisCys: 0.291 ± 0.023
0.926HisAsp: 0.926 ± 0.042
1.255HisGlu: 1.255 ± 0.053
0.952HisPhe: 0.952 ± 0.04
1.749HisGly: 1.749 ± 0.054
0.638HisHis: 0.638 ± 0.03
1.737HisIle: 1.737 ± 0.052
1.072HisLys: 1.072 ± 0.038
2.085HisLeu: 2.085 ± 0.066
0.396HisMet: 0.396 ± 0.024
0.892HisAsn: 0.892 ± 0.038
1.379HisPro: 1.379 ± 0.051
0.84HisGln: 0.84 ± 0.044
1.252HisArg: 1.252 ± 0.038
1.233HisSer: 1.233 ± 0.061
1.428HisThr: 1.428 ± 0.055
1.061HisVal: 1.061 ± 0.048
0.276HisTrp: 0.276 ± 0.02
0.773HisTyr: 0.773 ± 0.036
0.0HisXaa: 0.0 ± 0.0
Ile
6.094IleAla: 6.094 ± 0.139
0.672IleCys: 0.672 ± 0.033
3.482IleAsp: 3.482 ± 0.065
3.771IleGlu: 3.771 ± 0.095
2.123IlePhe: 2.123 ± 0.071
5.063IleGly: 5.063 ± 0.111
1.346IleHis: 1.346 ± 0.047
3.174IleIle: 3.174 ± 0.085
2.867IleLys: 2.867 ± 0.076
5.274IleLeu: 5.274 ± 0.111
1.276IleMet: 1.276 ± 0.05
2.389IleAsn: 2.389 ± 0.066
2.733IlePro: 2.733 ± 0.063
1.922IleGln: 1.922 ± 0.069
3.566IleArg: 3.566 ± 0.071
3.477IleSer: 3.477 ± 0.109
3.188IleThr: 3.188 ± 0.087
4.038IleVal: 4.038 ± 0.089
0.549IleTrp: 0.549 ± 0.027
1.584IleTyr: 1.584 ± 0.064
0.0IleXaa: 0.0 ± 0.0
Lys
5.384LysAla: 5.384 ± 0.126
0.329LysCys: 0.329 ± 0.026
2.833LysAsp: 2.833 ± 0.074
3.338LysGlu: 3.338 ± 0.084
1.708LysPhe: 1.708 ± 0.06
3.966LysGly: 3.966 ± 0.105
1.293LysHis: 1.293 ± 0.046
3.755LysIle: 3.755 ± 0.074
3.349LysLys: 3.349 ± 0.093
5.276LysLeu: 5.276 ± 0.094
1.655LysMet: 1.655 ± 0.067
2.872LysAsn: 2.872 ± 0.115
2.714LysPro: 2.714 ± 0.083
2.846LysGln: 2.846 ± 0.071
3.011LysArg: 3.011 ± 0.079
2.934LysSer: 2.934 ± 0.096
3.735LysThr: 3.735 ± 0.083
3.234LysVal: 3.234 ± 0.058
0.629LysTrp: 0.629 ± 0.035
1.543LysTyr: 1.543 ± 0.056
0.0LysXaa: 0.0 ± 0.0
Leu
10.019LeuAla: 10.019 ± 0.173
1.017LeuCys: 1.017 ± 0.039
5.422LeuAsp: 5.422 ± 0.12
5.339LeuGlu: 5.339 ± 0.112
4.129LeuPhe: 4.129 ± 0.097
7.032LeuGly: 7.032 ± 0.117
2.222LeuHis: 2.222 ± 0.057
5.525LeuIle: 5.525 ± 0.114
6.231LeuLys: 6.231 ± 0.108
9.582LeuLeu: 9.582 ± 0.183
2.486LeuMet: 2.486 ± 0.074
4.647LeuAsn: 4.647 ± 0.095
5.585LeuPro: 5.585 ± 0.132
3.57LeuGln: 3.57 ± 0.08
4.595LeuArg: 4.595 ± 0.094
6.476LeuSer: 6.476 ± 0.124
5.471LeuThr: 5.471 ± 0.096
5.693LeuVal: 5.693 ± 0.103
1.137LeuTrp: 1.137 ± 0.053
2.454LeuTyr: 2.454 ± 0.069
0.0LeuXaa: 0.0 ± 0.0
Met
2.25MetAla: 2.25 ± 0.062
0.252MetCys: 0.252 ± 0.023
1.121MetAsp: 1.121 ± 0.047
1.248MetGlu: 1.248 ± 0.049
0.905MetPhe: 0.905 ± 0.035
1.713MetGly: 1.713 ± 0.059
0.439MetHis: 0.439 ± 0.03
1.224MetIle: 1.224 ± 0.049
1.595MetLys: 1.595 ± 0.051
2.596MetLeu: 2.596 ± 0.082
0.818MetMet: 0.818 ± 0.048
1.113MetAsn: 1.113 ± 0.045
1.559MetPro: 1.559 ± 0.053
1.056MetGln: 1.056 ± 0.052
1.406MetArg: 1.406 ± 0.045
1.56MetSer: 1.56 ± 0.053
1.408MetThr: 1.408 ± 0.051
1.526MetVal: 1.526 ± 0.066
0.243MetTrp: 0.243 ± 0.021
0.525MetTyr: 0.525 ± 0.028
0.0MetXaa: 0.0 ± 0.0
Asn
3.88AsnAla: 3.88 ± 0.107
0.362AsnCys: 0.362 ± 0.024
1.91AsnAsp: 1.91 ± 0.092
2.128AsnGlu: 2.128 ± 0.053
1.365AsnPhe: 1.365 ± 0.057
3.872AsnGly: 3.872 ± 0.133
0.981AsnHis: 0.981 ± 0.044
3.073AsnIle: 3.073 ± 0.089
1.975AsnLys: 1.975 ± 0.066
3.954AsnLeu: 3.954 ± 0.102
0.909AsnMet: 0.909 ± 0.037
1.708AsnAsn: 1.708 ± 0.08
2.526AsnPro: 2.526 ± 0.078
1.538AsnGln: 1.538 ± 0.063
2.599AsnArg: 2.599 ± 0.075
1.852AsnSer: 1.852 ± 0.076
2.371AsnThr: 2.371 ± 0.092
2.503AsnVal: 2.503 ± 0.075
0.544AsnTrp: 0.544 ± 0.042
1.116AsnTyr: 1.116 ± 0.049
0.0AsnXaa: 0.0 ± 0.0
Pro
4.086ProAla: 4.086 ± 0.101
0.412ProCys: 0.412 ± 0.031
2.874ProAsp: 2.874 ± 0.073
3.868ProGlu: 3.868 ± 0.09
1.914ProPhe: 1.914 ± 0.062
2.082ProGly: 2.082 ± 0.071
1.034ProHis: 1.034 ± 0.047
1.891ProIle: 1.891 ± 0.062
2.347ProLys: 2.347 ± 0.072
3.949ProLeu: 3.949 ± 0.084
0.986ProMet: 0.986 ± 0.045
1.686ProAsn: 1.686 ± 0.048
1.521ProPro: 1.521 ± 0.067
1.821ProGln: 1.821 ± 0.063
1.665ProArg: 1.665 ± 0.062
2.493ProSer: 2.493 ± 0.068
1.977ProThr: 1.977 ± 0.052
3.452ProVal: 3.452 ± 0.089
0.4ProTrp: 0.4 ± 0.024
1.384ProTyr: 1.384 ± 0.055
0.0ProXaa: 0.0 ± 0.0
Gln
4.39GlnAla: 4.39 ± 0.108
0.303GlnCys: 0.303 ± 0.022
1.881GlnAsp: 1.881 ± 0.064
2.219GlnGlu: 2.219 ± 0.068
1.349GlnPhe: 1.349 ± 0.05
2.838GlnGly: 2.838 ± 0.087
0.974GlnHis: 0.974 ± 0.045
2.57GlnIle: 2.57 ± 0.062
2.253GlnLys: 2.253 ± 0.079
3.229GlnLeu: 3.229 ± 0.073
1.043GlnMet: 1.043 ± 0.044
2.102GlnAsn: 2.102 ± 0.071
1.622GlnPro: 1.622 ± 0.053
1.833GlnGln: 1.833 ± 0.076
1.903GlnArg: 1.903 ± 0.072
2.375GlnSer: 2.375 ± 0.071
3.044GlnThr: 3.044 ± 0.083
2.377GlnVal: 2.377 ± 0.062
0.451GlnTrp: 0.451 ± 0.027
1.295GlnTyr: 1.295 ± 0.055
0.0GlnXaa: 0.0 ± 0.0
Arg
4.595ArgAla: 4.595 ± 0.093
0.487ArgCys: 0.487 ± 0.03
2.582ArgAsp: 2.582 ± 0.072
3.524ArgGlu: 3.524 ± 0.099
2.857ArgPhe: 2.857 ± 0.074
3.121ArgGly: 3.121 ± 0.07
1.593ArgHis: 1.593 ± 0.063
3.393ArgIle: 3.393 ± 0.088
2.915ArgLys: 2.915 ± 0.076
6.106ArgLeu: 6.106 ± 0.102
1.428ArgMet: 1.428 ± 0.048
2.279ArgAsn: 2.279 ± 0.078
2.262ArgPro: 2.262 ± 0.06
2.929ArgGln: 2.929 ± 0.082
3.666ArgArg: 3.666 ± 0.105
2.632ArgSer: 2.632 ± 0.079
2.495ArgThr: 2.495 ± 0.07
3.299ArgVal: 3.299 ± 0.074
0.564ArgTrp: 0.564 ± 0.029
2.088ArgTyr: 2.088 ± 0.072
0.0ArgXaa: 0.0 ± 0.0
Ser
5.411SerAla: 5.411 ± 0.086
0.64SerCys: 0.64 ± 0.035
3.362SerAsp: 3.362 ± 0.083
3.424SerGlu: 3.424 ± 0.069
2.377SerPhe: 2.377 ± 0.073
5.444SerGly: 5.444 ± 0.128
1.147SerHis: 1.147 ± 0.049
2.934SerIle: 2.934 ± 0.089
2.634SerLys: 2.634 ± 0.086
5.686SerLeu: 5.686 ± 0.096
1.188SerMet: 1.188 ± 0.044
1.924SerAsn: 1.924 ± 0.071
2.246SerPro: 2.246 ± 0.071
1.812SerGln: 1.812 ± 0.056
3.275SerArg: 3.275 ± 0.082
3.021SerSer: 3.021 ± 0.095
2.515SerThr: 2.515 ± 0.078
4.191SerVal: 4.191 ± 0.105
0.643SerTrp: 0.643 ± 0.033
1.591SerTyr: 1.591 ± 0.054
0.0SerXaa: 0.0 ± 0.0
Thr
6.485ThrAla: 6.485 ± 0.152
0.473ThrCys: 0.473 ± 0.035
3.059ThrAsp: 3.059 ± 0.088
3.042ThrGlu: 3.042 ± 0.082
1.943ThrPhe: 1.943 ± 0.057
4.468ThrGly: 4.468 ± 0.113
1.147ThrHis: 1.147 ± 0.056
2.596ThrIle: 2.596 ± 0.082
2.023ThrLys: 2.023 ± 0.064
5.832ThrLeu: 5.832 ± 0.114
0.996ThrMet: 0.996 ± 0.041
1.514ThrAsn: 1.514 ± 0.067
2.822ThrPro: 2.822 ± 0.063
2.003ThrGln: 2.003 ± 0.074
2.661ThrArg: 2.661 ± 0.079
2.277ThrSer: 2.277 ± 0.078
2.226ThrThr: 2.226 ± 0.094
4.837ThrVal: 4.837 ± 0.128
0.475ThrTrp: 0.475 ± 0.03
1.384ThrTyr: 1.384 ± 0.047
0.0ThrXaa: 0.0 ± 0.0
Val
6.43ValAla: 6.43 ± 0.128
0.969ValCys: 0.969 ± 0.037
2.972ValAsp: 2.972 ± 0.071
4.081ValGlu: 4.081 ± 0.103
3.068ValPhe: 3.068 ± 0.074
5.072ValGly: 5.072 ± 0.115
1.264ValHis: 1.264 ± 0.054
3.8ValIle: 3.8 ± 0.085
3.764ValLys: 3.764 ± 0.095
7.467ValLeu: 7.467 ± 0.141
1.735ValMet: 1.735 ± 0.056
2.586ValAsn: 2.586 ± 0.082
2.702ValPro: 2.702 ± 0.071
2.455ValGln: 2.455 ± 0.064
3.956ValArg: 3.956 ± 0.095
4.767ValSer: 4.767 ± 0.094
3.068ValThr: 3.068 ± 0.074
4.763ValVal: 4.763 ± 0.101
0.953ValTrp: 0.953 ± 0.041
1.974ValTyr: 1.974 ± 0.054
0.0ValXaa: 0.0 ± 0.0
Trp
0.96TrpAla: 0.96 ± 0.043
0.149TrpCys: 0.149 ± 0.017
0.573TrpAsp: 0.573 ± 0.028
0.609TrpGlu: 0.609 ± 0.028
0.592TrpPhe: 0.592 ± 0.037
0.778TrpGly: 0.778 ± 0.032
0.343TrpHis: 0.343 ± 0.023
0.643TrpIle: 0.643 ± 0.041
0.592TrpLys: 0.592 ± 0.029
1.608TrpLeu: 1.608 ± 0.061
0.322TrpMet: 0.322 ± 0.024
0.439TrpAsn: 0.439 ± 0.024
0.262TrpPro: 0.262 ± 0.023
0.88TrpGln: 0.88 ± 0.034
0.744TrpArg: 0.744 ± 0.036
0.492TrpSer: 0.492 ± 0.031
0.511TrpThr: 0.511 ± 0.033
0.77TrpVal: 0.77 ± 0.037
0.17TrpTrp: 0.17 ± 0.021
0.333TrpTyr: 0.333 ± 0.022
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.865TyrAla: 2.865 ± 0.078
0.346TyrCys: 0.346 ± 0.025
1.555TyrAsp: 1.555 ± 0.054
1.65TyrGlu: 1.65 ± 0.051
1.447TyrPhe: 1.447 ± 0.062
2.476TyrGly: 2.476 ± 0.069
0.694TyrHis: 0.694 ± 0.042
1.682TyrIle: 1.682 ± 0.061
1.31TyrLys: 1.31 ± 0.055
3.116TyrLeu: 3.116 ± 0.072
0.547TyrMet: 0.547 ± 0.031
0.957TyrAsn: 0.957 ± 0.048
1.329TyrPro: 1.329 ± 0.048
1.332TyrGln: 1.332 ± 0.059
2.21TyrArg: 2.21 ± 0.073
1.658TyrSer: 1.658 ± 0.058
1.648TyrThr: 1.648 ± 0.059
1.598TyrVal: 1.598 ± 0.055
0.393TyrTrp: 0.393 ± 0.026
0.866TyrTyr: 0.866 ± 0.043
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2001 proteins (583208 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski