Amino acid dipeptide frequency for Lactobacillus concavus DSM 17758

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
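As a minimal sketch of such a calculation (not necessarily the pipeline used for this database), overlapping pairs of adjacent residues can be counted within each protein and normalised to per mille. The function name `dipeptide_frequencies` and the toy sequences are hypothetical, and mapping every non-standard letter to `X` (Xaa) is an assumption.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # one-letter codes of the 20 standard amino acids

def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille over all sequences.

    Pairs overlap and never span two proteins. Any letter outside the
    20 standard amino acids is mapped to 'X' (an assumption).
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy input: two very short "proteins"
toy = ["MAAL", "MLL"]
freqs = dipeptide_frequencies(toy)
```

The toy input contains five pairs in total (MA, AA, AL, ML, LL), so each of them gets the same share of the 1000 per mille.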

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
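The comparison against the uniform expectation can be sketched as follows. This assumes that the threshold is simply 1/400 of all dipeptides; the exact rule used for colouring the table is not stated on this page, and the function name `classify` is hypothetical. The example values are taken from the table below.

```python
N_STANDARD = 20

# Uniform expectation: 1 of 400 dipeptides, i.e. 0.25 % or 2.5 per mille
expected_permille = 1000.0 / (N_STANDARD ** 2)

def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"

# Observed values (per mille) copied from the table
examples = {"LeuLeu": 10.218, "AlaAla": 7.583, "CysCys": 0.051, "TrpTrp": 0.186}
labels = {name: classify(value) for name, value in examples.items()}
```

Under this assumption LeuLeu and AlaAla fall in the over-represented group, while CysCys and TrpTrp fall in the under-represented group.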

All values are presented in per mille (‰), and therefore need to be multiplied by 10⁻³ to obtain fractions.
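A short illustration of the unit conversion, using the AlaAla value from the table; the helper names are hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert per mille to percent (divide by 10)."""
    return value_permille / 10.0

ala_ala = 7.583  # AlaAla frequency in per mille, from the table
fraction = permille_to_fraction(ala_ala)   # about 0.007583
percent = permille_to_percent(ala_ala)     # about 0.7583 %
```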

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.583 ± 0.171
AlaCys: 0.369 ± 0.028
AlaAsp: 5.025 ± 0.101
AlaGlu: 4.789 ± 0.104
AlaPhe: 3.225 ± 0.079
AlaGly: 5.687 ± 0.123
AlaHis: 1.735 ± 0.056
AlaIle: 5.973 ± 0.109
AlaLys: 5.197 ± 0.123
AlaLeu: 7.794 ± 0.142
AlaMet: 2.097 ± 0.06
AlaAsn: 3.6 ± 0.09
AlaPro: 2.456 ± 0.072
AlaGln: 4.203 ± 0.098
AlaArg: 3.366 ± 0.075
AlaSer: 4.432 ± 0.11
AlaThr: 5.469 ± 0.125
AlaVal: 5.659 ± 0.126
AlaTrp: 0.714 ± 0.041
AlaTyr: 2.493 ± 0.077
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.337 ± 0.028
CysCys: 0.051 ± 0.01
CysAsp: 0.23 ± 0.022
CysGlu: 0.232 ± 0.022
CysPhe: 0.23 ± 0.019
CysGly: 0.424 ± 0.03
CysHis: 0.192 ± 0.025
CysIle: 0.242 ± 0.022
CysLys: 0.108 ± 0.016
CysLeu: 0.527 ± 0.035
CysMet: 0.061 ± 0.01
CysAsn: 0.15 ± 0.017
CysPro: 0.196 ± 0.022
CysGln: 0.308 ± 0.028
CysArg: 0.253 ± 0.025
CysSer: 0.272 ± 0.025
CysThr: 0.215 ± 0.021
CysVal: 0.295 ± 0.022
CysTrp: 0.049 ± 0.01
CysTyr: 0.213 ± 0.018
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.325 ± 0.096
AspCys: 0.253 ± 0.023
AspAsp: 3.543 ± 0.1
AspGlu: 4.129 ± 0.092
AspPhe: 3.022 ± 0.074
AspGly: 3.486 ± 0.109
AspHis: 1.819 ± 0.06
AspIle: 3.339 ± 0.086
AspLys: 2.727 ± 0.074
AspLeu: 6.007 ± 0.105
AspMet: 1.265 ± 0.045
AspAsn: 2.179 ± 0.065
AspPro: 2.574 ± 0.062
AspGln: 3.884 ± 0.107
AspArg: 2.666 ± 0.072
AspSer: 2.662 ± 0.067
AspThr: 2.654 ± 0.082
AspVal: 3.826 ± 0.105
AspTrp: 0.78 ± 0.037
AspTyr: 2.689 ± 0.074
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.538 ± 0.098
GluCys: 0.219 ± 0.025
GluAsp: 2.892 ± 0.093
GluGlu: 2.904 ± 0.087
GluPhe: 2.198 ± 0.062
GluGly: 2.805 ± 0.08
GluHis: 1.292 ± 0.051
GluIle: 4.413 ± 0.105
GluLys: 3.716 ± 0.086
GluLeu: 6.317 ± 0.12
GluMet: 1.724 ± 0.065
GluAsn: 2.679 ± 0.076
GluPro: 2.006 ± 0.063
GluGln: 3.256 ± 0.097
GluArg: 2.711 ± 0.072
GluSer: 2.976 ± 0.085
GluThr: 3.625 ± 0.097
GluVal: 4.083 ± 0.101
GluTrp: 0.483 ± 0.03
GluTyr: 1.418 ± 0.057
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.412 ± 0.079
PheCys: 0.251 ± 0.02
PheAsp: 2.965 ± 0.08
PheGlu: 2.464 ± 0.072
PhePhe: 1.92 ± 0.072
PheGly: 3.212 ± 0.074
PheHis: 0.847 ± 0.043
PheIle: 2.803 ± 0.081
PheLys: 2.439 ± 0.066
PheLeu: 4.0 ± 0.113
PheMet: 1.214 ± 0.053
PheAsn: 2.163 ± 0.067
PhePro: 1.473 ± 0.057
PheGln: 1.587 ± 0.06
PheArg: 1.57 ± 0.056
PheSer: 3.062 ± 0.078
PheThr: 2.479 ± 0.069
PheVal: 3.17 ± 0.088
PheTrp: 0.603 ± 0.036
PheTyr: 1.627 ± 0.05
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.791 ± 0.129
GlyCys: 0.392 ± 0.031
GlyAsp: 3.286 ± 0.083
GlyGlu: 3.699 ± 0.088
GlyPhe: 2.915 ± 0.072
GlyGly: 4.11 ± 0.101
GlyHis: 1.692 ± 0.056
GlyIle: 4.911 ± 0.109
GlyLys: 3.914 ± 0.098
GlyLeu: 6.57 ± 0.11
GlyMet: 1.792 ± 0.06
GlyAsn: 2.561 ± 0.076
GlyPro: 1.517 ± 0.057
GlyGln: 3.221 ± 0.095
GlyArg: 2.989 ± 0.08
GlySer: 3.6 ± 0.08
GlyThr: 3.977 ± 0.087
GlyVal: 4.778 ± 0.089
GlyTrp: 0.755 ± 0.043
GlyTyr: 2.662 ± 0.071
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.669 ± 0.061
HisCys: 0.156 ± 0.017
HisAsp: 1.501 ± 0.055
HisGlu: 1.522 ± 0.05
HisPhe: 1.26 ± 0.055
HisGly: 1.627 ± 0.053
HisHis: 0.881 ± 0.042
HisIle: 1.244 ± 0.048
HisLys: 0.788 ± 0.041
HisLeu: 2.512 ± 0.073
HisMet: 0.401 ± 0.024
HisAsn: 0.877 ± 0.043
HisPro: 1.069 ± 0.051
HisGln: 1.756 ± 0.061
HisArg: 1.119 ± 0.05
HisSer: 1.14 ± 0.049
HisThr: 1.035 ± 0.044
HisVal: 1.522 ± 0.051
HisTrp: 0.31 ± 0.026
HisTyr: 1.067 ± 0.051
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.325 ± 0.133
IleCys: 0.432 ± 0.03
IleAsp: 4.612 ± 0.101
IleGlu: 3.952 ± 0.083
IlePhe: 2.978 ± 0.079
IleGly: 5.101 ± 0.113
IleHis: 1.368 ± 0.048
IleIle: 4.671 ± 0.124
IleLys: 3.948 ± 0.104
IleLeu: 6.416 ± 0.13
IleMet: 1.595 ± 0.06
IleAsn: 3.357 ± 0.087
IlePro: 2.774 ± 0.076
IleGln: 2.873 ± 0.078
IleArg: 2.818 ± 0.072
IleSer: 4.481 ± 0.106
IleThr: 4.015 ± 0.091
IleVal: 5.09 ± 0.116
IleTrp: 0.637 ± 0.041
IleTyr: 2.261 ± 0.069
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.331 ± 0.11
LysCys: 0.135 ± 0.018
LysAsp: 2.751 ± 0.082
LysGlu: 2.869 ± 0.09
LysPhe: 1.99 ± 0.077
LysGly: 2.898 ± 0.081
LysHis: 1.119 ± 0.05
LysIle: 4.291 ± 0.094
LysLys: 4.055 ± 0.116
LysLeu: 5.248 ± 0.101
LysMet: 1.897 ± 0.059
LysAsn: 2.946 ± 0.081
LysPro: 1.954 ± 0.067
LysGln: 3.267 ± 0.079
LysArg: 3.004 ± 0.083
LysSer: 3.351 ± 0.084
LysThr: 3.842 ± 0.091
LysVal: 3.832 ± 0.088
LysTrp: 0.518 ± 0.032
LysTyr: 1.981 ± 0.071
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.276 ± 0.152
LeuCys: 0.428 ± 0.03
LeuAsp: 5.632 ± 0.108
LeuGlu: 4.677 ± 0.087
LeuPhe: 4.236 ± 0.124
LeuGly: 6.272 ± 0.124
LeuHis: 1.874 ± 0.069
LeuIle: 7.611 ± 0.138
LeuLys: 5.84 ± 0.113
LeuLeu: 10.218 ± 0.198
LeuMet: 2.574 ± 0.073
LeuAsn: 5.037 ± 0.111
LeuPro: 4.605 ± 0.097
LeuGln: 4.628 ± 0.109
LeuArg: 4.15 ± 0.087
LeuSer: 6.89 ± 0.126
LeuThr: 7.482 ± 0.126
LeuVal: 7.094 ± 0.118
LeuTrp: 0.826 ± 0.041
LeuTyr: 2.757 ± 0.073
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.413 ± 0.074
MetCys: 0.069 ± 0.011
MetAsp: 1.26 ± 0.055
MetGlu: 0.938 ± 0.043
MetPhe: 0.9 ± 0.041
MetGly: 1.587 ± 0.059
MetHis: 0.495 ± 0.028
MetIle: 2.158 ± 0.072
MetLys: 1.642 ± 0.051
MetLeu: 2.335 ± 0.073
MetMet: 0.864 ± 0.044
MetAsn: 1.254 ± 0.054
MetPro: 1.026 ± 0.046
MetGln: 1.062 ± 0.048
MetArg: 1.048 ± 0.04
MetSer: 1.705 ± 0.054
MetThr: 2.055 ± 0.059
MetVal: 1.6 ± 0.049
MetTrp: 0.2 ± 0.019
MetTyr: 0.552 ± 0.031
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.288 ± 0.083
AsnCys: 0.234 ± 0.025
AsnAsp: 2.632 ± 0.072
AsnGlu: 2.601 ± 0.071
AsnPhe: 2.089 ± 0.059
AsnGly: 3.25 ± 0.087
AsnHis: 1.275 ± 0.048
AsnIle: 2.63 ± 0.073
AsnLys: 2.085 ± 0.067
AsnLeu: 4.576 ± 0.1
AsnMet: 1.094 ± 0.053
AsnAsn: 1.91 ± 0.065
AsnPro: 1.924 ± 0.054
AsnGln: 2.847 ± 0.085
AsnArg: 2.116 ± 0.07
AsnSer: 2.453 ± 0.07
AsnThr: 2.061 ± 0.052
AsnVal: 2.803 ± 0.084
AsnTrp: 0.613 ± 0.031
AsnTyr: 1.861 ± 0.063
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.982 ± 0.08
ProCys: 0.105 ± 0.013
ProAsp: 2.51 ± 0.067
ProGlu: 3.259 ± 0.078
ProPhe: 1.621 ± 0.049
ProGly: 2.196 ± 0.064
ProHis: 0.698 ± 0.04
ProIle: 2.706 ± 0.076
ProLys: 2.232 ± 0.065
ProLeu: 3.482 ± 0.091
ProMet: 0.847 ± 0.042
ProAsn: 1.728 ± 0.062
ProPro: 0.721 ± 0.04
ProGln: 1.87 ± 0.06
ProArg: 1.359 ± 0.049
ProSer: 2.038 ± 0.056
ProThr: 2.546 ± 0.064
ProVal: 2.841 ± 0.071
ProTrp: 0.388 ± 0.028
ProTyr: 1.265 ± 0.051
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.464 ± 0.121
GlnCys: 0.15 ± 0.016
GlnAsp: 2.487 ± 0.072
GlnGlu: 2.651 ± 0.086
GlnPhe: 1.948 ± 0.066
GlnGly: 2.479 ± 0.072
GlnHis: 1.36 ± 0.055
GlnIle: 3.642 ± 0.085
GlnLys: 3.168 ± 0.082
GlnLeu: 6.787 ± 0.149
GlnMet: 1.324 ± 0.052
GlnAsn: 2.171 ± 0.07
GlnPro: 2.082 ± 0.071
GlnGln: 3.646 ± 0.131
GlnArg: 2.797 ± 0.076
GlnSer: 2.841 ± 0.075
GlnThr: 3.393 ± 0.086
GlnVal: 3.783 ± 0.091
GlnTrp: 0.514 ± 0.032
GlnTyr: 1.661 ± 0.061
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.176 ± 0.078
ArgCys: 0.167 ± 0.021
ArgAsp: 2.384 ± 0.07
ArgGlu: 2.869 ± 0.076
ArgPhe: 2.04 ± 0.052
ArgGly: 2.536 ± 0.069
ArgHis: 1.334 ± 0.061
ArgIle: 2.858 ± 0.077
ArgLys: 2.293 ± 0.078
ArgLeu: 4.875 ± 0.108
ArgMet: 1.106 ± 0.042
ArgAsn: 1.692 ± 0.066
ArgPro: 1.775 ± 0.06
ArgGln: 3.105 ± 0.089
ArgArg: 2.662 ± 0.078
ArgSer: 2.245 ± 0.071
ArgThr: 2.31 ± 0.073
ArgVal: 3.102 ± 0.079
ArgTrp: 0.386 ± 0.028
ArgTyr: 1.825 ± 0.064
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.629 ± 0.111
SerCys: 0.251 ± 0.024
SerAsp: 3.478 ± 0.078
SerGlu: 3.611 ± 0.086
SerPhe: 2.772 ± 0.081
SerGly: 4.367 ± 0.091
SerHis: 1.366 ± 0.051
SerIle: 3.728 ± 0.1
SerLys: 3.014 ± 0.084
SerLeu: 5.921 ± 0.126
SerMet: 1.376 ± 0.044
SerAsn: 2.304 ± 0.064
SerPro: 1.992 ± 0.056
SerGln: 3.001 ± 0.073
SerArg: 2.517 ± 0.071
SerSer: 3.905 ± 0.178
SerThr: 3.522 ± 0.1
SerVal: 3.925 ± 0.098
SerTrp: 0.702 ± 0.037
SerTyr: 2.143 ± 0.072
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.139 ± 0.105
ThrCys: 0.28 ± 0.026
ThrAsp: 3.705 ± 0.086
ThrGlu: 3.334 ± 0.076
ThrPhe: 2.74 ± 0.073
ThrGly: 4.222 ± 0.089
ThrHis: 1.399 ± 0.053
ThrIle: 4.761 ± 0.105
ThrLys: 3.52 ± 0.096
ThrLeu: 5.927 ± 0.116
ThrMet: 1.345 ± 0.054
ThrAsn: 2.858 ± 0.076
ThrPro: 2.755 ± 0.081
ThrGln: 2.649 ± 0.079
ThrArg: 2.268 ± 0.065
ThrSer: 3.655 ± 0.094
ThrThr: 4.614 ± 0.137
ThrVal: 4.397 ± 0.101
ThrTrp: 0.637 ± 0.037
ThrTyr: 2.085 ± 0.07
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.19 ± 0.113
ValCys: 0.379 ± 0.025
ValAsp: 4.344 ± 0.094
ValGlu: 3.6 ± 0.089
ValPhe: 2.784 ± 0.085
ValGly: 4.883 ± 0.09
ValHis: 1.303 ± 0.052
ValIle: 5.341 ± 0.117
ValLys: 4.12 ± 0.096
ValLeu: 6.812 ± 0.125
ValMet: 1.754 ± 0.065
ValAsn: 3.027 ± 0.076
ValPro: 2.652 ± 0.073
ValGln: 2.748 ± 0.067
ValArg: 2.736 ± 0.079
ValSer: 4.342 ± 0.087
ValThr: 4.675 ± 0.108
ValVal: 5.324 ± 0.1
ValTrp: 0.632 ± 0.032
ValTyr: 2.211 ± 0.057
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.596 ± 0.038
TrpCys: 0.061 ± 0.01
TrpAsp: 0.466 ± 0.031
TrpGlu: 0.453 ± 0.031
TrpPhe: 0.472 ± 0.033
TrpGly: 0.618 ± 0.036
TrpHis: 0.36 ± 0.026
TrpIle: 0.603 ± 0.035
TrpLys: 0.348 ± 0.026
TrpLeu: 1.499 ± 0.052
TrpMet: 0.242 ± 0.02
TrpAsn: 0.432 ± 0.027
TrpPro: 0.398 ± 0.026
TrpGln: 0.929 ± 0.05
TrpArg: 0.603 ± 0.037
TrpSer: 0.615 ± 0.035
TrpThr: 0.529 ± 0.029
TrpVal: 0.599 ± 0.037
TrpTrp: 0.186 ± 0.023
TrpTyr: 0.352 ± 0.025
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.403 ± 0.069
TyrCys: 0.206 ± 0.02
TyrAsp: 2.125 ± 0.072
TyrGlu: 1.699 ± 0.066
TyrPhe: 1.92 ± 0.059
TyrGly: 2.373 ± 0.074
TyrHis: 1.048 ± 0.048
TyrIle: 1.781 ± 0.053
TyrLys: 1.125 ± 0.047
TyrLeu: 4.281 ± 0.103
TyrMet: 0.586 ± 0.032
TyrAsn: 1.416 ± 0.054
TyrPro: 1.444 ± 0.048
TyrGln: 2.512 ± 0.069
TyrArg: 1.987 ± 0.062
TyrSer: 1.863 ± 0.053
TyrThr: 1.794 ± 0.069
TyrVal: 2.133 ± 0.064
TyrTrp: 0.426 ± 0.031
TyrTyr: 1.385 ± 0.055
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1720 proteins (525547 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
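One plausible reading of this procedure is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. Using the standard deviation as the error measure is an assumption, as are the function names and the toy sequences; the database's actual implementation may differ.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping residue pairs within a single protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of
    `pair` over `n_rounds` resamples of whole proteins (with replacement)."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Hypothetical toy proteome
toy = ["MAALLK", "MLLAAG", "MKKLAA", "MGGAL"]
mean_aa, err_aa = bootstrap_error(toy, "AA")
```

Resampling at the protein level, rather than shuffling individual residues, keeps each protein's internal composition intact, so the estimated error reflects variation between proteins.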

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski