Amino acid dipeptide frequency for Ruminococcaceae bacterium D16

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
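
As a rough illustration of what such a calculation involves, the sketch below counts overlapping adjacent pairs within each protein and reports them in per mille. It is only a sketch under stated assumptions: the function name `dipeptide_frequencies`, the one-letter input sequences, and the choice not to count pairs across protein boundaries are illustrative and not taken from the Proteome-pI pipeline.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for one-letter sequences.

    Hypothetical helper for illustration; not the Proteome-pI implementation.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs inside one protein; a pair never
        # spans the end of one protein and the start of the next.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (made-up sequences, not from this proteome).
toy = ["MAAL", "AALG"]
freqs = dipeptide_frequencies(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 2))
```

A protein of length n contributes n - 1 pairs, so the toy input above yields six pairs in total.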

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
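
The uniform expectation and the over/under comparison can be sketched as follows. The observed values are copied from the table below; the function `classify` and the use of the plain 1/400 baseline as the colouring threshold are assumptions made for illustration, since the page does not state the exact rule.

```python
N_STANDARD = 20

# Uniform expectation per dipeptide, in per mille.
expected_permille = 1000.0 / N_STANDARD ** 2              # 400 dipeptides
expected_with_xaa_permille = 1000.0 / (N_STANDARD + 1) ** 2  # 441 dipeptides


def classify(observed_permille, expected=expected_permille):
    """Label a value relative to the uniform baseline (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# A few values taken from the table, in per mille.
observed = {"AlaAla": 10.65, "AspAsp": 2.734, "CysCys": 0.327, "TrpTrp": 0.148}
for pair, value in observed.items():
    ratio = value / expected_permille
    print(f"{pair}: {value} per mille, {classify(value)}, ratio {ratio:.2f}")
```

The ratio to the baseline is often more informative than the label alone, because it shows how far a dipeptide departs from the uniform expectation.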

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
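
A minimal sketch of the unit conversion, using the AlaAla value from the table; the helper names are hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille * 0.1


ala_ala = 10.65  # AlaAla value from the table, in per mille
print(f"{permille_to_fraction(ala_ala):.5f}")  # prints 0.01065
print(f"{permille_to_percent(ala_ala):.3f}%")  # prints 1.065%
```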

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
10.65AlaAla: 10.65 ± 0.158
1.491AlaCys: 1.491 ± 0.045
4.922AlaAsp: 4.922 ± 0.071
5.316AlaGlu: 5.316 ± 0.097
3.292AlaPhe: 3.292 ± 0.063
7.302AlaGly: 7.302 ± 0.104
1.433AlaHis: 1.433 ± 0.046
5.224AlaIle: 5.224 ± 0.08
4.521AlaLys: 4.521 ± 0.08
10.264AlaLeu: 10.264 ± 0.123
2.802AlaMet: 2.802 ± 0.049
2.431AlaAsn: 2.431 ± 0.061
3.362AlaPro: 3.362 ± 0.063
4.478AlaGln: 4.478 ± 0.086
4.287AlaArg: 4.287 ± 0.072
4.187AlaSer: 4.187 ± 0.074
3.712AlaThr: 3.712 ± 0.069
7.481AlaVal: 7.481 ± 0.109
0.945AlaTrp: 0.945 ± 0.036
2.647AlaTyr: 2.647 ± 0.051
0.0AlaXaa: 0.0 ± 0.0
Cys
1.488CysAla: 1.488 ± 0.04
0.327CysCys: 0.327 ± 0.022
0.836CysAsp: 0.836 ± 0.03
0.792CysGlu: 0.792 ± 0.033
0.594CysPhe: 0.594 ± 0.025
1.651CysGly: 1.651 ± 0.04
0.321CysHis: 0.321 ± 0.021
0.847CysIle: 0.847 ± 0.033
0.69CysLys: 0.69 ± 0.032
1.426CysLeu: 1.426 ± 0.045
0.386CysMet: 0.386 ± 0.019
0.481CysAsn: 0.481 ± 0.025
0.853CysPro: 0.853 ± 0.034
0.604CysGln: 0.604 ± 0.026
0.993CysArg: 0.993 ± 0.032
0.856CysSer: 0.856 ± 0.029
0.861CysThr: 0.861 ± 0.033
1.095CysVal: 1.095 ± 0.036
0.192CysTrp: 0.192 ± 0.014
0.53CysTyr: 0.53 ± 0.023
0.0CysXaa: 0.0 ± 0.0
Asp
4.696AspAla: 4.696 ± 0.077
0.863AspCys: 0.863 ± 0.032
2.734AspAsp: 2.734 ± 0.072
4.025AspGlu: 4.025 ± 0.086
2.433AspPhe: 2.433 ± 0.049
4.723AspGly: 4.723 ± 0.09
1.058AspHis: 1.058 ± 0.032
3.281AspIle: 3.281 ± 0.064
2.672AspLys: 2.672 ± 0.063
5.433AspLeu: 5.433 ± 0.086
1.548AspMet: 1.548 ± 0.039
1.758AspAsn: 1.758 ± 0.049
2.575AspPro: 2.575 ± 0.06
2.145AspGln: 2.145 ± 0.053
2.994AspArg: 2.994 ± 0.058
2.801AspSer: 2.801 ± 0.061
3.124AspThr: 3.124 ± 0.065
3.672AspVal: 3.672 ± 0.066
0.679AspTrp: 0.679 ± 0.032
2.372AspTyr: 2.372 ± 0.051
0.0AspXaa: 0.0 ± 0.0
Glu
5.709GluAla: 5.709 ± 0.092
0.72GluCys: 0.72 ± 0.028
3.91GluAsp: 3.91 ± 0.073
5.938GluGlu: 5.938 ± 0.112
2.102GluPhe: 2.102 ± 0.049
4.639GluGly: 4.639 ± 0.078
1.452GluHis: 1.452 ± 0.036
3.872GluIle: 3.872 ± 0.07
4.491GluLys: 4.491 ± 0.077
7.21GluLeu: 7.21 ± 0.101
2.01GluMet: 2.01 ± 0.046
2.824GluAsn: 2.824 ± 0.055
2.215GluPro: 2.215 ± 0.052
3.48GluGln: 3.48 ± 0.08
4.021GluArg: 4.021 ± 0.082
2.894GluSer: 2.894 ± 0.052
3.445GluThr: 3.445 ± 0.066
4.14GluVal: 4.14 ± 0.072
0.622GluTrp: 0.622 ± 0.028
2.379GluTyr: 2.379 ± 0.048
0.0GluXaa: 0.0 ± 0.0
Phe
3.221PheAla: 3.221 ± 0.071
0.739PheCys: 0.739 ± 0.028
2.319PheAsp: 2.319 ± 0.053
2.07PheGlu: 2.07 ± 0.048
1.711PhePhe: 1.711 ± 0.05
3.034PheGly: 3.034 ± 0.061
0.856PheHis: 0.856 ± 0.031
1.924PheIle: 1.924 ± 0.053
1.266PheLys: 1.266 ± 0.04
3.985PheLeu: 3.985 ± 0.075
0.847PheMet: 0.847 ± 0.032
1.186PheAsn: 1.186 ± 0.036
1.552PhePro: 1.552 ± 0.041
1.567PheGln: 1.567 ± 0.04
1.857PheArg: 1.857 ± 0.048
2.734PheSer: 2.734 ± 0.06
2.339PheThr: 2.339 ± 0.054
2.596PheVal: 2.596 ± 0.052
0.488PheTrp: 0.488 ± 0.023
1.46PheTyr: 1.46 ± 0.043
0.0PheXaa: 0.0 ± 0.0
Gly
6.147GlyAla: 6.147 ± 0.096
1.38GlyCys: 1.38 ± 0.039
3.882GlyAsp: 3.882 ± 0.073
5.209GlyGlu: 5.209 ± 0.079
2.91GlyPhe: 2.91 ± 0.058
6.161GlyGly: 6.161 ± 0.093
1.367GlyHis: 1.367 ± 0.043
4.751GlyIle: 4.751 ± 0.088
4.649GlyLys: 4.649 ± 0.08
7.339GlyLeu: 7.339 ± 0.1
2.565GlyMet: 2.565 ± 0.058
2.63GlyAsn: 2.63 ± 0.061
2.03GlyPro: 2.03 ± 0.062
2.941GlyGln: 2.941 ± 0.061
3.994GlyArg: 3.994 ± 0.067
4.56GlySer: 4.56 ± 0.091
4.487GlyThr: 4.487 ± 0.075
6.187GlyVal: 6.187 ± 0.083
0.989GlyTrp: 0.989 ± 0.034
3.033GlyTyr: 3.033 ± 0.057
0.0GlyXaa: 0.0 ± 0.0
His
1.313HisAla: 1.313 ± 0.038
0.371HisCys: 0.371 ± 0.019
0.917HisAsp: 0.917 ± 0.028
0.891HisGlu: 0.891 ± 0.032
0.808HisPhe: 0.808 ± 0.031
1.436HisGly: 1.436 ± 0.042
0.452HisHis: 0.452 ± 0.027
1.285HisIle: 1.285 ± 0.043
0.768HisLys: 0.768 ± 0.032
1.689HisLeu: 1.689 ± 0.046
0.507HisMet: 0.507 ± 0.022
0.609HisAsn: 0.609 ± 0.026
1.164HisPro: 1.164 ± 0.032
0.732HisGln: 0.732 ± 0.028
1.008HisArg: 1.008 ± 0.032
0.993HisSer: 0.993 ± 0.03
1.134HisThr: 1.134 ± 0.039
1.108HisVal: 1.108 ± 0.033
0.249HisTrp: 0.249 ± 0.017
0.72HisTyr: 0.72 ± 0.028
0.0HisXaa: 0.0 ± 0.0
Ile
5.091IleAla: 5.091 ± 0.078
1.052IleCys: 1.052 ± 0.032
3.101IleAsp: 3.101 ± 0.06
2.962IleGlu: 2.962 ± 0.067
2.202IlePhe: 2.202 ± 0.05
4.013IleGly: 4.013 ± 0.07
1.167IleHis: 1.167 ± 0.032
3.121IleIle: 3.121 ± 0.075
2.29IleLys: 2.29 ± 0.053
6.02IleLeu: 6.02 ± 0.091
1.305IleMet: 1.305 ± 0.043
1.958IleAsn: 1.958 ± 0.053
3.022IlePro: 3.022 ± 0.055
2.344IleGln: 2.344 ± 0.055
3.191IleArg: 3.191 ± 0.07
3.659IleSer: 3.659 ± 0.07
3.518IleThr: 3.518 ± 0.069
3.908IleVal: 3.908 ± 0.072
0.604IleTrp: 0.604 ± 0.025
1.915IleTyr: 1.915 ± 0.048
0.0IleXaa: 0.0 ± 0.0
Lys
4.671LysAla: 4.671 ± 0.082
0.533LysCys: 0.533 ± 0.025
2.781LysAsp: 2.781 ± 0.055
4.132LysGlu: 4.132 ± 0.083
1.317LysPhe: 1.317 ± 0.036
3.474LysGly: 3.474 ± 0.07
0.847LysHis: 0.847 ± 0.031
2.75LysIle: 2.75 ± 0.056
3.629LysLys: 3.629 ± 0.085
4.724LysLeu: 4.724 ± 0.068
1.417LysMet: 1.417 ± 0.04
1.947LysAsn: 1.947 ± 0.05
1.951LysPro: 1.951 ± 0.047
1.951LysGln: 1.951 ± 0.051
2.681LysArg: 2.681 ± 0.059
2.487LysSer: 2.487 ± 0.051
2.956LysThr: 2.956 ± 0.065
3.502LysVal: 3.502 ± 0.07
0.509LysTrp: 0.509 ± 0.022
1.733LysTyr: 1.733 ± 0.054
0.0LysXaa: 0.0 ± 0.0
Leu
9.595LeuAla: 9.595 ± 0.106
1.917LeuCys: 1.917 ± 0.045
5.867LeuAsp: 5.867 ± 0.083
7.353LeuGlu: 7.353 ± 0.102
3.989LeuPhe: 3.989 ± 0.084
7.341LeuGly: 7.341 ± 0.116
1.66LeuHis: 1.66 ± 0.042
5.068LeuIle: 5.068 ± 0.086
4.643LeuLys: 4.643 ± 0.062
10.582LeuLeu: 10.582 ± 0.151
2.792LeuMet: 2.792 ± 0.055
3.376LeuAsn: 3.376 ± 0.068
4.738LeuPro: 4.738 ± 0.081
3.035LeuGln: 3.035 ± 0.058
5.354LeuArg: 5.354 ± 0.085
7.165LeuSer: 7.165 ± 0.1
6.359LeuThr: 6.359 ± 0.09
6.905LeuVal: 6.905 ± 0.1
1.085LeuTrp: 1.085 ± 0.034
3.138LeuTyr: 3.138 ± 0.054
0.0LeuXaa: 0.0 ± 0.0
Met
2.842MetAla: 2.842 ± 0.061
0.316MetCys: 0.316 ± 0.019
1.91MetAsp: 1.91 ± 0.046
2.338MetGlu: 2.338 ± 0.052
0.848MetPhe: 0.848 ± 0.033
2.225MetGly: 2.225 ± 0.049
0.357MetHis: 0.357 ± 0.021
1.488MetIle: 1.488 ± 0.044
1.782MetLys: 1.782 ± 0.046
2.764MetLeu: 2.764 ± 0.056
0.788MetMet: 0.788 ± 0.031
1.18MetAsn: 1.18 ± 0.037
1.119MetPro: 1.119 ± 0.037
0.857MetGln: 0.857 ± 0.027
1.29MetArg: 1.29 ± 0.037
1.594MetSer: 1.594 ± 0.036
1.729MetThr: 1.729 ± 0.041
2.016MetVal: 2.016 ± 0.043
0.23MetTrp: 0.23 ± 0.016
0.601MetTyr: 0.601 ± 0.024
0.0MetXaa: 0.0 ± 0.0
Asn
2.895AsnAla: 2.895 ± 0.064
0.504AsnCys: 0.504 ± 0.024
1.623AsnAsp: 1.623 ± 0.048
1.833AsnGlu: 1.833 ± 0.05
1.269AsnPhe: 1.269 ± 0.039
2.954AsnGly: 2.954 ± 0.066
0.629AsnHis: 0.629 ± 0.028
2.166AsnIle: 2.166 ± 0.059
1.452AsnLys: 1.452 ± 0.043
3.421AsnLeu: 3.421 ± 0.062
0.934AsnMet: 0.934 ± 0.032
1.167AsnAsn: 1.167 ± 0.042
1.923AsnPro: 1.923 ± 0.047
1.315AsnGln: 1.315 ± 0.035
1.943AsnArg: 1.943 ± 0.046
1.764AsnSer: 1.764 ± 0.047
2.017AsnThr: 2.017 ± 0.057
2.261AsnVal: 2.261 ± 0.057
0.385AsnTrp: 0.385 ± 0.021
1.404AsnTyr: 1.404 ± 0.046
0.0AsnXaa: 0.0 ± 0.0
Pro
3.651ProAla: 3.651 ± 0.072
0.547ProCys: 0.547 ± 0.025
2.776ProAsp: 2.776 ± 0.055
3.852ProGlu: 3.852 ± 0.074
1.689ProPhe: 1.689 ± 0.042
3.14ProGly: 3.14 ± 0.066
0.731ProHis: 0.731 ± 0.027
2.235ProIle: 2.235 ± 0.051
2.051ProLys: 2.051 ± 0.053
3.674ProLeu: 3.674 ± 0.071
1.199ProMet: 1.199 ± 0.034
1.353ProAsn: 1.353 ± 0.043
1.407ProPro: 1.407 ± 0.048
2.0ProGln: 2.0 ± 0.051
1.751ProArg: 1.751 ± 0.048
2.053ProSer: 2.053 ± 0.056
2.113ProThr: 2.113 ± 0.049
3.403ProVal: 3.403 ± 0.067
0.505ProTrp: 0.505 ± 0.027
1.455ProTyr: 1.455 ± 0.044
0.0ProXaa: 0.0 ± 0.0
Gln
4.226GlnAla: 4.226 ± 0.085
0.44GlnCys: 0.44 ± 0.022
1.97GlnAsp: 1.97 ± 0.045
3.319GlnGlu: 3.319 ± 0.066
1.252GlnPhe: 1.252 ± 0.038
2.956GlnGly: 2.956 ± 0.059
0.624GlnHis: 0.624 ± 0.026
2.219GlnIle: 2.219 ± 0.05
2.219GlnLys: 2.219 ± 0.059
4.052GlnLeu: 4.052 ± 0.066
1.286GlnMet: 1.286 ± 0.039
1.512GlnAsn: 1.512 ± 0.047
1.543GlnPro: 1.543 ± 0.039
1.843GlnGln: 1.843 ± 0.053
2.353GlnArg: 2.353 ± 0.055
2.093GlnSer: 2.093 ± 0.048
2.126GlnThr: 2.126 ± 0.048
3.081GlnVal: 3.081 ± 0.063
0.491GlnTrp: 0.491 ± 0.025
1.386GlnTyr: 1.386 ± 0.039
0.0GlnXaa: 0.0 ± 0.0
Arg
4.186ArgAla: 4.186 ± 0.071
0.847ArgCys: 0.847 ± 0.03
2.823ArgAsp: 2.823 ± 0.064
4.392ArgGlu: 4.392 ± 0.081
2.085ArgPhe: 2.085 ± 0.052
3.305ArgGly: 3.305 ± 0.065
0.985ArgHis: 0.985 ± 0.032
2.843ArgIle: 2.843 ± 0.058
2.937ArgLys: 2.937 ± 0.06
5.413ArgLeu: 5.413 ± 0.095
1.679ArgMet: 1.679 ± 0.043
1.712ArgAsn: 1.712 ± 0.045
2.183ArgPro: 2.183 ± 0.045
2.556ArgGln: 2.556 ± 0.051
3.798ArgArg: 3.798 ± 0.094
2.849ArgSer: 2.849 ± 0.055
2.827ArgThr: 2.827 ± 0.059
3.275ArgVal: 3.275 ± 0.07
0.692ArgTrp: 0.692 ± 0.029
1.995ArgTyr: 1.995 ± 0.051
0.0ArgXaa: 0.0 ± 0.0
Ser
4.967SerAla: 4.967 ± 0.085
0.848SerCys: 0.848 ± 0.031
3.048SerAsp: 3.048 ± 0.068
2.947SerGlu: 2.947 ± 0.061
2.365SerPhe: 2.365 ± 0.049
5.2SerGly: 5.2 ± 0.099
1.105SerHis: 1.105 ± 0.032
3.302SerIle: 3.302 ± 0.068
2.397SerLys: 2.397 ± 0.062
5.338SerLeu: 5.338 ± 0.086
1.617SerMet: 1.617 ± 0.041
1.819SerAsn: 1.819 ± 0.052
2.288SerPro: 2.288 ± 0.051
2.513SerGln: 2.513 ± 0.055
3.091SerArg: 3.091 ± 0.058
3.419SerSer: 3.419 ± 0.107
2.949SerThr: 2.949 ± 0.057
4.082SerVal: 4.082 ± 0.077
0.662SerTrp: 0.662 ± 0.027
2.053SerTyr: 2.053 ± 0.058
0.0SerXaa: 0.0 ± 0.0
Thr
5.582ThrAla: 5.582 ± 0.095
0.742ThrCys: 0.742 ± 0.03
3.013ThrAsp: 3.013 ± 0.064
2.967ThrGlu: 2.967 ± 0.058
2.11ThrPhe: 2.11 ± 0.049
4.949ThrGly: 4.949 ± 0.083
0.99ThrHis: 0.99 ± 0.032
3.402ThrIle: 3.402 ± 0.066
2.244ThrLys: 2.244 ± 0.057
5.916ThrLeu: 5.916 ± 0.09
1.437ThrMet: 1.437 ± 0.043
1.665ThrAsn: 1.665 ± 0.048
2.875ThrPro: 2.875 ± 0.057
2.228ThrGln: 2.228 ± 0.048
2.522ThrArg: 2.522 ± 0.052
2.766ThrSer: 2.766 ± 0.069
2.978ThrThr: 2.978 ± 0.071
4.983ThrVal: 4.983 ± 0.083
0.614ThrTrp: 0.614 ± 0.025
1.844ThrTyr: 1.844 ± 0.047
0.0ThrXaa: 0.0 ± 0.0
Val
6.073ValAla: 6.073 ± 0.092
1.392ValCys: 1.392 ± 0.043
4.168ValAsp: 4.168 ± 0.074
4.736ValGlu: 4.736 ± 0.081
2.822ValPhe: 2.822 ± 0.061
5.107ValGly: 5.107 ± 0.077
1.199ValHis: 1.199 ± 0.035
4.18ValIle: 4.18 ± 0.074
3.41ValLys: 3.41 ± 0.07
7.839ValLeu: 7.839 ± 0.096
2.047ValMet: 2.047 ± 0.048
2.41ValAsn: 2.41 ± 0.054
3.049ValPro: 3.049 ± 0.057
2.281ValGln: 2.281 ± 0.046
3.613ValArg: 3.613 ± 0.082
4.553ValSer: 4.553 ± 0.077
4.442ValThr: 4.442 ± 0.078
5.431ValVal: 5.431 ± 0.092
0.833ValTrp: 0.833 ± 0.031
2.443ValTyr: 2.443 ± 0.054
0.0ValXaa: 0.0 ± 0.0
Trp
0.925TrpAla: 0.925 ± 0.034
0.191TrpCys: 0.191 ± 0.017
0.688TrpAsp: 0.688 ± 0.028
0.799TrpGlu: 0.799 ± 0.034
0.467TrpPhe: 0.467 ± 0.022
0.846TrpGly: 0.846 ± 0.034
0.178TrpHis: 0.178 ± 0.014
0.529TrpIle: 0.529 ± 0.024
0.581TrpLys: 0.581 ± 0.026
1.273TrpLeu: 1.273 ± 0.046
0.37TrpMet: 0.37 ± 0.021
0.516TrpAsn: 0.516 ± 0.025
0.332TrpPro: 0.332 ± 0.018
0.557TrpGln: 0.557 ± 0.023
0.611TrpArg: 0.611 ± 0.026
0.654TrpSer: 0.654 ± 0.027
0.523TrpThr: 0.523 ± 0.023
0.695TrpVal: 0.695 ± 0.028
0.148TrpTrp: 0.148 ± 0.014
0.416TrpTyr: 0.416 ± 0.026
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.82TyrAla: 2.82 ± 0.058
0.604TyrCys: 0.604 ± 0.025
2.297TyrAsp: 2.297 ± 0.058
2.206TyrGlu: 2.206 ± 0.052
1.486TyrPhe: 1.486 ± 0.042
2.754TyrGly: 2.754 ± 0.052
0.741TyrHis: 0.741 ± 0.031
1.957TyrIle: 1.957 ± 0.051
1.301TyrLys: 1.301 ± 0.041
3.601TyrLeu: 3.601 ± 0.065
0.737TyrMet: 0.737 ± 0.031
1.325TyrAsn: 1.325 ± 0.047
1.406TyrPro: 1.406 ± 0.041
1.563TyrGln: 1.563 ± 0.038
2.031TyrArg: 2.031 ± 0.049
1.929TyrSer: 1.929 ± 0.051
2.186TyrThr: 2.186 ± 0.052
2.206TyrVal: 2.206 ± 0.056
0.376TyrTrp: 0.376 ± 0.023
1.485TyrTyr: 1.485 ± 0.045
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2987 proteins (924795 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
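
To make the note concrete, here is one way a protein-level bootstrap could look: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. This is a sketch under assumptions, not the database's actual procedure: it assumes "x100" means 100 resampling rounds and that the reported ± value is a standard deviation, and the names `pair_counts` and `bootstrap_error` are hypothetical.

```python
import random
from collections import Counter
from statistics import mean, stdev


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of
    `pair` over bootstrap resamples drawn at the protein level."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, not single residues.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)


# Toy input (made-up sequences, not from this proteome).
toy = ["MAALAAG", "MKAAL", "MGGSAA", "MLLKV"]
avg, err = bootstrap_error(toy, "AA")
print(f"AA: {avg:.1f} ± {err:.1f} per mille")
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.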

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski