Amino acid dipeptide frequency for [Clostridium] fimetarium

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred in proteins with equal probability, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
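The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's actual pipeline: the function names, the use of one-letter codes, the mapping of every non-standard letter to X (Xaa), and the use of the uniform value of 2.5‰ as the "expected" threshold are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: any letter outside the 20 standard ones is treated as X (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs: residues (0,1), (1,2), ... within one protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides: 0.25 % = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"


if __name__ == "__main__":
    toy = ["ACAC", "AAX"]  # hypothetical toy sequences, not real proteins
    freqs = dipeptide_per_mille(toy)
    print(freqs["AC"], classify(freqs["AC"]))
```

With real data, `sequences` would be the protein sequences of the proteome, and the resulting dictionary has 441 entries, one per cell of the table.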

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain plain fractions.
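As a small worked example of the unit conversion, the sketch below converts a per mille value to a fraction and to percent, and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
# Uniform expectation if all 400 standard dipeptides were equally likely.
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille = 0.25 %


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


if __name__ == "__main__":
    ala_ala = 4.632  # AlaAla value from the table, in per mille
    print(f"fraction: {per_mille_to_fraction(ala_ala):.6f}")
    print(f"percent:  {per_mille_to_percent(ala_ala):.4f} %")
    print(f"ratio to uniform expectation: {ala_ala / UNIFORM_PER_MILLE:.3f}")
```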

For more information, see the sequence space article on Wikipedia.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 4.632 ± 0.073 | 0.881 ± 0.029 | 3.547 ± 0.059 | 3.599 ± 0.055 | 2.865 ± 0.051 | 4.349 ± 0.067 | 0.878 ± 0.024 | 6.111 ± 0.08 | 4.838 ± 0.069 | 5.568 ± 0.082 | 1.925 ± 0.041 | 3.103 ± 0.049 | 1.653 ± 0.044 | 1.956 ± 0.037 | 1.977 ± 0.047 | 3.946 ± 0.07 | 3.828 ± 0.102 | 4.625 ± 0.071 | 0.45 ± 0.02 | 2.528 ± 0.047 | 0.0 ± 0.0
Cys | 0.801 ± 0.026 | 0.251 ± 0.015 | 0.828 ± 0.027 | 0.878 ± 0.027 | 0.629 ± 0.022 | 1.234 ± 0.031 | 0.246 ± 0.014 | 1.315 ± 0.034 | 0.965 ± 0.03 | 1.034 ± 0.033 | 0.396 ± 0.018 | 0.773 ± 0.026 | 0.496 ± 0.031 | 0.317 ± 0.016 | 0.451 ± 0.021 | 0.931 ± 0.03 | 0.718 ± 0.026 | 0.915 ± 0.029 | 0.085 ± 0.008 | 0.54 ± 0.019 | 0.0 ± 0.0
Asp | 3.353 ± 0.055 | 0.79 ± 0.027 | 2.937 ± 0.059 | 4.249 ± 0.068 | 2.766 ± 0.042 | 3.765 ± 0.073 | 0.658 ± 0.026 | 5.686 ± 0.07 | 4.58 ± 0.065 | 4.46 ± 0.059 | 1.704 ± 0.037 | 3.357 ± 0.063 | 1.268 ± 0.03 | 1.237 ± 0.033 | 1.762 ± 0.044 | 3.639 ± 0.07 | 3.014 ± 0.054 | 3.615 ± 0.054 | 0.493 ± 0.019 | 2.852 ± 0.055 | 0.0 ± 0.0
Glu | 4.199 ± 0.062 | 0.79 ± 0.025 | 3.624 ± 0.059 | 5.158 ± 0.081 | 2.807 ± 0.05 | 3.385 ± 0.06 | 1.105 ± 0.027 | 6.289 ± 0.093 | 6.124 ± 0.077 | 6.348 ± 0.09 | 1.991 ± 0.041 | 4.667 ± 0.063 | 1.468 ± 0.048 | 2.309 ± 0.045 | 2.374 ± 0.046 | 3.651 ± 0.053 | 3.279 ± 0.053 | 4.159 ± 0.069 | 0.573 ± 0.02 | 3.161 ± 0.054 | 0.0 ± 0.0
Phe | 2.802 ± 0.045 | 0.669 ± 0.023 | 2.768 ± 0.048 | 3.0 ± 0.047 | 1.896 ± 0.047 | 3.044 ± 0.055 | 0.679 ± 0.022 | 3.955 ± 0.064 | 2.891 ± 0.051 | 3.773 ± 0.056 | 1.208 ± 0.029 | 2.356 ± 0.044 | 1.28 ± 0.036 | 1.199 ± 0.031 | 1.309 ± 0.032 | 3.196 ± 0.052 | 2.478 ± 0.04 | 3.088 ± 0.058 | 0.396 ± 0.018 | 1.853 ± 0.041 | 0.0 ± 0.0
Gly | 3.931 ± 0.072 | 1.095 ± 0.031 | 3.181 ± 0.055 | 3.658 ± 0.055 | 3.154 ± 0.06 | 4.047 ± 0.078 | 1.004 ± 0.031 | 6.602 ± 0.072 | 5.142 ± 0.068 | 5.261 ± 0.069 | 2.002 ± 0.04 | 3.514 ± 0.07 | 1.114 ± 0.06 | 1.709 ± 0.04 | 2.115 ± 0.049 | 3.902 ± 0.065 | 4.023 ± 0.081 | 4.389 ± 0.061 | 0.593 ± 0.021 | 3.158 ± 0.049 | 0.0 ± 0.0
His | 0.743 ± 0.025 | 0.273 ± 0.015 | 0.753 ± 0.027 | 0.91 ± 0.031 | 0.759 ± 0.021 | 0.988 ± 0.027 | 0.3 ± 0.018 | 1.478 ± 0.038 | 1.002 ± 0.027 | 1.234 ± 0.031 | 0.449 ± 0.019 | 0.87 ± 0.027 | 0.601 ± 0.023 | 0.429 ± 0.02 | 0.523 ± 0.022 | 0.977 ± 0.032 | 0.779 ± 0.024 | 0.803 ± 0.028 | 0.133 ± 0.01 | 0.663 ± 0.025 | 0.0 ± 0.0
Ile | 6.366 ± 0.077 | 1.456 ± 0.035 | 5.388 ± 0.071 | 6.253 ± 0.082 | 4.031 ± 0.07 | 5.926 ± 0.095 | 1.372 ± 0.035 | 8.772 ± 0.114 | 6.721 ± 0.09 | 8.288 ± 0.096 | 2.622 ± 0.052 | 5.326 ± 0.075 | 3.171 ± 0.052 | 2.551 ± 0.048 | 3.022 ± 0.051 | 6.959 ± 0.074 | 5.418 ± 0.077 | 6.173 ± 0.07 | 0.697 ± 0.025 | 3.626 ± 0.063 | 0.0 ± 0.0
Lys | 4.75 ± 0.065 | 0.875 ± 0.027 | 4.602 ± 0.07 | 6.429 ± 0.087 | 2.546 ± 0.045 | 4.083 ± 0.054 | 1.083 ± 0.028 | 6.783 ± 0.082 | 6.844 ± 0.09 | 6.531 ± 0.065 | 2.52 ± 0.046 | 4.949 ± 0.068 | 1.853 ± 0.036 | 2.443 ± 0.044 | 2.803 ± 0.052 | 4.671 ± 0.066 | 4.255 ± 0.059 | 4.997 ± 0.059 | 0.636 ± 0.025 | 3.717 ± 0.064 | 0.0 ± 0.0
Leu | 5.562 ± 0.067 | 1.276 ± 0.031 | 4.914 ± 0.061 | 5.774 ± 0.075 | 3.905 ± 0.067 | 5.419 ± 0.075 | 1.26 ± 0.032 | 7.578 ± 0.094 | 6.654 ± 0.075 | 7.947 ± 0.091 | 2.501 ± 0.041 | 4.904 ± 0.071 | 2.728 ± 0.045 | 2.507 ± 0.044 | 2.8 ± 0.047 | 6.509 ± 0.082 | 4.666 ± 0.056 | 5.543 ± 0.069 | 0.618 ± 0.02 | 3.304 ± 0.052 | 0.0 ± 0.0
Met | 1.921 ± 0.04 | 0.32 ± 0.015 | 1.64 ± 0.036 | 1.979 ± 0.043 | 1.116 ± 0.027 | 1.83 ± 0.043 | 0.43 ± 0.019 | 2.592 ± 0.048 | 2.727 ± 0.053 | 2.608 ± 0.055 | 0.846 ± 0.027 | 1.942 ± 0.039 | 0.96 ± 0.024 | 0.994 ± 0.03 | 0.924 ± 0.024 | 1.892 ± 0.034 | 1.51 ± 0.031 | 1.763 ± 0.038 | 0.194 ± 0.012 | 1.006 ± 0.027 | 0.0 ± 0.0
Asn | 3.581 ± 0.06 | 0.794 ± 0.028 | 3.161 ± 0.054 | 3.963 ± 0.057 | 2.147 ± 0.037 | 4.109 ± 0.067 | 0.895 ± 0.025 | 5.817 ± 0.079 | 4.369 ± 0.07 | 4.746 ± 0.063 | 1.761 ± 0.037 | 3.803 ± 0.069 | 2.002 ± 0.039 | 1.892 ± 0.042 | 1.935 ± 0.038 | 3.905 ± 0.067 | 3.163 ± 0.058 | 3.849 ± 0.052 | 0.446 ± 0.022 | 2.628 ± 0.052 | 0.0 ± 0.0
Pro | 1.606 ± 0.038 | 0.352 ± 0.021 | 1.716 ± 0.038 | 2.156 ± 0.054 | 1.444 ± 0.037 | 1.605 ± 0.045 | 0.419 ± 0.017 | 2.673 ± 0.055 | 1.811 ± 0.041 | 2.257 ± 0.039 | 0.746 ± 0.026 | 1.456 ± 0.038 | 0.543 ± 0.022 | 0.919 ± 0.026 | 0.738 ± 0.026 | 1.777 ± 0.037 | 1.833 ± 0.053 | 2.117 ± 0.046 | 0.258 ± 0.014 | 1.316 ± 0.031 | 0.0 ± 0.0
Gln | 1.828 ± 0.048 | 0.361 ± 0.019 | 1.415 ± 0.033 | 2.052 ± 0.048 | 1.215 ± 0.028 | 1.61 ± 0.036 | 0.376 ± 0.015 | 2.802 ± 0.048 | 2.429 ± 0.041 | 2.814 ± 0.049 | 1.022 ± 0.029 | 1.921 ± 0.042 | 0.746 ± 0.032 | 1.077 ± 0.039 | 1.077 ± 0.03 | 1.729 ± 0.042 | 1.588 ± 0.037 | 1.817 ± 0.039 | 0.261 ± 0.014 | 1.517 ± 0.033 | 0.0 ± 0.0
Arg | 1.842 ± 0.039 | 0.447 ± 0.018 | 1.673 ± 0.037 | 2.393 ± 0.048 | 1.498 ± 0.034 | 1.844 ± 0.047 | 0.514 ± 0.02 | 3.102 ± 0.05 | 2.851 ± 0.044 | 2.865 ± 0.053 | 1.099 ± 0.029 | 2.072 ± 0.042 | 0.891 ± 0.035 | 1.071 ± 0.029 | 1.335 ± 0.032 | 1.751 ± 0.038 | 1.774 ± 0.042 | 2.04 ± 0.045 | 0.281 ± 0.015 | 1.443 ± 0.033 | 0.0 ± 0.0
Ser | 3.973 ± 0.068 | 0.79 ± 0.028 | 3.853 ± 0.065 | 4.194 ± 0.062 | 3.051 ± 0.054 | 4.746 ± 0.065 | 0.904 ± 0.026 | 6.291 ± 0.081 | 4.948 ± 0.065 | 5.384 ± 0.069 | 1.815 ± 0.037 | 3.883 ± 0.062 | 1.593 ± 0.037 | 2.03 ± 0.045 | 2.141 ± 0.044 | 4.582 ± 0.081 | 3.95 ± 0.075 | 4.506 ± 0.072 | 0.525 ± 0.021 | 2.864 ± 0.054 | 0.0 ± 0.0
Thr | 3.978 ± 0.1 | 0.628 ± 0.021 | 3.231 ± 0.06 | 3.139 ± 0.052 | 2.498 ± 0.048 | 4.142 ± 0.09 | 0.844 ± 0.027 | 5.324 ± 0.075 | 3.8 ± 0.055 | 4.849 ± 0.065 | 1.377 ± 0.028 | 3.102 ± 0.053 | 1.921 ± 0.043 | 1.778 ± 0.053 | 1.659 ± 0.038 | 3.844 ± 0.078 | 3.562 ± 0.082 | 4.293 ± 0.078 | 0.412 ± 0.019 | 2.433 ± 0.049 | 0.0 ± 0.0
Val | 4.488 ± 0.063 | 1.006 ± 0.03 | 3.859 ± 0.06 | 4.25 ± 0.065 | 2.948 ± 0.048 | 4.105 ± 0.063 | 0.86 ± 0.026 | 6.339 ± 0.086 | 4.981 ± 0.061 | 5.999 ± 0.078 | 1.799 ± 0.04 | 3.661 ± 0.057 | 1.966 ± 0.045 | 1.688 ± 0.032 | 1.966 ± 0.041 | 4.652 ± 0.076 | 4.165 ± 0.079 | 4.784 ± 0.077 | 0.477 ± 0.018 | 2.561 ± 0.047 | 0.0 ± 0.0
Trp | 0.449 ± 0.022 | 0.121 ± 0.012 | 0.499 ± 0.021 | 0.475 ± 0.019 | 0.37 ± 0.017 | 0.577 ± 0.023 | 0.141 ± 0.011 | 0.747 ± 0.025 | 0.64 ± 0.024 | 0.673 ± 0.025 | 0.232 ± 0.013 | 0.556 ± 0.022 | 0.18 ± 0.013 | 0.249 ± 0.014 | 0.257 ± 0.019 | 0.513 ± 0.022 | 0.375 ± 0.019 | 0.46 ± 0.018 | 0.095 ± 0.008 | 0.362 ± 0.017 | 0.0 ± 0.0
Tyr | 2.452 ± 0.045 | 0.63 ± 0.023 | 2.594 ± 0.044 | 2.952 ± 0.053 | 2.194 ± 0.038 | 2.785 ± 0.049 | 0.7 ± 0.028 | 3.881 ± 0.057 | 3.056 ± 0.05 | 3.731 ± 0.054 | 1.177 ± 0.032 | 2.751 ± 0.054 | 1.307 ± 0.033 | 1.302 ± 0.037 | 1.616 ± 0.036 | 2.998 ± 0.053 | 2.438 ± 0.055 | 2.618 ± 0.048 | 0.348 ± 0.018 | 2.087 ± 0.043 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 4150 proteins (1311489 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
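A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation across resamples is reported as the error. Whether the database reports exactly this statistic is not stated in the source, and the function names and the fixed random seed are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping residue pairs within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_per_mille(sequences, dipeptide, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Resampling is done at the protein level: each resample draws
    len(sequences) whole proteins with replacement.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["ACAC", "AAAA", "CCCC", "ACCA"]  # hypothetical toy sequences
    mean, sd = bootstrap_per_mille(toy, "AC")
    print(f"AC: {mean:.3f} ± {sd:.3f} per mille")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the error reflects variation between proteins.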

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski