Amino acid dipeptide frequency for Thioclava indica

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
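As a rough illustration of how such a frequency can be computed, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein and normalised by the total number of pairs; the page does not state the exact procedure used by Proteome-pI, and the function name, the 'X' code for Xaa and the toy sequences are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else is treated as Xaa ('X').
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all given protein sequences.

    Assumption: overlapping adjacent pairs are counted, and pairs never span
    two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real Thioclava indica sequences).
toy = ["MAAL", "AALG"]
freqs = dipeptide_permille(toy)
```

With real data, the sequences would be read from the proteome FASTA file, and the resulting values could be compared with the table on this page.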

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
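The expected value and the over/under-representation call can be sketched as follows. This assumes the 400-dipeptide baseline is the threshold used for the colouring, which the page implies but does not state explicitly; the function name is hypothetical and the example values are copied from the table on this page.

```python
# Uniform expectation over the 20 x 20 standard dipeptides, in per mille.
N_STANDARD = 20
expected_permille = 1000.0 / (N_STANDARD ** 2)  # 2.5 per mille = 0.25%


def classify(freq_permille, expected=expected_permille):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"   # shown in red on the page
    if freq_permille < expected:
        return "under"  # shown in blue on the page
    return "as expected"


# A few values taken from the table (per mille).
examples = {"AlaAla": 15.959, "LeuLeu": 8.882, "CysCys": 0.094, "TrpTrp": 0.222}
labels = {name: classify(value) for name, value in examples.items()}
```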

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions; for example, 15.959‰ corresponds to a fraction of 0.015959 (about 1.6%).

For more information, see the sequence space article on Wikipedia.

First residue (rows) / second residue (columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 15.959 ± 0.206
AlaCys: 1.063 ± 0.035
AlaAsp: 6.753 ± 0.085
AlaGlu: 6.991 ± 0.119
AlaPhe: 4.143 ± 0.067
AlaGly: 10.357 ± 0.145
AlaHis: 2.461 ± 0.053
AlaIle: 6.353 ± 0.079
AlaLys: 4.772 ± 0.078
AlaLeu: 14.452 ± 0.16
AlaMet: 4.031 ± 0.073
AlaAsn: 2.93 ± 0.054
AlaPro: 6.338 ± 0.081
AlaGln: 5.829 ± 0.086
AlaArg: 9.088 ± 0.119
AlaSer: 6.088 ± 0.107
AlaThr: 6.06 ± 0.1
AlaVal: 7.813 ± 0.1
AlaTrp: 1.371 ± 0.037
AlaTyr: 2.489 ± 0.048
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.133 ± 0.033
CysCys: 0.094 ± 0.01
CysAsp: 0.617 ± 0.024
CysGlu: 0.44 ± 0.021
CysPhe: 0.303 ± 0.017
CysGly: 0.838 ± 0.031
CysHis: 0.258 ± 0.017
CysIle: 0.401 ± 0.021
CysLys: 0.241 ± 0.015
CysLeu: 0.795 ± 0.027
CysMet: 0.138 ± 0.012
CysAsn: 0.219 ± 0.013
CysPro: 0.488 ± 0.022
CysGln: 0.235 ± 0.017
CysArg: 0.473 ± 0.02
CysSer: 0.434 ± 0.02
CysThr: 0.43 ± 0.022
CysVal: 0.606 ± 0.027
CysTrp: 0.107 ± 0.01
CysTyr: 0.213 ± 0.013
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.221 ± 0.1
AspCys: 0.514 ± 0.022
AspAsp: 3.245 ± 0.087
AspGlu: 3.273 ± 0.064
AspPhe: 2.334 ± 0.05
AspGly: 5.158 ± 0.087
AspHis: 1.379 ± 0.043
AspIle: 3.012 ± 0.054
AspLys: 1.935 ± 0.044
AspLeu: 6.724 ± 0.091
AspMet: 1.616 ± 0.045
AspAsn: 1.248 ± 0.037
AspPro: 3.761 ± 0.061
AspGln: 2.096 ± 0.039
AspArg: 3.606 ± 0.063
AspSer: 2.348 ± 0.05
AspThr: 3.014 ± 0.065
AspVal: 3.899 ± 0.058
AspTrp: 1.231 ± 0.039
AspTyr: 1.548 ± 0.039
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.464 ± 0.119
GluCys: 0.321 ± 0.016
GluAsp: 2.982 ± 0.067
GluGlu: 2.768 ± 0.063
GluPhe: 1.787 ± 0.038
GluGly: 4.61 ± 0.067
GluHis: 1.105 ± 0.032
GluIle: 3.701 ± 0.073
GluLys: 2.19 ± 0.054
GluLeu: 4.865 ± 0.082
GluMet: 1.895 ± 0.048
GluAsn: 1.705 ± 0.041
GluPro: 2.33 ± 0.053
GluGln: 1.924 ± 0.049
GluArg: 3.896 ± 0.076
GluSer: 2.288 ± 0.049
GluThr: 3.689 ± 0.064
GluVal: 3.966 ± 0.07
GluTrp: 0.667 ± 0.03
GluTyr: 0.996 ± 0.03
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.606 ± 0.084
PheCys: 0.384 ± 0.019
PheAsp: 2.783 ± 0.055
PheGlu: 2.103 ± 0.042
PhePhe: 1.387 ± 0.046
PheGly: 3.693 ± 0.054
PheHis: 0.742 ± 0.027
PheIle: 1.674 ± 0.04
PheLys: 1.031 ± 0.033
PheLeu: 3.413 ± 0.066
PheMet: 0.868 ± 0.029
PheAsn: 1.031 ± 0.031
PhePro: 1.509 ± 0.037
PheGln: 0.932 ± 0.029
PheArg: 1.942 ± 0.049
PheSer: 2.252 ± 0.05
PheThr: 2.155 ± 0.045
PheVal: 2.753 ± 0.053
PheTrp: 0.638 ± 0.028
PheTyr: 0.939 ± 0.034
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.639 ± 0.145
GlyCys: 0.808 ± 0.024
GlyAsp: 4.712 ± 0.085
GlyGlu: 4.544 ± 0.066
GlyPhe: 3.686 ± 0.064
GlyGly: 7.139 ± 0.128
GlyHis: 1.856 ± 0.049
GlyIle: 4.459 ± 0.066
GlyLys: 3.482 ± 0.073
GlyLeu: 9.137 ± 0.103
GlyMet: 2.552 ± 0.053
GlyAsn: 2.147 ± 0.067
GlyPro: 3.618 ± 0.057
GlyGln: 3.204 ± 0.062
GlyArg: 5.283 ± 0.083
GlySer: 4.239 ± 0.079
GlyThr: 4.444 ± 0.093
GlyVal: 6.526 ± 0.079
GlyTrp: 1.485 ± 0.035
GlyTyr: 2.28 ± 0.049
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.337 ± 0.052
HisCys: 0.21 ± 0.015
HisAsp: 1.307 ± 0.042
HisGlu: 1.1 ± 0.032
HisPhe: 0.835 ± 0.031
HisGly: 1.831 ± 0.049
HisHis: 0.537 ± 0.024
HisIle: 1.036 ± 0.031
HisLys: 0.557 ± 0.021
HisLeu: 2.159 ± 0.043
HisMet: 0.562 ± 0.024
HisAsn: 0.492 ± 0.022
HisPro: 1.345 ± 0.038
HisGln: 0.582 ± 0.023
HisArg: 1.311 ± 0.036
HisSer: 1.063 ± 0.029
HisThr: 0.823 ± 0.027
HisVal: 1.375 ± 0.036
HisTrp: 0.342 ± 0.017
HisTyr: 0.561 ± 0.021
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.806 ± 0.089
IleCys: 0.678 ± 0.024
IleAsp: 3.434 ± 0.062
IleGlu: 3.611 ± 0.066
IlePhe: 1.896 ± 0.041
IleGly: 5.086 ± 0.073
IleHis: 0.934 ± 0.031
IleIle: 2.22 ± 0.056
IleLys: 1.661 ± 0.045
IleLeu: 4.77 ± 0.076
IleMet: 1.072 ± 0.03
IleAsn: 1.468 ± 0.042
IlePro: 2.393 ± 0.051
IleGln: 1.122 ± 0.034
IleArg: 3.029 ± 0.055
IleSer: 3.246 ± 0.062
IleThr: 3.204 ± 0.064
IleVal: 3.921 ± 0.064
IleTrp: 0.822 ± 0.029
IleTyr: 1.265 ± 0.033
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.132 ± 0.073
LysCys: 0.196 ± 0.012
LysAsp: 1.87 ± 0.046
LysGlu: 1.606 ± 0.046
LysPhe: 1.064 ± 0.031
LysGly: 3.004 ± 0.057
LysHis: 0.669 ± 0.025
LysIle: 2.092 ± 0.046
LysLys: 1.383 ± 0.046
LysLeu: 3.559 ± 0.068
LysMet: 1.136 ± 0.03
LysAsn: 0.95 ± 0.027
LysPro: 2.106 ± 0.056
LysGln: 1.041 ± 0.033
LysArg: 2.43 ± 0.05
LysSer: 2.062 ± 0.045
LysThr: 2.205 ± 0.049
LysVal: 2.569 ± 0.06
LysTrp: 0.444 ± 0.02
LysTyr: 0.71 ± 0.027
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 13.318 ± 0.143
LeuCys: 0.885 ± 0.031
LeuAsp: 5.941 ± 0.081
LeuGlu: 5.312 ± 0.089
LeuPhe: 3.51 ± 0.067
LeuGly: 8.853 ± 0.112
LeuHis: 1.965 ± 0.042
LeuIle: 5.745 ± 0.09
LeuLys: 3.381 ± 0.061
LeuLeu: 8.882 ± 0.125
LeuMet: 2.809 ± 0.054
LeuAsn: 2.779 ± 0.05
LeuPro: 5.526 ± 0.069
LeuGln: 2.81 ± 0.051
LeuArg: 7.092 ± 0.091
LeuSer: 6.925 ± 0.071
LeuThr: 5.576 ± 0.079
LeuVal: 6.76 ± 0.088
LeuTrp: 1.332 ± 0.039
LeuTyr: 1.943 ± 0.043
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.654 ± 0.06
MetCys: 0.18 ± 0.012
MetAsp: 1.284 ± 0.033
MetGlu: 1.303 ± 0.042
MetPhe: 0.856 ± 0.026
MetGly: 2.398 ± 0.051
MetHis: 0.455 ± 0.02
MetIle: 1.695 ± 0.043
MetLys: 1.163 ± 0.039
MetLeu: 2.767 ± 0.057
MetMet: 0.889 ± 0.033
MetAsn: 0.859 ± 0.03
MetPro: 1.57 ± 0.04
MetGln: 0.991 ± 0.032
MetArg: 2.014 ± 0.045
MetSer: 1.822 ± 0.041
MetThr: 2.01 ± 0.039
MetVal: 1.916 ± 0.045
MetTrp: 0.249 ± 0.016
MetTyr: 0.328 ± 0.014
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.344 ± 0.068
AsnCys: 0.258 ± 0.016
AsnAsp: 1.491 ± 0.049
AsnGlu: 1.2 ± 0.034
AsnPhe: 0.993 ± 0.03
AsnGly: 2.366 ± 0.053
AsnHis: 0.51 ± 0.023
AsnIle: 1.438 ± 0.041
AsnLys: 0.793 ± 0.025
AsnLeu: 2.609 ± 0.046
AsnMet: 0.697 ± 0.024
AsnAsn: 0.677 ± 0.031
AsnPro: 1.852 ± 0.044
AsnGln: 0.71 ± 0.026
AsnArg: 1.672 ± 0.039
AsnSer: 1.304 ± 0.037
AsnThr: 1.39 ± 0.056
AsnVal: 1.875 ± 0.054
AsnTrp: 0.462 ± 0.02
AsnTyr: 0.682 ± 0.026
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.993 ± 0.072
ProCys: 0.334 ± 0.019
ProAsp: 4.05 ± 0.064
ProGlu: 4.069 ± 0.067
ProPhe: 2.017 ± 0.046
ProGly: 4.355 ± 0.069
ProHis: 1.03 ± 0.031
ProIle: 2.346 ± 0.052
ProLys: 1.919 ± 0.043
ProLeu: 4.596 ± 0.058
ProMet: 1.326 ± 0.027
ProAsn: 1.344 ± 0.033
ProPro: 2.287 ± 0.053
ProGln: 1.795 ± 0.041
ProArg: 2.789 ± 0.061
ProSer: 2.592 ± 0.057
ProThr: 2.344 ± 0.05
ProVal: 4.078 ± 0.055
ProTrp: 0.676 ± 0.023
ProTyr: 1.127 ± 0.032
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.377 ± 0.068
GlnCys: 0.211 ± 0.014
GlnAsp: 1.801 ± 0.046
GlnGlu: 1.468 ± 0.035
GlnPhe: 1.203 ± 0.035
GlnGly: 2.943 ± 0.053
GlnHis: 0.591 ± 0.022
GlnIle: 2.358 ± 0.048
GlnLys: 1.204 ± 0.035
GlnLeu: 2.956 ± 0.057
GlnMet: 1.207 ± 0.032
GlnAsn: 0.96 ± 0.034
GlnPro: 1.708 ± 0.048
GlnGln: 1.117 ± 0.035
GlnArg: 2.16 ± 0.055
GlnSer: 2.081 ± 0.044
GlnThr: 1.995 ± 0.039
GlnVal: 2.452 ± 0.045
GlnTrp: 0.436 ± 0.021
GlnTyr: 0.633 ± 0.021
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.583 ± 0.113
ArgCys: 0.455 ± 0.021
ArgAsp: 4.192 ± 0.069
ArgGlu: 3.833 ± 0.073
ArgPhe: 2.566 ± 0.048
ArgGly: 4.392 ± 0.074
ArgHis: 1.485 ± 0.041
ArgIle: 3.629 ± 0.059
ArgLys: 2.434 ± 0.054
ArgLeu: 6.877 ± 0.101
ArgMet: 1.86 ± 0.045
ArgAsn: 1.715 ± 0.042
ArgPro: 3.025 ± 0.057
ArgGln: 2.153 ± 0.055
ArgArg: 4.439 ± 0.083
ArgSer: 3.091 ± 0.062
ArgThr: 2.601 ± 0.048
ArgVal: 4.478 ± 0.066
ArgTrp: 0.864 ± 0.029
ArgTyr: 1.501 ± 0.041
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.332 ± 0.104
SerCys: 0.443 ± 0.023
SerAsp: 3.615 ± 0.072
SerGlu: 3.005 ± 0.059
SerPhe: 2.19 ± 0.049
SerGly: 5.72 ± 0.107
SerHis: 1.105 ± 0.031
SerIle: 2.461 ± 0.046
SerLys: 1.814 ± 0.045
SerLeu: 5.203 ± 0.071
SerMet: 1.39 ± 0.038
SerAsn: 1.476 ± 0.039
SerPro: 2.486 ± 0.048
SerGln: 1.824 ± 0.044
SerArg: 3.099 ± 0.067
SerSer: 2.826 ± 0.063
SerThr: 2.566 ± 0.069
SerVal: 3.969 ± 0.074
SerTrp: 0.71 ± 0.028
SerTyr: 1.451 ± 0.035
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.909 ± 0.1
ThrCys: 0.496 ± 0.022
ThrAsp: 2.979 ± 0.056
ThrGlu: 2.685 ± 0.047
ThrPhe: 1.874 ± 0.044
ThrGly: 5.136 ± 0.1
ThrHis: 1.141 ± 0.036
ThrIle: 2.755 ± 0.064
ThrLys: 1.686 ± 0.041
ThrLeu: 6.133 ± 0.093
ThrMet: 1.246 ± 0.035
ThrAsn: 1.337 ± 0.045
ThrPro: 3.592 ± 0.053
ThrGln: 1.951 ± 0.043
ThrArg: 3.536 ± 0.064
ThrSer: 2.688 ± 0.07
ThrThr: 2.804 ± 0.083
ThrVal: 3.948 ± 0.081
ThrTrp: 0.617 ± 0.025
ThrTyr: 1.246 ± 0.036
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.721 ± 0.089
ValCys: 0.571 ± 0.022
ValAsp: 3.736 ± 0.061
ValGlu: 3.993 ± 0.068
ValPhe: 2.736 ± 0.055
ValGly: 5.195 ± 0.073
ValHis: 1.255 ± 0.033
ValIle: 4.516 ± 0.063
ValLys: 2.44 ± 0.054
ValLeu: 7.415 ± 0.094
ValMet: 2.202 ± 0.049
ValAsn: 1.959 ± 0.051
ValPro: 3.362 ± 0.053
ValGln: 2.16 ± 0.041
ValArg: 3.708 ± 0.058
ValSer: 4.313 ± 0.078
ValThr: 4.666 ± 0.09
ValVal: 5.393 ± 0.082
ValTrp: 0.944 ± 0.032
ValTyr: 1.45 ± 0.035
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.377 ± 0.037
TrpCys: 0.124 ± 0.012
TrpAsp: 0.749 ± 0.029
TrpGlu: 0.612 ± 0.025
TrpPhe: 0.549 ± 0.024
TrpGly: 1.074 ± 0.036
TrpHis: 0.356 ± 0.017
TrpIle: 0.747 ± 0.028
TrpLys: 0.494 ± 0.021
TrpLeu: 1.729 ± 0.045
TrpMet: 0.412 ± 0.021
TrpAsn: 0.403 ± 0.02
TrpPro: 0.677 ± 0.025
TrpGln: 0.643 ± 0.028
TrpArg: 1.13 ± 0.04
TrpSer: 0.844 ± 0.031
TrpThr: 0.62 ± 0.023
TrpVal: 0.94 ± 0.033
TrpTrp: 0.222 ± 0.015
TrpTyr: 0.282 ± 0.015
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.438 ± 0.043
TyrCys: 0.238 ± 0.013
TyrAsp: 1.621 ± 0.035
TyrGlu: 1.234 ± 0.041
TyrPhe: 0.906 ± 0.025
TyrGly: 2.091 ± 0.051
TyrHis: 0.53 ± 0.023
TyrIle: 0.996 ± 0.029
TyrLys: 0.651 ± 0.027
TyrLeu: 2.304 ± 0.046
TyrMet: 0.494 ± 0.02
TyrAsn: 0.626 ± 0.024
TyrPro: 1.095 ± 0.035
TyrGln: 0.74 ± 0.024
TyrArg: 1.474 ± 0.038
TyrSer: 1.176 ± 0.038
TyrThr: 1.167 ± 0.036
TyrVal: 1.497 ± 0.035
TyrTrp: 0.364 ± 0.02
TyrTyr: 0.606 ± 0.024
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3671 proteins (1133096 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
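A protein-level bootstrap of this kind can be sketched in Python as below. This is only an illustration under stated assumptions: whole proteins are resampled with replacement 100 times, and the reported error is taken to be the standard deviation of the resampled frequencies. The page does not say which spread measure Proteome-pI reports, and the function names and toy sequences are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Estimate the spread (per mille) of one dipeptide frequency.

    Assumption: proteins, not individual dipeptides, are the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Toy input (not real Thioclava indica sequences).
toy = ["MAAL", "AALG", "GGGG"]
err = bootstrap_error(toy, "AA")
```

Resampling whole proteins keeps the dependence between neighbouring dipeptides within a protein intact, which resampling individual dipeptides would not.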

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski