Amino acid dipeptide frequency for Actinomyces urogenitalis DSM 15434

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
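
As a rough illustration of what such a calculation involves, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. It is not the code used to build this page: the function name `dipeptide_frequencies`, the one-letter residue codes, and the toy input are assumptions made for the example.

```python
from collections import Counter

def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille over all sequences.

    Overlapping windows of length 2 are counted inside each protein;
    a pair never spans two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    # An empty input gives an empty dict, so no division by zero occurs.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical, not taken from the proteome)
toy = ["MAAL", "AAG"]
freqs = dipeptide_frequencies(toy)
```

With the toy input there are five dipeptides in total (MA, AA, AL, AA, AG), so AA accounts for two of the five.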

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
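
The expected value quoted above follows from simple arithmetic, which can be sketched as below. The helper names are hypothetical, and using the uniform expectation as the threshold for "over" and "under" is an assumption based on the paragraph above; the page may apply a different rule when colouring cells.

```python
def expected_uniform_permille(alphabet_size=20):
    """Frequency (in per mille) of each dipeptide if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_permille, alphabet_size=20):
    """Compare an observed frequency with the uniform expectation."""
    expected = expected_uniform_permille(alphabet_size)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"

expected_20 = expected_uniform_permille(20)   # 400 dipeptides
expected_21 = expected_uniform_permille(21)   # 441 dipeptides, Xaa included
ala_ala = classify(18.713)                    # AlaAla value from the table
cys_cys = classify(0.09)                      # CysCys value from the table
```

With 20 amino acids the expectation is 2.5‰; counting Xaa as a 21st residue lowers it slightly, to about 2.27‰.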

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
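
A minimal sketch of the unit conversion, using the AlaAla value from the table as input; the function names are hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent."""
    return value_permille / 10.0

ala_ala_fraction = permille_to_fraction(18.713)
ala_ala_percent = permille_to_percent(18.713)
```

So 18.713‰ corresponds to a fraction of about 0.0187, or roughly 1.87%.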

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first residue. Within each group the second residue runs in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 18.713 ± 0.227
AlaCys: 1.298 ± 0.044
AlaAsp: 7.463 ± 0.122
AlaGlu: 7.965 ± 0.128
AlaPhe: 3.168 ± 0.06
AlaGly: 12.985 ± 0.172
AlaHis: 2.654 ± 0.057
AlaIle: 4.257 ± 0.085
AlaLys: 2.622 ± 0.073
AlaLeu: 14.141 ± 0.164
AlaMet: 3.132 ± 0.072
AlaAsn: 2.008 ± 0.057
AlaPro: 6.377 ± 0.128
AlaGln: 5.424 ± 0.092
AlaArg: 9.778 ± 0.165
AlaSer: 8.095 ± 0.124
AlaThr: 7.669 ± 0.121
AlaVal: 11.492 ± 0.139
AlaTrp: 2.168 ± 0.068
AlaTyr: 2.464 ± 0.057
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.988 ± 0.035
CysCys: 0.09 ± 0.012
CysAsp: 0.399 ± 0.023
CysGlu: 0.446 ± 0.023
CysPhe: 0.203 ± 0.015
CysGly: 0.885 ± 0.037
CysHis: 0.194 ± 0.015
CysIle: 0.202 ± 0.015
CysLys: 0.101 ± 0.011
CysLeu: 0.86 ± 0.035
CysMet: 0.139 ± 0.013
CysAsn: 0.138 ± 0.013
CysPro: 0.5 ± 0.031
CysGln: 0.315 ± 0.02
CysArg: 0.483 ± 0.026
CysSer: 0.45 ± 0.025
CysThr: 0.438 ± 0.023
CysVal: 0.658 ± 0.031
CysTrp: 0.139 ± 0.015
CysTyr: 0.159 ± 0.013
CysXaa: 0.0 ± 0.0

Asp
AspAla: 7.424 ± 0.123
AspCys: 0.377 ± 0.02
AspAsp: 3.333 ± 0.073
AspGlu: 3.829 ± 0.084
AspPhe: 1.514 ± 0.047
AspGly: 5.748 ± 0.102
AspHis: 1.306 ± 0.041
AspIle: 1.729 ± 0.054
AspLys: 1.244 ± 0.048
AspLeu: 6.491 ± 0.113
AspMet: 0.946 ± 0.036
AspAsn: 0.988 ± 0.04
AspPro: 4.054 ± 0.077
AspGln: 1.801 ± 0.051
AspArg: 3.522 ± 0.064
AspSer: 2.895 ± 0.07
AspThr: 2.735 ± 0.054
AspVal: 5.053 ± 0.095
AspTrp: 0.87 ± 0.037
AspTyr: 1.332 ± 0.043
AspXaa: 0.0 ± 0.0

Glu
GluAla: 8.513 ± 0.13
GluCys: 0.319 ± 0.021
GluAsp: 3.549 ± 0.079
GluGlu: 4.023 ± 0.091
GluPhe: 1.123 ± 0.035
GluGly: 4.655 ± 0.079
GluHis: 1.534 ± 0.051
GluIle: 2.589 ± 0.067
GluLys: 1.302 ± 0.049
GluLeu: 6.498 ± 0.099
GluMet: 1.133 ± 0.041
GluAsn: 1.015 ± 0.04
GluPro: 3.142 ± 0.08
GluGln: 2.555 ± 0.068
GluArg: 4.951 ± 0.092
GluSer: 2.482 ± 0.06
GluThr: 2.766 ± 0.061
GluVal: 5.578 ± 0.089
GluTrp: 0.605 ± 0.032
GluTyr: 0.989 ± 0.038
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.999 ± 0.072
PheCys: 0.219 ± 0.016
PheAsp: 1.738 ± 0.053
PheGlu: 1.38 ± 0.044
PhePhe: 0.92 ± 0.038
PheGly: 2.412 ± 0.074
PheHis: 0.517 ± 0.025
PheIle: 1.06 ± 0.039
PheLys: 0.6 ± 0.033
PheLeu: 2.502 ± 0.067
PheMet: 0.509 ± 0.028
PheAsn: 0.703 ± 0.034
PhePro: 1.179 ± 0.045
PheGln: 0.695 ± 0.033
PheArg: 1.356 ± 0.047
PheSer: 1.678 ± 0.046
PheThr: 1.898 ± 0.052
PheVal: 2.187 ± 0.061
PheTrp: 0.372 ± 0.023
PheTyr: 0.687 ± 0.031
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 10.821 ± 0.14
GlyCys: 0.755 ± 0.034
GlyAsp: 4.193 ± 0.075
GlyGlu: 5.273 ± 0.087
GlyPhe: 2.485 ± 0.056
GlyGly: 7.354 ± 0.123
GlyHis: 2.024 ± 0.057
GlyIle: 3.545 ± 0.079
GlyLys: 2.307 ± 0.067
GlyLeu: 9.273 ± 0.125
GlyMet: 2.095 ± 0.052
GlyAsn: 1.587 ± 0.053
GlyPro: 3.885 ± 0.086
GlyGln: 3.657 ± 0.077
GlyArg: 6.612 ± 0.109
GlySer: 5.523 ± 0.093
GlyThr: 5.808 ± 0.106
GlyVal: 7.896 ± 0.124
GlyTrp: 1.702 ± 0.061
GlyTyr: 2.272 ± 0.065
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.534 ± 0.063
HisCys: 0.161 ± 0.012
HisAsp: 1.296 ± 0.036
HisGlu: 1.378 ± 0.041
HisPhe: 0.462 ± 0.027
HisGly: 2.084 ± 0.054
HisHis: 0.609 ± 0.029
HisIle: 0.612 ± 0.027
HisLys: 0.333 ± 0.021
HisLeu: 2.436 ± 0.057
HisMet: 0.431 ± 0.023
HisAsn: 0.431 ± 0.025
HisPro: 1.474 ± 0.042
HisGln: 0.628 ± 0.028
HisArg: 1.55 ± 0.048
HisSer: 0.968 ± 0.036
HisThr: 1.093 ± 0.033
HisVal: 1.747 ± 0.049
HisTrp: 0.327 ± 0.021
HisTyr: 0.47 ± 0.022
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.066 ± 0.09
IleCys: 0.314 ± 0.022
IleAsp: 2.78 ± 0.065
IleGlu: 2.545 ± 0.06
IlePhe: 0.929 ± 0.043
IleGly: 3.664 ± 0.086
IleHis: 0.765 ± 0.029
IleIle: 1.612 ± 0.061
IleLys: 0.863 ± 0.033
IleLeu: 3.172 ± 0.08
IleMet: 0.721 ± 0.03
IleAsn: 0.899 ± 0.042
IlePro: 1.911 ± 0.06
IleGln: 1.01 ± 0.036
IleArg: 2.039 ± 0.053
IleSer: 2.117 ± 0.057
IleThr: 2.456 ± 0.062
IleVal: 3.324 ± 0.075
IleTrp: 0.428 ± 0.025
IleTyr: 0.673 ± 0.033
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.917 ± 0.068
LysCys: 0.086 ± 0.01
LysAsp: 1.457 ± 0.048
LysGlu: 1.311 ± 0.049
LysPhe: 0.383 ± 0.024
LysGly: 1.697 ± 0.051
LysHis: 0.401 ± 0.023
LysIle: 0.964 ± 0.038
LysLys: 0.762 ± 0.034
LysLeu: 1.507 ± 0.052
LysMet: 0.419 ± 0.024
LysAsn: 0.532 ± 0.026
LysPro: 1.037 ± 0.04
LysGln: 0.656 ± 0.031
LysArg: 1.421 ± 0.059
LysSer: 0.976 ± 0.041
LysThr: 1.279 ± 0.051
LysVal: 1.959 ± 0.056
LysTrp: 0.196 ± 0.017
LysTyr: 0.45 ± 0.028
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 15.716 ± 0.2
LeuCys: 0.722 ± 0.029
LeuAsp: 6.569 ± 0.097
LeuGlu: 5.51 ± 0.099
LeuPhe: 2.315 ± 0.064
LeuGly: 9.282 ± 0.123
LeuHis: 1.887 ± 0.054
LeuIle: 3.78 ± 0.072
LeuLys: 1.782 ± 0.056
LeuLeu: 9.809 ± 0.173
LeuMet: 1.944 ± 0.052
LeuAsn: 1.65 ± 0.051
LeuPro: 5.88 ± 0.09
LeuGln: 2.322 ± 0.058
LeuArg: 7.423 ± 0.103
LeuSer: 6.107 ± 0.102
LeuThr: 7.294 ± 0.114
LeuVal: 9.391 ± 0.153
LeuTrp: 1.179 ± 0.047
LeuTyr: 1.66 ± 0.048
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.59 ± 0.063
MetCys: 0.146 ± 0.014
MetAsp: 1.028 ± 0.033
MetGlu: 0.998 ± 0.042
MetPhe: 0.461 ± 0.026
MetGly: 1.621 ± 0.043
MetHis: 0.347 ± 0.023
MetIle: 0.83 ± 0.031
MetLys: 0.458 ± 0.023
MetLeu: 2.013 ± 0.053
MetMet: 0.44 ± 0.028
MetAsn: 0.424 ± 0.023
MetPro: 1.269 ± 0.044
MetGln: 0.464 ± 0.025
MetArg: 1.578 ± 0.047
MetSer: 1.795 ± 0.049
MetThr: 1.864 ± 0.047
MetVal: 1.733 ± 0.051
MetTrp: 0.265 ± 0.019
MetTyr: 0.33 ± 0.021
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.218 ± 0.053
AsnCys: 0.108 ± 0.014
AsnAsp: 0.995 ± 0.037
AsnGlu: 0.959 ± 0.038
AsnPhe: 0.479 ± 0.025
AsnGly: 1.667 ± 0.069
AsnHis: 0.383 ± 0.021
AsnIle: 0.748 ± 0.034
AsnLys: 0.427 ± 0.029
AsnLeu: 1.974 ± 0.056
AsnMet: 0.329 ± 0.021
AsnAsn: 0.455 ± 0.03
AsnPro: 1.456 ± 0.045
AsnGln: 0.595 ± 0.03
AsnArg: 1.101 ± 0.038
AsnSer: 0.812 ± 0.03
AsnThr: 1.041 ± 0.045
AsnVal: 1.475 ± 0.051
AsnTrp: 0.25 ± 0.02
AsnTyr: 0.457 ± 0.024
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 7.879 ± 0.124
ProCys: 0.31 ± 0.024
ProAsp: 3.311 ± 0.075
ProGlu: 3.789 ± 0.072
ProPhe: 1.429 ± 0.042
ProGly: 5.466 ± 0.113
ProHis: 1.079 ± 0.035
ProIle: 1.551 ± 0.045
ProLys: 0.898 ± 0.034
ProLeu: 4.689 ± 0.087
ProMet: 1.014 ± 0.035
ProAsn: 0.814 ± 0.034
ProPro: 1.982 ± 0.065
ProGln: 2.221 ± 0.049
ProArg: 3.396 ± 0.083
ProSer: 3.664 ± 0.077
ProThr: 3.549 ± 0.077
ProVal: 5.095 ± 0.078
ProTrp: 0.89 ± 0.035
ProTyr: 1.112 ± 0.039
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 6.211 ± 0.102
GlnCys: 0.204 ± 0.014
GlnAsp: 2.027 ± 0.058
GlnGlu: 2.217 ± 0.059
GlnPhe: 0.64 ± 0.026
GlnGly: 2.779 ± 0.058
GlnHis: 0.597 ± 0.028
GlnIle: 1.43 ± 0.049
GlnLys: 0.567 ± 0.029
GlnLeu: 3.034 ± 0.068
GlnMet: 0.801 ± 0.035
GlnAsn: 0.454 ± 0.023
GlnPro: 1.849 ± 0.058
GlnGln: 1.107 ± 0.039
GlnArg: 2.533 ± 0.055
GlnSer: 1.426 ± 0.043
GlnThr: 1.81 ± 0.049
GlnVal: 3.836 ± 0.081
GlnTrp: 0.614 ± 0.03
GlnTyr: 0.532 ± 0.024
GlnXaa: 0.001 ± 0.001

Arg
ArgAla: 8.596 ± 0.132
ArgCys: 0.533 ± 0.027
ArgAsp: 3.372 ± 0.068
ArgGlu: 4.699 ± 0.086
ArgPhe: 1.89 ± 0.052
ArgGly: 5.286 ± 0.101
ArgHis: 1.737 ± 0.051
ArgIle: 2.611 ± 0.063
ArgLys: 1.373 ± 0.051
ArgLeu: 7.819 ± 0.107
ArgMet: 1.645 ± 0.05
ArgAsn: 1.069 ± 0.036
ArgPro: 3.915 ± 0.091
ArgGln: 2.858 ± 0.074
ArgArg: 6.978 ± 0.135
ArgSer: 4.507 ± 0.078
ArgThr: 4.268 ± 0.075
ArgVal: 5.487 ± 0.085
ArgTrp: 1.252 ± 0.043
ArgTyr: 1.643 ± 0.048
ArgXaa: 0.003 ± 0.002

Ser
SerAla: 7.264 ± 0.111
SerCys: 0.5 ± 0.025
SerAsp: 2.813 ± 0.071
SerGlu: 2.922 ± 0.07
SerPhe: 1.937 ± 0.049
SerGly: 5.82 ± 0.107
SerHis: 1.304 ± 0.042
SerIle: 1.959 ± 0.054
SerLys: 1.128 ± 0.042
SerLeu: 5.985 ± 0.102
SerMet: 1.27 ± 0.042
SerAsn: 1.015 ± 0.033
SerPro: 3.448 ± 0.075
SerGln: 2.589 ± 0.061
SerArg: 4.027 ± 0.082
SerSer: 4.122 ± 0.092
SerThr: 3.911 ± 0.085
SerVal: 4.716 ± 0.086
SerTrp: 1.189 ± 0.048
SerTyr: 1.427 ± 0.045
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 7.795 ± 0.11
ThrCys: 0.497 ± 0.029
ThrAsp: 3.476 ± 0.073
ThrGlu: 2.982 ± 0.07
ThrPhe: 1.795 ± 0.063
ThrGly: 5.937 ± 0.099
ThrHis: 1.285 ± 0.04
ThrIle: 2.587 ± 0.069
ThrLys: 1.253 ± 0.05
ThrLeu: 6.096 ± 0.092
ThrMet: 1.18 ± 0.042
ThrAsn: 1.193 ± 0.043
ThrPro: 3.741 ± 0.079
ThrGln: 2.149 ± 0.054
ThrArg: 3.819 ± 0.073
ThrSer: 4.019 ± 0.083
ThrThr: 4.398 ± 0.092
ThrVal: 5.756 ± 0.094
ThrTrp: 1.201 ± 0.041
ThrTyr: 1.414 ± 0.045
ThrXaa: 0.0 ± 0.0

Val
ValAla: 11.491 ± 0.16
ValCys: 0.785 ± 0.038
ValAsp: 5.183 ± 0.101
ValGlu: 5.048 ± 0.077
ValPhe: 2.355 ± 0.056
ValGly: 6.772 ± 0.117
ValHis: 1.654 ± 0.046
ValIle: 3.95 ± 0.081
ValLys: 1.63 ± 0.058
ValLeu: 9.876 ± 0.151
ValMet: 1.777 ± 0.056
ValAsn: 1.646 ± 0.05
ValPro: 5.156 ± 0.096
ValGln: 2.183 ± 0.048
ValArg: 6.233 ± 0.092
ValSer: 5.567 ± 0.09
ValThr: 6.063 ± 0.099
ValVal: 9.028 ± 0.141
ValTrp: 1.187 ± 0.044
ValTyr: 1.66 ± 0.051
ValXaa: 0.001 ± 0.001

Trp
TrpAla: 1.685 ± 0.051
TrpCys: 0.186 ± 0.016
TrpAsp: 0.925 ± 0.034
TrpGlu: 0.761 ± 0.035
TrpPhe: 0.488 ± 0.025
TrpGly: 1.109 ± 0.04
TrpHis: 0.358 ± 0.023
TrpIle: 0.628 ± 0.033
TrpLys: 0.343 ± 0.025
TrpLeu: 1.65 ± 0.054
TrpMet: 0.39 ± 0.024
TrpAsn: 0.425 ± 0.028
TrpPro: 0.79 ± 0.035
TrpGln: 0.677 ± 0.031
TrpArg: 1.241 ± 0.047
TrpSer: 0.994 ± 0.039
TrpThr: 0.967 ± 0.038
TrpVal: 1.189 ± 0.042
TrpTrp: 0.379 ± 0.024
TrpTyr: 0.342 ± 0.032
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.451 ± 0.066
TyrCys: 0.178 ± 0.015
TyrAsp: 1.282 ± 0.044
TyrGlu: 1.284 ± 0.04
TyrPhe: 0.635 ± 0.032
TyrGly: 1.846 ± 0.045
TyrHis: 0.389 ± 0.025
TyrIle: 0.643 ± 0.029
TyrLys: 0.416 ± 0.025
TyrLeu: 2.397 ± 0.057
TyrMet: 0.34 ± 0.023
TyrAsn: 0.481 ± 0.023
TyrPro: 1.093 ± 0.037
TyrGln: 0.764 ± 0.033
TyrArg: 1.503 ± 0.048
TyrSer: 1.149 ± 0.039
TyrThr: 1.265 ± 0.04
TyrVal: 1.645 ± 0.047
TyrTrp: 0.315 ± 0.024
TyrTyr: 0.505 ± 0.04
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.001 ± 0.001
XaaAsn: 0.0 ± 0.0
XaaPro: 0.001 ± 0.001
XaaGln: 0.0 ± 0.0
XaaArg: 0.001 ± 0.001
XaaSer: 0.0 ± 0.0
XaaThr: 0.001 ± 0.001
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 2394 proteins (768598 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
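
One way to picture a protein-level bootstrap is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of those estimates is reported. This is an illustration under assumptions, not the page's actual procedure. In particular, the use of the standard deviation as the error measure, the function names, the fixed seed, and the one-letter residue codes are all choices made for the example.

```python
import random
from collections import Counter
from statistics import stdev

def pair_counts(seq):
    """Count overlapping dipeptides in a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Spread (standard deviation, in per mille) of one dipeptide's
    frequency across bootstrap replicates drawn at the protein level."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)

# Toy input (hypothetical, not taken from the proteome)
toy_error = bootstrap_error(["AAAA", "GGGG", "AGAG"], "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.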

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski