Amino acid dipepetide frequency for Parasponia andersonii (Sponia andersonii)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.126AlaAla: 6.126 ± 0.036
1.218AlaCys: 1.218 ± 0.012
2.931AlaAsp: 2.931 ± 0.018
4.152AlaGlu: 4.152 ± 0.026
2.762AlaPhe: 2.762 ± 0.018
4.002AlaGly: 4.002 ± 0.02
1.347AlaHis: 1.347 ± 0.011
3.72AlaIle: 3.72 ± 0.02
3.791AlaLys: 3.791 ± 0.02
6.62AlaLeu: 6.62 ± 0.029
1.674AlaMet: 1.674 ± 0.013
2.527AlaAsn: 2.527 ± 0.017
2.77AlaPro: 2.77 ± 0.021
2.099AlaGln: 2.099 ± 0.016
3.398AlaArg: 3.398 ± 0.018
5.974AlaSer: 5.974 ± 0.026
3.536AlaThr: 3.536 ± 0.019
4.696AlaVal: 4.696 ± 0.025
0.778AlaTrp: 0.778 ± 0.009
1.737AlaTyr: 1.737 ± 0.012
0.002AlaXaa: 0.002 ± 0.0
Cys
0.948CysAla: 0.948 ± 0.01
0.519CysCys: 0.519 ± 0.008
0.841CysAsp: 0.841 ± 0.009
0.864CysGlu: 0.864 ± 0.009
0.93CysPhe: 0.93 ± 0.01
1.37CysGly: 1.37 ± 0.015
0.498CysHis: 0.498 ± 0.007
0.948CysIle: 0.948 ± 0.01
1.072CysLys: 1.072 ± 0.012
1.934CysLeu: 1.934 ± 0.015
0.422CysMet: 0.422 ± 0.007
0.817CysAsn: 0.817 ± 0.01
0.946CysPro: 0.946 ± 0.011
0.622CysGln: 0.622 ± 0.007
1.093CysArg: 1.093 ± 0.012
1.835CysSer: 1.835 ± 0.014
0.857CysThr: 0.857 ± 0.01
1.059CysVal: 1.059 ± 0.01
0.264CysTrp: 0.264 ± 0.005
0.526CysTyr: 0.526 ± 0.006
0.001CysXaa: 0.001 ± 0.0
Asp
3.14AspAla: 3.14 ± 0.019
0.941AspCys: 0.941 ± 0.01
3.599AspAsp: 3.599 ± 0.027
3.902AspGlu: 3.902 ± 0.021
2.371AspPhe: 2.371 ± 0.017
3.655AspGly: 3.655 ± 0.019
1.423AspHis: 1.423 ± 0.011
2.857AspIle: 2.857 ± 0.018
2.588AspLys: 2.588 ± 0.016
5.109AspLeu: 5.109 ± 0.025
1.221AspMet: 1.221 ± 0.01
2.064AspAsn: 2.064 ± 0.014
2.592AspPro: 2.592 ± 0.017
1.878AspGln: 1.878 ± 0.013
2.497AspArg: 2.497 ± 0.019
4.23AspSer: 4.23 ± 0.022
2.068AspThr: 2.068 ± 0.013
3.53AspVal: 3.53 ± 0.019
0.702AspTrp: 0.702 ± 0.007
1.531AspTyr: 1.531 ± 0.013
0.001AspXaa: 0.001 ± 0.0
Glu
4.719GluAla: 4.719 ± 0.027
0.875GluCys: 0.875 ± 0.01
3.845GluAsp: 3.845 ± 0.024
6.144GluGlu: 6.144 ± 0.044
2.399GluPhe: 2.399 ± 0.015
3.669GluGly: 3.669 ± 0.017
1.212GluHis: 1.212 ± 0.012
3.743GluIle: 3.743 ± 0.02
4.678GluLys: 4.678 ± 0.025
6.069GluLeu: 6.069 ± 0.031
1.667GluMet: 1.667 ± 0.014
3.135GluAsn: 3.135 ± 0.019
2.16GluPro: 2.16 ± 0.014
2.024GluGln: 2.024 ± 0.016
3.517GluArg: 3.517 ± 0.021
4.626GluSer: 4.626 ± 0.026
3.076GluThr: 3.076 ± 0.018
4.178GluVal: 4.178 ± 0.02
0.727GluTrp: 0.727 ± 0.008
1.614GluTyr: 1.614 ± 0.013
0.001GluXaa: 0.001 ± 0.0
Phe
2.469PheAla: 2.469 ± 0.016
0.945PheCys: 0.945 ± 0.01
2.394PheAsp: 2.394 ± 0.016
2.405PheGlu: 2.405 ± 0.014
2.173PhePhe: 2.173 ± 0.019
3.208PheGly: 3.208 ± 0.021
1.144PheHis: 1.144 ± 0.011
2.085PheIle: 2.085 ± 0.014
2.124PheLys: 2.124 ± 0.013
4.501PheLeu: 4.501 ± 0.024
0.977PheMet: 0.977 ± 0.01
1.762PheAsn: 1.762 ± 0.015
2.13PhePro: 2.13 ± 0.014
1.564PheGln: 1.564 ± 0.012
2.191PheArg: 2.191 ± 0.015
4.31PheSer: 4.31 ± 0.022
1.969PheThr: 1.969 ± 0.014
2.838PheVal: 2.838 ± 0.017
0.623PheTrp: 0.623 ± 0.008
1.279PheTyr: 1.279 ± 0.012
0.001PheXaa: 0.001 ± 0.0
Gly
3.846GlyAla: 3.846 ± 0.02
1.279GlyCys: 1.279 ± 0.012
3.338GlyAsp: 3.338 ± 0.018
3.711GlyGlu: 3.711 ± 0.02
3.267GlyPhe: 3.267 ± 0.019
5.498GlyGly: 5.498 ± 0.049
1.567GlyHis: 1.567 ± 0.013
3.552GlyIle: 3.552 ± 0.019
3.94GlyLys: 3.94 ± 0.02
6.197GlyLeu: 6.197 ± 0.023
1.388GlyMet: 1.388 ± 0.013
2.992GlyAsn: 2.992 ± 0.019
2.583GlyPro: 2.583 ± 0.019
2.093GlyGln: 2.093 ± 0.013
3.725GlyArg: 3.725 ± 0.022
6.037GlySer: 6.037 ± 0.026
3.211GlyThr: 3.211 ± 0.016
4.273GlyVal: 4.273 ± 0.022
0.907GlyTrp: 0.907 ± 0.009
1.971GlyTyr: 1.971 ± 0.016
0.002GlyXaa: 0.002 ± 0.0
His
1.332HisAla: 1.332 ± 0.011
0.54HisCys: 0.54 ± 0.008
1.234HisAsp: 1.234 ± 0.012
1.262HisGlu: 1.262 ± 0.011
1.124HisPhe: 1.124 ± 0.011
1.746HisGly: 1.746 ± 0.014
1.127HisHis: 1.127 ± 0.016
1.209HisIle: 1.209 ± 0.011
1.169HisLys: 1.169 ± 0.011
2.477HisLeu: 2.477 ± 0.018
0.536HisMet: 0.536 ± 0.007
1.009HisAsn: 1.009 ± 0.011
1.309HisPro: 1.309 ± 0.011
1.029HisGln: 1.029 ± 0.01
1.43HisArg: 1.43 ± 0.011
1.975HisSer: 1.975 ± 0.014
0.957HisThr: 0.957 ± 0.009
1.579HisVal: 1.579 ± 0.012
0.311HisTrp: 0.311 ± 0.006
0.737HisTyr: 0.737 ± 0.009
0.001HisXaa: 0.001 ± 0.0
Ile
3.465IleAla: 3.465 ± 0.02
1.053IleCys: 1.053 ± 0.009
2.84IleAsp: 2.84 ± 0.017
3.175IleGlu: 3.175 ± 0.019
2.388IlePhe: 2.388 ± 0.015
3.409IleGly: 3.409 ± 0.019
1.281IleHis: 1.281 ± 0.012
2.829IleIle: 2.829 ± 0.02
2.899IleLys: 2.899 ± 0.019
5.293IleLeu: 5.293 ± 0.026
1.145IleMet: 1.145 ± 0.011
2.2IleAsn: 2.2 ± 0.015
2.924IlePro: 2.924 ± 0.022
1.911IleGln: 1.911 ± 0.014
2.755IleArg: 2.755 ± 0.017
4.958IleSer: 4.958 ± 0.024
2.631IleThr: 2.631 ± 0.014
3.447IleVal: 3.447 ± 0.017
0.731IleTrp: 0.731 ± 0.01
1.495IleTyr: 1.495 ± 0.013
0.002IleXaa: 0.002 ± 0.0
Lys
3.958LysAla: 3.958 ± 0.021
0.963LysCys: 0.963 ± 0.011
3.13LysAsp: 3.13 ± 0.02
4.517LysGlu: 4.517 ± 0.029
2.163LysPhe: 2.163 ± 0.013
3.518LysGly: 3.518 ± 0.02
1.263LysHis: 1.263 ± 0.012
3.228LysIle: 3.228 ± 0.021
4.629LysLys: 4.629 ± 0.032
6.018LysLeu: 6.018 ± 0.028
1.462LysMet: 1.462 ± 0.013
2.675LysAsn: 2.675 ± 0.017
2.729LysPro: 2.729 ± 0.02
2.12LysGln: 2.12 ± 0.015
3.71LysArg: 3.71 ± 0.022
4.627LysSer: 4.627 ± 0.024
2.917LysThr: 2.917 ± 0.02
3.771LysVal: 3.771 ± 0.023
0.799LysTrp: 0.799 ± 0.008
1.566LysTyr: 1.566 ± 0.012
0.001LysXaa: 0.001 ± 0.0
Leu
6.565LeuAla: 6.565 ± 0.028
1.874LeuCys: 1.874 ± 0.013
5.118LeuAsp: 5.118 ± 0.025
6.447LeuGlu: 6.447 ± 0.031
4.06LeuPhe: 4.06 ± 0.023
6.109LeuGly: 6.109 ± 0.026
2.529LeuHis: 2.529 ± 0.016
4.813LeuIle: 4.813 ± 0.024
6.14LeuLys: 6.14 ± 0.028
10.198LeuLeu: 10.198 ± 0.047
2.185LeuMet: 2.185 ± 0.014
4.025LeuAsn: 4.025 ± 0.02
5.189LeuPro: 5.189 ± 0.026
4.066LeuGln: 4.066 ± 0.024
5.797LeuArg: 5.797 ± 0.026
8.966LeuSer: 8.966 ± 0.044
4.6LeuThr: 4.6 ± 0.022
6.757LeuVal: 6.757 ± 0.028
1.204LeuTrp: 1.204 ± 0.012
2.55LeuTyr: 2.55 ± 0.017
0.002LeuXaa: 0.002 ± 0.0
Met
2.193MetAla: 2.193 ± 0.015
0.307MetCys: 0.307 ± 0.006
1.296MetAsp: 1.296 ± 0.011
1.897MetGlu: 1.897 ± 0.014
0.787MetPhe: 0.787 ± 0.009
1.615MetGly: 1.615 ± 0.013
0.476MetHis: 0.476 ± 0.007
1.187MetIle: 1.187 ± 0.01
1.539MetLys: 1.539 ± 0.013
2.071MetLeu: 2.071 ± 0.013
0.634MetMet: 0.634 ± 0.009
0.954MetAsn: 0.954 ± 0.01
0.994MetPro: 0.994 ± 0.009
0.807MetGln: 0.807 ± 0.009
1.216MetArg: 1.216 ± 0.012
1.75MetSer: 1.75 ± 0.014
1.058MetThr: 1.058 ± 0.01
1.704MetVal: 1.704 ± 0.012
0.264MetTrp: 0.264 ± 0.005
0.562MetTyr: 0.562 ± 0.007
0.001MetXaa: 0.001 ± 0.0
Asn
2.514AsnAla: 2.514 ± 0.015
0.848AsnCys: 0.848 ± 0.009
2.179AsnAsp: 2.179 ± 0.016
2.488AsnGlu: 2.488 ± 0.018
1.982AsnPhe: 1.982 ± 0.014
3.259AsnGly: 3.259 ± 0.018
1.114AsnHis: 1.114 ± 0.01
2.471AsnIle: 2.471 ± 0.019
2.398AsnLys: 2.398 ± 0.016
4.677AsnLeu: 4.677 ± 0.031
1.068AsnMet: 1.068 ± 0.011
2.412AsnAsn: 2.412 ± 0.022
2.395AsnPro: 2.395 ± 0.016
1.68AsnGln: 1.68 ± 0.012
2.122AsnArg: 2.122 ± 0.015
4.029AsnSer: 4.029 ± 0.023
1.971AsnThr: 1.971 ± 0.015
2.727AsnVal: 2.727 ± 0.016
0.59AsnTrp: 0.59 ± 0.007
1.303AsnTyr: 1.303 ± 0.012
0.0AsnXaa: 0.0 ± 0.0
Pro
2.943ProAla: 2.943 ± 0.022
0.787ProCys: 0.787 ± 0.009
2.436ProAsp: 2.436 ± 0.016
3.094ProGlu: 3.094 ± 0.017
2.041ProPhe: 2.041 ± 0.015
2.634ProGly: 2.634 ± 0.018
1.154ProHis: 1.154 ± 0.012
2.426ProIle: 2.426 ± 0.014
2.792ProLys: 2.792 ± 0.018
4.517ProLeu: 4.517 ± 0.023
0.944ProMet: 0.944 ± 0.011
2.382ProAsn: 2.382 ± 0.017
3.907ProPro: 3.907 ± 0.047
1.77ProGln: 1.77 ± 0.014
2.556ProArg: 2.556 ± 0.017
5.182ProSer: 5.182 ± 0.026
2.697ProThr: 2.697 ± 0.019
3.043ProVal: 3.043 ± 0.019
0.624ProTrp: 0.624 ± 0.007
1.296ProTyr: 1.296 ± 0.012
0.001ProXaa: 0.001 ± 0.0
Gln
2.35GlnAla: 2.35 ± 0.018
0.563GlnCys: 0.563 ± 0.008
1.599GlnAsp: 1.599 ± 0.013
2.257GlnGlu: 2.257 ± 0.015
1.36GlnPhe: 1.36 ± 0.011
2.015GlnGly: 2.015 ± 0.014
0.873GlnHis: 0.873 ± 0.01
1.977GlnIle: 1.977 ± 0.013
2.229GlnLys: 2.229 ± 0.015
3.585GlnLeu: 3.585 ± 0.02
0.909GlnMet: 0.909 ± 0.009
1.818GlnAsn: 1.818 ± 0.015
1.755GlnPro: 1.755 ± 0.015
1.856GlnGln: 1.856 ± 0.027
2.133GlnArg: 2.133 ± 0.015
2.851GlnSer: 2.851 ± 0.018
1.75GlnThr: 1.75 ± 0.012
2.352GlnVal: 2.352 ± 0.015
0.463GlnTrp: 0.463 ± 0.007
0.931GlnTyr: 0.931 ± 0.01
0.0GlnXaa: 0.0 ± 0.0
Arg
3.349ArgAla: 3.349 ± 0.018
0.996ArgCys: 0.996 ± 0.011
2.741ArgAsp: 2.741 ± 0.017
3.513ArgGlu: 3.513 ± 0.022
2.331ArgPhe: 2.331 ± 0.016
3.359ArgGly: 3.359 ± 0.022
1.385ArgHis: 1.385 ± 0.011
2.929ArgIle: 2.929 ± 0.02
3.788ArgLys: 3.788 ± 0.02
5.333ArgLeu: 5.333 ± 0.02
1.333ArgMet: 1.333 ± 0.012
2.549ArgAsn: 2.549 ± 0.017
2.547ArgPro: 2.547 ± 0.016
1.869ArgGln: 1.869 ± 0.013
4.387ArgArg: 4.387 ± 0.025
4.605ArgSer: 4.605 ± 0.025
2.635ArgThr: 2.635 ± 0.017
3.556ArgVal: 3.556 ± 0.02
0.776ArgTrp: 0.776 ± 0.009
1.434ArgTyr: 1.434 ± 0.012
0.001ArgXaa: 0.001 ± 0.0
Ser
5.333SerAla: 5.333 ± 0.027
1.721SerCys: 1.721 ± 0.014
4.358SerAsp: 4.358 ± 0.022
4.845SerGlu: 4.845 ± 0.025
4.223SerPhe: 4.223 ± 0.021
6.028SerGly: 6.028 ± 0.026
2.097SerHis: 2.097 ± 0.015
4.55SerIle: 4.55 ± 0.022
4.95SerLys: 4.95 ± 0.024
9.141SerLeu: 9.141 ± 0.036
2.041SerMet: 2.041 ± 0.016
4.165SerAsn: 4.165 ± 0.021
4.666SerPro: 4.666 ± 0.032
3.046SerGln: 3.046 ± 0.017
4.675SerArg: 4.675 ± 0.024
11.71SerSer: 11.71 ± 0.05
4.997SerThr: 4.997 ± 0.024
5.261SerVal: 5.261 ± 0.024
1.238SerTrp: 1.238 ± 0.012
2.366SerTyr: 2.366 ± 0.015
0.002SerXaa: 0.002 ± 0.0
Thr
3.295ThrAla: 3.295 ± 0.019
0.938ThrCys: 0.938 ± 0.01
2.18ThrAsp: 2.18 ± 0.016
2.712ThrGlu: 2.712 ± 0.02
2.048ThrPhe: 2.048 ± 0.014
3.175ThrGly: 3.175 ± 0.019
1.06ThrHis: 1.06 ± 0.01
2.792ThrIle: 2.792 ± 0.017
2.797ThrLys: 2.797 ± 0.018
4.803ThrLeu: 4.803 ± 0.021
1.176ThrMet: 1.176 ± 0.011
2.144ThrAsn: 2.144 ± 0.014
2.607ThrPro: 2.607 ± 0.02
1.588ThrGln: 1.588 ± 0.012
2.587ThrArg: 2.587 ± 0.015
4.821ThrSer: 4.821 ± 0.025
3.432ThrThr: 3.432 ± 0.023
3.268ThrVal: 3.268 ± 0.017
0.678ThrTrp: 0.678 ± 0.008
1.349ThrTyr: 1.349 ± 0.013
0.001ThrXaa: 0.001 ± 0.0
Val
4.81ValAla: 4.81 ± 0.025
1.123ValCys: 1.123 ± 0.012
3.676ValAsp: 3.676 ± 0.02
4.426ValGlu: 4.426 ± 0.021
2.822ValPhe: 2.822 ± 0.019
4.283ValGly: 4.283 ± 0.021
1.535ValHis: 1.535 ± 0.011
3.382ValIle: 3.382 ± 0.022
3.805ValLys: 3.805 ± 0.02
6.575ValLeu: 6.575 ± 0.025
1.491ValMet: 1.491 ± 0.011
2.565ValAsn: 2.565 ± 0.014
3.242ValPro: 3.242 ± 0.018
2.229ValGln: 2.229 ± 0.014
3.289ValArg: 3.289 ± 0.019
5.563ValSer: 5.563 ± 0.022
3.195ValThr: 3.195 ± 0.018
5.349ValVal: 5.349 ± 0.027
0.795ValTrp: 0.795 ± 0.009
1.882ValTyr: 1.882 ± 0.014
0.001ValXaa: 0.001 ± 0.0
Trp
0.795TrpAla: 0.795 ± 0.009
0.252TrpCys: 0.252 ± 0.005
0.688TrpAsp: 0.688 ± 0.009
0.752TrpGlu: 0.752 ± 0.009
0.559TrpPhe: 0.559 ± 0.007
0.756TrpGly: 0.756 ± 0.009
0.294TrpHis: 0.294 ± 0.005
0.708TrpIle: 0.708 ± 0.009
0.925TrpLys: 0.925 ± 0.009
1.291TrpLeu: 1.291 ± 0.01
0.336TrpMet: 0.336 ± 0.006
0.717TrpAsn: 0.717 ± 0.009
0.543TrpPro: 0.543 ± 0.006
0.453TrpGln: 0.453 ± 0.007
0.896TrpArg: 0.896 ± 0.009
1.035TrpSer: 1.035 ± 0.01
0.662TrpThr: 0.662 ± 0.007
0.873TrpVal: 0.873 ± 0.008
0.264TrpTrp: 0.264 ± 0.005
0.353TrpTyr: 0.353 ± 0.006
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.708TyrAla: 1.708 ± 0.011
0.624TyrCys: 0.624 ± 0.008
1.474TyrAsp: 1.474 ± 0.013
1.515TyrGlu: 1.515 ± 0.012
1.303TyrPhe: 1.303 ± 0.012
2.068TyrGly: 2.068 ± 0.016
0.712TyrHis: 0.712 ± 0.008
1.421TyrIle: 1.421 ± 0.013
1.51TyrLys: 1.51 ± 0.014
2.785TyrLeu: 2.785 ± 0.019
0.702TyrMet: 0.702 ± 0.008
1.311TyrAsn: 1.311 ± 0.013
1.238TyrPro: 1.238 ± 0.012
0.921TyrGln: 0.921 ± 0.009
1.441TyrArg: 1.441 ± 0.012
2.326TyrSer: 2.326 ± 0.016
1.235TyrThr: 1.235 ± 0.012
1.776TyrVal: 1.776 ± 0.015
0.414TyrTrp: 0.414 ± 0.007
0.993TyrTyr: 0.993 ± 0.012
0.001TyrXaa: 0.001 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.001XaaAsp: 0.001 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.002XaaGly: 0.002 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.002XaaIle: 0.002 ± 0.0
0.001XaaLys: 0.001 ± 0.0
0.002XaaLeu: 0.002 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.002XaaAsn: 0.002 ± 0.0
0.001XaaPro: 0.001 ± 0.0
0.001XaaGln: 0.001 ± 0.0
0.003XaaArg: 0.003 ± 0.001
0.001XaaSer: 0.001 ± 0.0
0.001XaaThr: 0.001 ± 0.0
0.002XaaVal: 0.002 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.001XaaTyr: 0.001 ± 0.0
0.083XaaXaa: 0.083 ± 0.025
Statistics based on 37181 proteins (11013480 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski