Amino acid dipepetide frequency for Arsenophonus nasoniae (son-killer infecting Nasonia vitripennis)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.344AlaAla: 6.344 ± 0.103
1.01AlaCys: 1.01 ± 0.028
3.965AlaAsp: 3.965 ± 0.069
5.025AlaGlu: 5.025 ± 0.076
2.794AlaPhe: 2.794 ± 0.048
4.971AlaGly: 4.971 ± 0.084
1.455AlaHis: 1.455 ± 0.035
6.297AlaIle: 6.297 ± 0.09
5.58AlaLys: 5.58 ± 0.071
8.237AlaLeu: 8.237 ± 0.098
2.168AlaMet: 2.168 ± 0.035
3.923AlaAsn: 3.923 ± 0.077
2.177AlaPro: 2.177 ± 0.045
3.434AlaGln: 3.434 ± 0.07
3.539AlaArg: 3.539 ± 0.062
4.291AlaSer: 4.291 ± 0.06
4.242AlaThr: 4.242 ± 0.088
4.637AlaVal: 4.637 ± 0.072
0.88AlaTrp: 0.88 ± 0.032
2.264AlaTyr: 2.264 ± 0.052
0.0AlaXaa: 0.0 ± 0.0
Cys
0.864CysAla: 0.864 ± 0.029
0.262CysCys: 0.262 ± 0.017
0.662CysAsp: 0.662 ± 0.027
0.562CysGlu: 0.562 ± 0.025
0.549CysPhe: 0.549 ± 0.022
0.925CysGly: 0.925 ± 0.032
0.372CysHis: 0.372 ± 0.018
0.705CysIle: 0.705 ± 0.03
0.559CysLys: 0.559 ± 0.019
1.109CysLeu: 1.109 ± 0.031
0.236CysMet: 0.236 ± 0.013
0.427CysAsn: 0.427 ± 0.018
0.428CysPro: 0.428 ± 0.021
0.759CysGln: 0.759 ± 0.029
0.584CysArg: 0.584 ± 0.024
0.812CysSer: 0.812 ± 0.031
0.431CysThr: 0.431 ± 0.021
0.597CysVal: 0.597 ± 0.023
0.165CysTrp: 0.165 ± 0.012
0.458CysTyr: 0.458 ± 0.02
0.0CysXaa: 0.0 ± 0.0
Asp
3.448AspAla: 3.448 ± 0.055
0.673AspCys: 0.673 ± 0.027
2.646AspAsp: 2.646 ± 0.05
3.442AspGlu: 3.442 ± 0.055
2.402AspPhe: 2.402 ± 0.048
3.158AspGly: 3.158 ± 0.058
0.951AspHis: 0.951 ± 0.03
4.339AspIle: 4.339 ± 0.06
3.672AspLys: 3.672 ± 0.064
4.375AspLeu: 4.375 ± 0.067
1.209AspMet: 1.209 ± 0.034
3.095AspAsn: 3.095 ± 0.06
1.937AspPro: 1.937 ± 0.039
1.53AspGln: 1.53 ± 0.038
2.119AspArg: 2.119 ± 0.045
2.834AspSer: 2.834 ± 0.054
2.413AspThr: 2.413 ± 0.047
2.939AspVal: 2.939 ± 0.048
0.846AspTrp: 0.846 ± 0.028
2.091AspTyr: 2.091 ± 0.045
0.0AspXaa: 0.0 ± 0.0
Glu
4.108GluAla: 4.108 ± 0.073
0.47GluCys: 0.47 ± 0.018
2.043GluAsp: 2.043 ± 0.044
3.206GluGlu: 3.206 ± 0.064
2.063GluPhe: 2.063 ± 0.045
3.004GluGly: 3.004 ± 0.061
1.265GluHis: 1.265 ± 0.035
4.907GluIle: 4.907 ± 0.078
4.979GluLys: 4.979 ± 0.069
5.954GluLeu: 5.954 ± 0.086
1.666GluMet: 1.666 ± 0.036
3.324GluAsn: 3.324 ± 0.059
1.616GluPro: 1.616 ± 0.035
3.466GluGln: 3.466 ± 0.063
3.341GluArg: 3.341 ± 0.066
2.976GluSer: 2.976 ± 0.054
2.772GluThr: 2.772 ± 0.048
3.337GluVal: 3.337 ± 0.057
0.69GluTrp: 0.69 ± 0.024
1.73GluTyr: 1.73 ± 0.039
0.0GluXaa: 0.0 ± 0.0
Phe
3.096PheAla: 3.096 ± 0.047
0.693PheCys: 0.693 ± 0.025
2.562PheAsp: 2.562 ± 0.053
2.05PheGlu: 2.05 ± 0.042
2.161PhePhe: 2.161 ± 0.051
2.628PheGly: 2.628 ± 0.048
0.804PheHis: 0.804 ± 0.024
3.405PheIle: 3.405 ± 0.065
2.346PheLys: 2.346 ± 0.044
3.64PheLeu: 3.64 ± 0.061
1.025PheMet: 1.025 ± 0.027
2.484PheAsn: 2.484 ± 0.043
1.423PhePro: 1.423 ± 0.036
1.237PheGln: 1.237 ± 0.033
1.648PheArg: 1.648 ± 0.042
3.405PheSer: 3.405 ± 0.054
2.494PheThr: 2.494 ± 0.047
2.212PheVal: 2.212 ± 0.045
0.523PheTrp: 0.523 ± 0.023
1.577PheTyr: 1.577 ± 0.036
0.0PheXaa: 0.0 ± 0.0
Gly
4.275GlyAla: 4.275 ± 0.072
0.823GlyCys: 0.823 ± 0.027
2.956GlyAsp: 2.956 ± 0.054
3.6GlyGlu: 3.6 ± 0.057
3.0GlyPhe: 3.0 ± 0.063
4.09GlyGly: 4.09 ± 0.076
1.337GlyHis: 1.337 ± 0.034
5.118GlyIle: 5.118 ± 0.068
4.598GlyLys: 4.598 ± 0.089
6.078GlyLeu: 6.078 ± 0.073
1.67GlyMet: 1.67 ± 0.039
3.0GlyAsn: 3.0 ± 0.07
1.189GlyPro: 1.189 ± 0.037
2.676GlyGln: 2.676 ± 0.057
2.921GlyArg: 2.921 ± 0.058
3.464GlySer: 3.464 ± 0.06
2.878GlyThr: 2.878 ± 0.058
3.945GlyVal: 3.945 ± 0.065
0.904GlyTrp: 0.904 ± 0.03
2.497GlyTyr: 2.497 ± 0.047
0.0GlyXaa: 0.0 ± 0.0
His
1.463HisAla: 1.463 ± 0.034
0.365HisCys: 0.365 ± 0.017
1.022HisAsp: 1.022 ± 0.03
1.068HisGlu: 1.068 ± 0.028
1.145HisPhe: 1.145 ± 0.032
1.308HisGly: 1.308 ± 0.039
0.864HisHis: 0.864 ± 0.025
1.779HisIle: 1.779 ± 0.047
1.076HisLys: 1.076 ± 0.027
2.348HisLeu: 2.348 ± 0.05
0.431HisMet: 0.431 ± 0.02
1.197HisAsn: 1.197 ± 0.033
1.033HisPro: 1.033 ± 0.031
1.264HisGln: 1.264 ± 0.036
1.123HisArg: 1.123 ± 0.033
1.432HisSer: 1.432 ± 0.038
1.063HisThr: 1.063 ± 0.03
1.249HisVal: 1.249 ± 0.033
0.378HisTrp: 0.378 ± 0.018
1.128HisTyr: 1.128 ± 0.032
0.0HisXaa: 0.0 ± 0.0
Ile
6.834IleAla: 6.834 ± 0.085
0.881IleCys: 0.881 ± 0.031
4.71IleAsp: 4.71 ± 0.064
5.009IleGlu: 5.009 ± 0.069
3.003IlePhe: 3.003 ± 0.061
4.978IleGly: 4.978 ± 0.064
1.555IleHis: 1.555 ± 0.037
5.715IleIle: 5.715 ± 0.091
4.896IleLys: 4.896 ± 0.069
6.554IleLeu: 6.554 ± 0.084
1.637IleMet: 1.637 ± 0.032
4.802IleAsn: 4.802 ± 0.076
3.19IlePro: 3.19 ± 0.057
2.729IleGln: 2.729 ± 0.046
3.46IleArg: 3.46 ± 0.061
5.289IleSer: 5.289 ± 0.07
4.777IleThr: 4.777 ± 0.074
4.081IleVal: 4.081 ± 0.059
0.8IleTrp: 0.8 ± 0.025
2.512IleTyr: 2.512 ± 0.055
0.0IleXaa: 0.0 ± 0.0
Lys
4.939LysAla: 4.939 ± 0.097
0.466LysCys: 0.466 ± 0.021
3.004LysAsp: 3.004 ± 0.053
3.832LysGlu: 3.832 ± 0.075
1.944LysPhe: 1.944 ± 0.043
3.63LysGly: 3.63 ± 0.056
1.429LysHis: 1.429 ± 0.036
4.992LysIle: 4.992 ± 0.073
5.21LysLys: 5.21 ± 0.095
6.581LysLeu: 6.581 ± 0.089
1.766LysMet: 1.766 ± 0.038
3.82LysAsn: 3.82 ± 0.076
2.78LysPro: 2.78 ± 0.058
3.543LysGln: 3.543 ± 0.059
3.409LysArg: 3.409 ± 0.055
3.852LysSer: 3.852 ± 0.063
3.852LysThr: 3.852 ± 0.076
3.7LysVal: 3.7 ± 0.059
0.643LysTrp: 0.643 ± 0.024
2.012LysTyr: 2.012 ± 0.042
0.0LysXaa: 0.0 ± 0.0
Leu
8.849LeuAla: 8.849 ± 0.106
1.268LeuCys: 1.268 ± 0.034
4.951LeuAsp: 4.951 ± 0.068
5.047LeuGlu: 5.047 ± 0.074
4.476LeuPhe: 4.476 ± 0.067
5.792LeuGly: 5.792 ± 0.087
2.059LeuHis: 2.059 ± 0.041
7.496LeuIle: 7.496 ± 0.096
6.401LeuLys: 6.401 ± 0.077
10.721LeuLeu: 10.721 ± 0.145
2.571LeuMet: 2.571 ± 0.046
5.418LeuAsn: 5.418 ± 0.078
5.301LeuPro: 5.301 ± 0.11
4.166LeuGln: 4.166 ± 0.062
4.85LeuArg: 4.85 ± 0.072
7.675LeuSer: 7.675 ± 0.1
6.489LeuThr: 6.489 ± 0.097
5.447LeuVal: 5.447 ± 0.072
1.038LeuTrp: 1.038 ± 0.032
2.731LeuTyr: 2.731 ± 0.054
0.0LeuXaa: 0.0 ± 0.0
Met
2.298MetAla: 2.298 ± 0.042
0.201MetCys: 0.201 ± 0.012
0.924MetAsp: 0.924 ± 0.026
1.125MetGlu: 1.125 ± 0.031
0.704MetPhe: 0.704 ± 0.026
1.512MetGly: 1.512 ± 0.038
0.379MetHis: 0.379 ± 0.018
1.81MetIle: 1.81 ± 0.039
1.729MetLys: 1.729 ± 0.041
2.834MetLeu: 2.834 ± 0.05
0.725MetMet: 0.725 ± 0.025
1.164MetAsn: 1.164 ± 0.034
1.18MetPro: 1.18 ± 0.03
1.206MetGln: 1.206 ± 0.033
1.204MetArg: 1.204 ± 0.032
1.672MetSer: 1.672 ± 0.039
1.561MetThr: 1.561 ± 0.037
1.49MetVal: 1.49 ± 0.031
0.228MetTrp: 0.228 ± 0.015
0.52MetTyr: 0.52 ± 0.021
0.0MetXaa: 0.0 ± 0.0
Asn
3.735AsnAla: 3.735 ± 0.06
0.511AsnCys: 0.511 ± 0.018
2.712AsnAsp: 2.712 ± 0.053
2.903AsnGlu: 2.903 ± 0.048
1.939AsnPhe: 1.939 ± 0.045
3.337AsnGly: 3.337 ± 0.071
1.244AsnHis: 1.244 ± 0.031
4.389AsnIle: 4.389 ± 0.072
3.647AsnLys: 3.647 ± 0.066
4.768AsnLeu: 4.768 ± 0.074
1.15AsnMet: 1.15 ± 0.031
3.662AsnAsn: 3.662 ± 0.096
2.109AsnPro: 2.109 ± 0.044
2.803AsnGln: 2.803 ± 0.062
2.493AsnArg: 2.493 ± 0.049
3.267AsnSer: 3.267 ± 0.057
2.694AsnThr: 2.694 ± 0.052
2.71AsnVal: 2.71 ± 0.049
0.799AsnTrp: 0.799 ± 0.025
2.018AsnTyr: 2.018 ± 0.046
0.0AsnXaa: 0.0 ± 0.0
Pro
3.049ProAla: 3.049 ± 0.056
0.356ProCys: 0.356 ± 0.019
2.449ProAsp: 2.449 ± 0.048
2.674ProGlu: 2.674 ± 0.046
1.742ProPhe: 1.742 ± 0.037
1.97ProGly: 1.97 ± 0.05
0.934ProHis: 0.934 ± 0.027
2.871ProIle: 2.871 ± 0.055
2.239ProLys: 2.239 ± 0.049
3.881ProLeu: 3.881 ± 0.061
0.827ProMet: 0.827 ± 0.027
1.913ProAsn: 1.913 ± 0.042
1.373ProPro: 1.373 ± 0.037
1.75ProGln: 1.75 ± 0.042
1.381ProArg: 1.381 ± 0.035
2.256ProSer: 2.256 ± 0.04
2.275ProThr: 2.275 ± 0.055
2.653ProVal: 2.653 ± 0.049
0.503ProTrp: 0.503 ± 0.024
1.422ProTyr: 1.422 ± 0.034
0.0ProXaa: 0.0 ± 0.0
Gln
3.744GlnAla: 3.744 ± 0.069
0.423GlnCys: 0.423 ± 0.019
1.781GlnAsp: 1.781 ± 0.038
2.322GlnGlu: 2.322 ± 0.055
1.95GlnPhe: 1.95 ± 0.041
2.698GlnGly: 2.698 ± 0.049
1.39GlnHis: 1.39 ± 0.033
3.464GlnIle: 3.464 ± 0.055
2.854GlnLys: 2.854 ± 0.052
5.689GlnLeu: 5.689 ± 0.09
1.123GlnMet: 1.123 ± 0.031
1.984GlnAsn: 1.984 ± 0.046
1.901GlnPro: 1.901 ± 0.053
4.062GlnGln: 4.062 ± 0.07
2.627GlnArg: 2.627 ± 0.052
2.77GlnSer: 2.77 ± 0.049
2.419GlnThr: 2.419 ± 0.049
2.596GlnVal: 2.596 ± 0.051
0.646GlnTrp: 0.646 ± 0.025
1.669GlnTyr: 1.669 ± 0.042
0.0GlnXaa: 0.0 ± 0.0
Arg
3.304ArgAla: 3.304 ± 0.059
0.504ArgCys: 0.504 ± 0.024
2.333ArgAsp: 2.333 ± 0.047
2.797ArgGlu: 2.797 ± 0.057
2.428ArgPhe: 2.428 ± 0.049
2.583ArgGly: 2.583 ± 0.05
1.342ArgHis: 1.342 ± 0.036
3.554ArgIle: 3.554 ± 0.061
3.171ArgLys: 3.171 ± 0.051
5.509ArgLeu: 5.509 ± 0.077
1.172ArgMet: 1.172 ± 0.028
2.211ArgAsn: 2.211 ± 0.043
1.723ArgPro: 1.723 ± 0.036
2.835ArgGln: 2.835 ± 0.049
2.874ArgArg: 2.874 ± 0.061
2.405ArgSer: 2.405 ± 0.042
2.079ArgThr: 2.079 ± 0.045
2.581ArgVal: 2.581 ± 0.053
0.738ArgTrp: 0.738 ± 0.03
2.258ArgTyr: 2.258 ± 0.052
0.0ArgXaa: 0.0 ± 0.0
Ser
4.751SerAla: 4.751 ± 0.073
0.646SerCys: 0.646 ± 0.026
3.059SerAsp: 3.059 ± 0.049
3.443SerGlu: 3.443 ± 0.055
2.678SerPhe: 2.678 ± 0.052
4.304SerGly: 4.304 ± 0.07
1.625SerHis: 1.625 ± 0.037
4.49SerIle: 4.49 ± 0.069
3.381SerLys: 3.381 ± 0.06
6.984SerLeu: 6.984 ± 0.089
1.467SerMet: 1.467 ± 0.034
2.958SerAsn: 2.958 ± 0.063
2.533SerPro: 2.533 ± 0.053
2.996SerGln: 2.996 ± 0.06
2.897SerArg: 2.897 ± 0.05
3.926SerSer: 3.926 ± 0.075
3.14SerThr: 3.14 ± 0.052
4.062SerVal: 4.062 ± 0.06
0.854SerTrp: 0.854 ± 0.028
2.104SerTyr: 2.104 ± 0.041
0.0SerXaa: 0.0 ± 0.0
Thr
4.436ThrAla: 4.436 ± 0.1
0.505ThrCys: 0.505 ± 0.022
2.855ThrAsp: 2.855 ± 0.065
3.371ThrGlu: 3.371 ± 0.054
2.133ThrPhe: 2.133 ± 0.049
3.762ThrGly: 3.762 ± 0.059
1.193ThrHis: 1.193 ± 0.034
4.145ThrIle: 4.145 ± 0.074
3.017ThrLys: 3.017 ± 0.053
6.462ThrLeu: 6.462 ± 0.076
1.089ThrMet: 1.089 ± 0.03
2.393ThrAsn: 2.393 ± 0.054
2.532ThrPro: 2.532 ± 0.052
2.418ThrGln: 2.418 ± 0.054
2.275ThrArg: 2.275 ± 0.046
3.071ThrSer: 3.071 ± 0.057
3.158ThrThr: 3.158 ± 0.066
3.396ThrVal: 3.396 ± 0.076
0.554ThrTrp: 0.554 ± 0.022
1.688ThrTyr: 1.688 ± 0.038
0.0ThrXaa: 0.0 ± 0.0
Val
4.532ValAla: 4.532 ± 0.076
0.68ValCys: 0.68 ± 0.026
3.155ValAsp: 3.155 ± 0.055
3.456ValGlu: 3.456 ± 0.055
2.142ValPhe: 2.142 ± 0.049
3.436ValGly: 3.436 ± 0.063
1.101ValHis: 1.101 ± 0.035
4.853ValIle: 4.853 ± 0.071
3.917ValLys: 3.917 ± 0.07
5.134ValLeu: 5.134 ± 0.074
1.631ValMet: 1.631 ± 0.04
3.065ValAsn: 3.065 ± 0.052
2.128ValPro: 2.128 ± 0.046
1.767ValGln: 1.767 ± 0.038
2.748ValArg: 2.748 ± 0.051
3.998ValSer: 3.998 ± 0.076
3.718ValThr: 3.718 ± 0.096
3.792ValVal: 3.792 ± 0.069
0.718ValTrp: 0.718 ± 0.024
1.717ValTyr: 1.717 ± 0.033
0.0ValXaa: 0.0 ± 0.0
Trp
0.724TrpAla: 0.724 ± 0.024
0.193TrpCys: 0.193 ± 0.012
0.513TrpAsp: 0.513 ± 0.02
0.456TrpGlu: 0.456 ± 0.022
0.527TrpPhe: 0.527 ± 0.021
0.661TrpGly: 0.661 ± 0.023
0.428TrpHis: 0.428 ± 0.021
0.85TrpIle: 0.85 ± 0.023
0.597TrpLys: 0.597 ± 0.021
1.932TrpLeu: 1.932 ± 0.047
0.3TrpMet: 0.3 ± 0.016
0.533TrpAsn: 0.533 ± 0.021
0.549TrpPro: 0.549 ± 0.022
1.197TrpGln: 1.197 ± 0.034
0.844TrpArg: 0.844 ± 0.024
0.689TrpSer: 0.689 ± 0.023
0.405TrpThr: 0.405 ± 0.016
0.653TrpVal: 0.653 ± 0.025
0.155TrpTrp: 0.155 ± 0.011
0.36TrpTyr: 0.36 ± 0.018
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.441TyrAla: 2.441 ± 0.05
0.538TyrCys: 0.538 ± 0.019
1.818TyrAsp: 1.818 ± 0.046
1.532TyrGlu: 1.532 ± 0.032
1.631TyrPhe: 1.631 ± 0.041
2.271TyrGly: 2.271 ± 0.043
1.009TyrHis: 1.009 ± 0.03
2.233TyrIle: 2.233 ± 0.047
1.534TyrLys: 1.534 ± 0.042
3.955TyrLeu: 3.955 ± 0.067
0.602TyrMet: 0.602 ± 0.021
1.516TyrAsn: 1.516 ± 0.039
1.415TyrPro: 1.415 ± 0.035
2.273TyrGln: 2.273 ± 0.048
2.063TyrArg: 2.063 ± 0.042
2.213TyrSer: 2.213 ± 0.047
1.594TyrThr: 1.594 ± 0.038
1.612TyrVal: 1.612 ± 0.035
0.504TyrTrp: 0.504 ± 0.022
1.387TyrTyr: 1.387 ± 0.039
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4741 proteins (1228501 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski