Amino acid dipeptide frequency for Angelica bushy stunt virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
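
As an illustration of the calculation described above, here is a minimal Python sketch that counts overlapping adjacent residue pairs and reports them in per mille. The function name `dipeptide_per_mille` and the toy sequences are hypothetical, the sketch uses one-letter codes rather than the three-letter codes of the table, and counting pairs only within each protein (not across protein boundaries) is an assumption; the page does not state how the database handles this.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Overlapping pairs inside one sequence: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}  # no sequence was long enough to contain a pair
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy sequences, not taken from this proteome.
toy = ["MKKLE", "KKA"]
freqs = dipeptide_per_mille(toy)
```

In the toy input, "KK" occurs twice among six pairs, so its frequency is one third of 1000‰.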

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
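
The uniform expectation above can be checked with a few lines of Python. This is only a sketch: the `classify` helper and its labels are hypothetical, and comparing each value directly against the uniform expectation is an assumption about how the red/blue marking is decided.

```python
STANDARD_AMINO_ACIDS = 20

# Uniform expectation in per mille: 1000 / 400 = 2.5 (i.e. 0.25%).
EXPECTED_PER_MILLE = 1000.0 / STANDARD_AMINO_ACIDS ** 2
# With Xaa as a 21st symbol there are 441 pairs instead.
EXPECTED_WITH_XAA = 1000.0 / (STANDARD_AMINO_ACIDS + 1) ** 2


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if freq_per_mille > expected:
        return "over-represented"
    if freq_per_mille < expected:
        return "under-represented"
    return "as expected"


# Two values taken from the table: LysLys (16.466) and AlaCys (0.0).
lyslys = classify(16.466)
alacys = classify(0.0)
```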

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions (for example, 4.016‰ = 0.004016).

For more information, see the sequence space article on Wikipedia.

Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.016 ± 1.127
AlaCys: 0.0 ± 0.0
AlaAsp: 1.205 ± 0.729
AlaGlu: 4.418 ± 0.899
AlaPhe: 4.016 ± 0.893
AlaGly: 1.205 ± 0.567
AlaHis: 0.803 ± 0.423
AlaIle: 2.008 ± 1.099
AlaLys: 4.819 ± 0.962
AlaLeu: 5.221 ± 1.766
AlaMet: 0.803 ± 0.478
AlaAsn: 2.41 ± 0.68
AlaPro: 2.41 ± 0.742
AlaGln: 0.402 ± 0.281
AlaArg: 0.803 ± 0.711
AlaSer: 6.024 ± 1.93
AlaThr: 0.803 ± 0.711
AlaVal: 2.008 ± 1.137
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.606 ± 0.976
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.803 ± 0.371
CysAsp: 0.402 ± 0.345
CysGlu: 2.008 ± 0.836
CysPhe: 0.402 ± 0.501
CysGly: 0.0 ± 0.0
CysHis: 0.402 ± 0.501
CysIle: 0.402 ± 0.281
CysLys: 2.008 ± 0.854
CysLeu: 1.606 ± 0.513
CysMet: 0.0 ± 0.0
CysAsn: 1.606 ± 0.964
CysPro: 1.606 ± 0.79
CysGln: 0.402 ± 0.356
CysArg: 0.803 ± 0.371
CysSer: 1.205 ± 0.337
CysThr: 0.0 ± 0.0
CysVal: 1.606 ± 0.249
CysTrp: 0.402 ± 0.345
CysTyr: 0.402 ± 0.281
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.606 ± 0.742
AspCys: 0.803 ± 0.349
AspAsp: 2.811 ± 0.523
AspGlu: 3.213 ± 1.311
AspPhe: 3.213 ± 0.513
AspGly: 2.008 ± 0.836
AspHis: 1.606 ± 0.536
AspIle: 2.811 ± 0.935
AspLys: 5.221 ± 0.835
AspLeu: 5.221 ± 1.004
AspMet: 2.41 ± 0.864
AspAsn: 4.016 ± 1.044
AspPro: 4.016 ± 1.145
AspGln: 2.008 ± 0.564
AspArg: 1.205 ± 0.688
AspSer: 4.016 ± 1.28
AspThr: 2.008 ± 0.916
AspVal: 1.205 ± 0.398
AspTrp: 0.0 ± 0.0
AspTyr: 3.614 ± 1.011
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.213 ± 1.483
GluCys: 0.803 ± 0.69
GluAsp: 6.426 ± 1.507
GluGlu: 9.639 ± 2.025
GluPhe: 3.614 ± 1.011
GluGly: 2.008 ± 0.464
GluHis: 2.008 ± 0.755
GluIle: 6.827 ± 1.78
GluLys: 11.647 ± 1.244
GluLeu: 5.221 ± 1.316
GluMet: 2.008 ± 1.18
GluAsn: 5.622 ± 1.854
GluPro: 2.008 ± 1.208
GluGln: 4.819 ± 0.951
GluArg: 4.016 ± 1.098
GluSer: 6.024 ± 2.285
GluThr: 3.213 ± 1.346
GluVal: 1.205 ± 1.036
GluTrp: 0.0 ± 0.0
GluTyr: 1.606 ± 0.536
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.008 ± 0.638
PheCys: 2.811 ± 0.977
PheAsp: 0.402 ± 0.501
PheGlu: 3.213 ± 1.672
PhePhe: 0.803 ± 0.371
PheGly: 4.016 ± 1.045
PheHis: 1.606 ± 0.617
PheIle: 4.418 ± 1.585
PheLys: 3.614 ± 0.758
PheLeu: 4.016 ± 1.027
PheMet: 1.205 ± 0.52
PheAsn: 2.41 ± 1.013
PhePro: 2.811 ± 1.452
PheGln: 1.205 ± 0.575
PheArg: 2.811 ± 0.964
PheSer: 6.024 ± 1.252
PheThr: 3.614 ± 1.184
PheVal: 0.803 ± 0.725
PheTrp: 1.205 ± 0.623
PheTyr: 1.205 ± 0.337
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.41 ± 0.851
GlyCys: 0.402 ± 0.345
GlyAsp: 1.606 ± 0.699
GlyGlu: 2.41 ± 0.971
GlyPhe: 3.213 ± 0.603
GlyGly: 0.803 ± 0.423
GlyHis: 2.008 ± 0.682
GlyIle: 4.418 ± 0.539
GlyLys: 3.614 ± 1.091
GlyLeu: 4.418 ± 0.974
GlyMet: 0.803 ± 0.423
GlyAsn: 2.41 ± 0.998
GlyPro: 1.205 ± 0.337
GlyGln: 0.803 ± 0.69
GlyArg: 2.41 ± 0.957
GlySer: 2.811 ± 0.618
GlyThr: 3.614 ± 1.333
GlyVal: 1.205 ± 0.687
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.402 ± 0.281
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.402 ± 0.501
HisCys: 1.205 ± 0.495
HisAsp: 0.803 ± 0.557
HisGlu: 1.205 ± 0.687
HisPhe: 1.606 ± 0.51
HisGly: 0.402 ± 0.362
HisHis: 0.803 ± 0.349
HisIle: 2.008 ± 1.021
HisLys: 2.008 ± 0.915
HisLeu: 0.803 ± 0.562
HisMet: 1.205 ± 0.56
HisAsn: 0.803 ± 0.371
HisPro: 1.205 ± 0.493
HisGln: 3.614 ± 0.751
HisArg: 1.205 ± 0.575
HisSer: 1.606 ± 0.639
HisThr: 0.402 ± 0.281
HisVal: 1.606 ± 0.904
HisTrp: 0.803 ± 0.562
HisTyr: 1.606 ± 0.513
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.213 ± 0.824
IleCys: 1.606 ± 0.742
IleAsp: 7.631 ± 2.096
IleGlu: 6.426 ± 1.16
IlePhe: 3.213 ± 0.991
IleGly: 4.418 ± 1.235
IleHis: 0.402 ± 0.356
IleIle: 5.622 ± 1.026
IleLys: 5.622 ± 1.034
IleLeu: 8.032 ± 1.058
IleMet: 0.402 ± 0.745
IleAsn: 7.229 ± 1.688
IlePro: 3.213 ± 0.68
IleGln: 2.41 ± 1.052
IleArg: 2.811 ± 0.837
IleSer: 4.819 ± 1.724
IleThr: 3.213 ± 1.164
IleVal: 3.213 ± 1.62
IleTrp: 0.0 ± 0.0
IleTyr: 2.811 ± 0.784
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.221 ± 1.555
LysCys: 1.606 ± 0.526
LysAsp: 4.819 ± 1.191
LysGlu: 8.835 ± 2.242
LysPhe: 6.024 ± 1.217
LysGly: 3.213 ± 1.136
LysHis: 3.614 ± 0.957
LysIle: 8.032 ± 1.836
LysLys: 16.466 ± 3.043
LysLeu: 7.631 ± 1.872
LysMet: 1.205 ± 0.618
LysAsn: 4.016 ± 0.934
LysPro: 3.614 ± 0.791
LysGln: 7.631 ± 2.001
LysArg: 4.016 ± 1.287
LysSer: 3.213 ± 0.883
LysThr: 5.221 ± 1.238
LysVal: 6.024 ± 1.361
LysTrp: 0.0 ± 0.0
LysTyr: 2.008 ± 1.013
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.819 ± 0.798
LeuCys: 0.402 ± 0.345
LeuAsp: 6.024 ± 0.869
LeuGlu: 11.245 ± 1.886
LeuPhe: 2.811 ± 1.035
LeuGly: 4.016 ± 0.954
LeuHis: 2.008 ± 0.515
LeuIle: 6.426 ± 0.693
LeuLys: 8.434 ± 0.807
LeuLeu: 10.442 ± 1.846
LeuMet: 1.205 ± 0.664
LeuAsn: 6.426 ± 1.423
LeuPro: 2.41 ± 0.585
LeuGln: 4.819 ± 1.183
LeuArg: 2.811 ± 1.063
LeuSer: 4.819 ± 1.439
LeuThr: 3.614 ± 1.452
LeuVal: 4.819 ± 0.887
LeuTrp: 1.205 ± 0.489
LeuTyr: 1.205 ± 0.603
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.205 ± 0.689
MetCys: 0.0 ± 0.0
MetAsp: 1.205 ± 0.575
MetGlu: 2.41 ± 1.373
MetPhe: 1.205 ± 0.41
MetGly: 0.402 ± 0.356
MetHis: 0.0 ± 0.0
MetIle: 1.205 ± 0.489
MetLys: 1.606 ± 0.609
MetLeu: 1.606 ± 1.128
MetMet: 0.0 ± 0.0
MetAsn: 2.41 ± 0.787
MetPro: 0.0 ± 0.0
MetGln: 1.606 ± 0.536
MetArg: 0.0 ± 0.0
MetSer: 2.008 ± 0.738
MetThr: 1.606 ± 0.51
MetVal: 0.803 ± 0.349
MetTrp: 0.0 ± 0.0
MetTyr: 0.402 ± 0.345
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.811 ± 1.09
AsnCys: 0.402 ± 0.345
AsnAsp: 2.41 ± 1.095
AsnGlu: 5.221 ± 1.416
AsnPhe: 3.614 ± 1.014
AsnGly: 2.41 ± 1.157
AsnHis: 1.606 ± 0.688
AsnIle: 6.426 ± 1.442
AsnLys: 6.426 ± 1.518
AsnLeu: 9.237 ± 2.712
AsnMet: 1.606 ± 0.576
AsnAsn: 5.221 ± 1.617
AsnPro: 2.41 ± 0.538
AsnGln: 3.213 ± 1.349
AsnArg: 2.41 ± 1.659
AsnSer: 3.614 ± 1.398
AsnThr: 2.41 ± 0.741
AsnVal: 3.614 ± 1.001
AsnTrp: 0.803 ± 0.349
AsnTyr: 1.606 ± 0.586
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.205 ± 0.575
ProCys: 1.606 ± 0.818
ProAsp: 2.008 ± 0.755
ProGlu: 2.41 ± 0.741
ProPhe: 2.008 ± 0.816
ProGly: 1.606 ± 0.536
ProHis: 0.803 ± 0.737
ProIle: 2.811 ± 0.737
ProLys: 2.811 ± 0.935
ProLeu: 4.418 ± 1.641
ProMet: 1.205 ± 0.489
ProAsn: 2.41 ± 0.66
ProPro: 0.803 ± 0.498
ProGln: 2.008 ± 0.764
ProArg: 2.811 ± 0.802
ProSer: 4.418 ± 0.685
ProThr: 1.205 ± 0.493
ProVal: 3.213 ± 0.658
ProTrp: 0.0 ± 0.0
ProTyr: 1.205 ± 1.067
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.606 ± 0.554
GlnCys: 0.0 ± 0.0
GlnAsp: 4.016 ± 1.258
GlnGlu: 4.016 ± 1.422
GlnPhe: 2.811 ± 0.934
GlnGly: 2.811 ± 1.218
GlnHis: 0.803 ± 0.501
GlnIle: 4.016 ± 0.678
GlnLys: 4.819 ± 1.428
GlnLeu: 3.213 ± 0.845
GlnMet: 0.803 ± 0.584
GlnAsn: 3.614 ± 0.932
GlnPro: 2.41 ± 0.959
GlnGln: 2.41 ± 0.493
GlnArg: 1.606 ± 0.513
GlnSer: 2.008 ± 1.187
GlnThr: 1.606 ± 0.617
GlnVal: 2.41 ± 0.864
GlnTrp: 0.803 ± 0.562
GlnTyr: 1.205 ± 0.575
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.41 ± 1.098
ArgCys: 0.803 ± 0.557
ArgAsp: 1.606 ± 0.577
ArgGlu: 2.41 ± 1.142
ArgPhe: 2.008 ± 0.472
ArgGly: 1.606 ± 0.866
ArgHis: 1.205 ± 0.337
ArgIle: 4.418 ± 1.396
ArgLys: 4.016 ± 0.548
ArgLeu: 3.614 ± 1.924
ArgMet: 1.606 ± 0.586
ArgAsn: 2.41 ± 1.36
ArgPro: 2.008 ± 0.613
ArgGln: 1.606 ± 0.614
ArgArg: 3.614 ± 1.21
ArgSer: 0.402 ± 0.362
ArgThr: 2.41 ± 0.741
ArgVal: 2.008 ± 0.607
ArgTrp: 0.402 ± 0.281
ArgTyr: 1.205 ± 0.56
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.008 ± 0.854
SerCys: 0.402 ± 0.356
SerAsp: 3.213 ± 0.809
SerGlu: 5.221 ± 1.737
SerPhe: 4.016 ± 0.784
SerGly: 3.213 ± 0.689
SerHis: 1.606 ± 0.601
SerIle: 6.426 ± 0.907
SerLys: 9.237 ± 1.394
SerLeu: 4.016 ± 0.85
SerMet: 0.803 ± 0.455
SerAsn: 3.213 ± 0.775
SerPro: 3.213 ± 1.171
SerGln: 3.213 ± 0.954
SerArg: 3.213 ± 1.723
SerSer: 6.426 ± 1.862
SerThr: 2.811 ± 0.924
SerVal: 1.205 ± 1.141
SerTrp: 1.205 ± 0.337
SerTyr: 3.614 ± 0.992
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.008 ± 1.132
ThrCys: 0.803 ± 0.371
ThrAsp: 2.008 ± 0.755
ThrGlu: 4.016 ± 1.19
ThrPhe: 0.402 ± 0.345
ThrGly: 3.213 ± 1.02
ThrHis: 1.205 ± 0.52
ThrIle: 2.41 ± 0.929
ThrLys: 3.614 ± 1.107
ThrLeu: 2.811 ± 1.107
ThrMet: 0.0 ± 0.0
ThrAsn: 6.024 ± 1.162
ThrPro: 1.606 ± 0.586
ThrGln: 0.803 ± 0.35
ThrArg: 1.606 ± 0.506
ThrSer: 3.213 ± 1.186
ThrThr: 2.008 ± 0.642
ThrVal: 4.819 ± 0.631
ThrTrp: 0.0 ± 0.0
ThrTyr: 2.008 ± 0.795
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.811 ± 0.729
ValCys: 1.205 ± 0.843
ValAsp: 2.811 ± 0.827
ValGlu: 2.008 ± 0.613
ValPhe: 3.614 ± 0.805
ValGly: 2.41 ± 0.669
ValHis: 1.205 ± 0.654
ValIle: 2.41 ± 0.974
ValLys: 3.614 ± 1.497
ValLeu: 4.819 ± 1.402
ValMet: 1.205 ± 0.41
ValAsn: 3.213 ± 0.529
ValPro: 1.205 ± 0.562
ValGln: 1.606 ± 1.423
ValArg: 2.41 ± 0.677
ValSer: 2.41 ± 0.945
ValThr: 2.008 ± 0.915
ValVal: 1.606 ± 0.249
ValTrp: 0.803 ± 0.498
ValTyr: 4.418 ± 1.264
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.402 ± 0.356
TrpPhe: 0.402 ± 0.281
TrpGly: 0.402 ± 0.281
TrpHis: 0.0 ± 0.0
TrpIle: 1.205 ± 0.489
TrpLys: 0.402 ± 0.281
TrpLeu: 0.402 ± 0.281
TrpMet: 0.402 ± 0.281
TrpAsn: 0.803 ± 0.423
TrpPro: 0.402 ± 0.539
TrpGln: 0.803 ± 0.562
TrpArg: 0.402 ± 0.281
TrpSer: 0.402 ± 0.345
TrpThr: 1.205 ± 0.41
TrpVal: 0.402 ± 0.281
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.008 ± 0.351
TyrCys: 0.402 ± 0.345
TyrAsp: 1.606 ± 0.699
TyrGlu: 1.205 ± 0.729
TyrPhe: 1.205 ± 0.44
TyrGly: 0.803 ± 0.371
TyrHis: 1.606 ± 0.51
TyrIle: 2.41 ± 0.564
TyrLys: 2.41 ± 0.422
TyrLeu: 3.213 ± 0.497
TyrMet: 0.402 ± 0.345
TyrAsn: 1.205 ± 0.946
TyrPro: 2.41 ± 0.796
TyrGln: 1.606 ± 0.506
TyrArg: 0.803 ± 0.488
TyrSer: 2.811 ± 1.219
TyrThr: 1.606 ± 0.942
TyrVal: 4.016 ± 0.915
TyrTrp: 0.402 ± 0.281
TyrTyr: 1.205 ± 0.398
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2491 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
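
A protein-level bootstrap of this kind can be sketched in Python as follows. This is not the database's own code: the function names, the toy sequences, the fixed seed, and the choice of the sample standard deviation of the replicates as the reported error are all assumptions made for illustration.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide, in per mille, over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, rounds=100, seed=0):
    """Resample whole proteins with replacement and return the spread."""
    rng = random.Random(seed)  # seeded so the sketch is reproducible
    replicates = []
    for _ in range(rounds):
        # One replicate: draw as many proteins as the proteome contains.
        sample = rng.choices(sequences, k=len(sequences))
        replicates.append(pair_per_mille(sample, pair))
    # Assumed error measure: sample standard deviation of the replicates.
    return statistics.stdev(replicates)


# Made-up toy sequences, not taken from this proteome.
toy = ["MKKLE", "KKA", "AAKK"]
kk_error = bootstrap_error(toy, "KK")
ww_error = bootstrap_error(toy, "WW")
```

A dipeptide that is absent from every protein has a frequency of 0 in every replicate, so its estimated error is 0 as well, which is consistent with the 0.0 ± 0.0 entries in the table.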

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski