Amino acid dipeptide frequency for Raspberry vein chlorosis virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
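As a rough illustration of how such a frequency can be computed, the sketch below counts overlapping residue pairs in a list of one-letter protein sequences. The function name `dipeptide_per_mille`, the pooling over all proteins, and the normalization by the total number of dipeptides are assumptions made for this example; the page does not state the exact normalization used by the database.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter codes); "X" stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    Assumption: frequencies are normalized by the total number of
    dipeptides counted, which may differ from the database's convention.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X (Xaa).
        clean = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(clean[i:i + 2] for i in range(len(clean) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }
```

Calling `dipeptide_per_mille(["ACA"])` counts the two pairs AC and CA and returns a dictionary with all 441 keys, most of them zero.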

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each dipeptide would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
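The arithmetic behind the uniform expectation and the per mille scale can be sketched as follows. The helper names are hypothetical, and using the uniform value as the cut-off between over- and underrepresented dipeptides is an assumption; the page does not say which threshold the colouring actually uses.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected frequency (per mille) if every dipeptide were equally likely."""
    return 1000.0 / alphabet_size ** 2


def per_mille_to_fraction(value):
    """Convert a table value in per mille to a plain fraction."""
    return value * 1e-3


def classify(freq_per_mille, alphabet_size=20):
    """Label a frequency relative to the uniform expectation (assumed cut-off)."""
    expected = uniform_expectation_per_mille(alphabet_size)
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"
```

With 20 amino acids the expectation is 1000 / 400 = 2.5‰, which is the 0.25% quoted in the text; with Xaa included it is 1000 / 441, roughly 2.27‰. Under this assumed cut-off, LeuSer (9.334‰) would count as overrepresented and CysPhe (0.246‰) as underrepresented.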

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.702 ± 2.453
AlaCys: 0.983 ± 0.233
AlaAsp: 3.685 ± 0.551
AlaGlu: 2.948 ± 0.559
AlaPhe: 0.737 ± 0.745
AlaGly: 4.667 ± 1.168
AlaHis: 0.246 ± 0.314
AlaIle: 4.913 ± 2.434
AlaLys: 2.456 ± 1.282
AlaLeu: 5.895 ± 1.147
AlaMet: 2.456 ± 1.135
AlaAsn: 1.474 ± 0.657
AlaPro: 1.474 ± 0.83
AlaGln: 1.965 ± 0.716
AlaArg: 2.456 ± 1.745
AlaSer: 5.158 ± 0.955
AlaThr: 3.439 ± 1.518
AlaVal: 2.211 ± 0.994
AlaTrp: 0.737 ± 0.286
AlaTyr: 1.719 ± 0.933
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.737 ± 0.292
CysCys: 0.491 ± 0.285
CysAsp: 1.965 ± 0.502
CysGlu: 0.737 ± 0.527
CysPhe: 0.246 ± 0.142
CysGly: 0.737 ± 0.521
CysHis: 0.0 ± 0.0
CysIle: 1.719 ± 0.513
CysLys: 1.228 ± 0.419
CysLeu: 0.491 ± 0.311
CysMet: 0.246 ± 0.142
CysAsn: 0.737 ± 0.431
CysPro: 1.965 ± 1.008
CysGln: 0.246 ± 0.314
CysArg: 1.719 ± 0.439
CysSer: 1.474 ± 0.625
CysThr: 0.737 ± 0.292
CysVal: 0.983 ± 0.57
CysTrp: 0.491 ± 0.285
CysTyr: 1.228 ± 0.851
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.439 ± 1.265
AspCys: 1.965 ± 0.593
AspAsp: 3.193 ± 0.951
AspGlu: 3.685 ± 2.039
AspPhe: 1.719 ± 0.614
AspGly: 3.439 ± 0.568
AspHis: 2.211 ± 0.831
AspIle: 7.369 ± 0.982
AspLys: 3.93 ± 0.981
AspLeu: 4.422 ± 1.026
AspMet: 1.965 ± 0.851
AspAsn: 3.439 ± 0.964
AspPro: 2.456 ± 0.771
AspGln: 0.491 ± 0.427
AspArg: 2.456 ± 0.636
AspSer: 5.65 ± 1.431
AspThr: 3.93 ± 1.07
AspVal: 3.439 ± 0.895
AspTrp: 0.491 ± 0.285
AspTyr: 2.702 ± 0.798
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.965 ± 0.488
GluCys: 1.474 ± 0.545
GluAsp: 5.404 ± 1.664
GluGlu: 4.422 ± 1.58
GluPhe: 1.719 ± 0.545
GluGly: 4.667 ± 0.543
GluHis: 0.0 ± 0.0
GluIle: 4.422 ± 1.302
GluLys: 2.948 ± 1.049
GluLeu: 6.141 ± 1.157
GluMet: 3.685 ± 1.012
GluAsn: 1.965 ± 0.688
GluPro: 2.702 ± 0.811
GluGln: 1.474 ± 0.48
GluArg: 2.702 ± 0.531
GluSer: 3.93 ± 1.25
GluThr: 3.685 ± 1.122
GluVal: 3.685 ± 1.319
GluTrp: 1.228 ± 0.498
GluTyr: 1.474 ± 0.522
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.983 ± 0.597
PheCys: 0.737 ± 0.348
PheAsp: 2.211 ± 0.551
PheGlu: 1.474 ± 0.691
PhePhe: 0.491 ± 0.254
PheGly: 0.737 ± 0.29
PheHis: 0.491 ± 0.285
PheIle: 1.719 ± 0.89
PheLys: 2.702 ± 0.857
PheLeu: 3.685 ± 0.929
PheMet: 1.474 ± 0.388
PheAsn: 0.737 ± 0.345
PhePro: 2.211 ± 0.755
PheGln: 0.737 ± 0.345
PheArg: 0.737 ± 0.688
PheSer: 2.456 ± 0.68
PheThr: 2.456 ± 0.979
PheVal: 1.965 ± 0.758
PheTrp: 0.491 ± 0.29
PheTyr: 0.983 ± 0.233
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.685 ± 0.316
GlyCys: 0.737 ± 0.527
GlyAsp: 3.439 ± 0.633
GlyGlu: 3.93 ± 0.544
GlyPhe: 1.228 ± 0.408
GlyGly: 3.193 ± 0.671
GlyHis: 2.702 ± 1.309
GlyIle: 3.193 ± 1.002
GlyLys: 4.913 ± 1.537
GlyLeu: 6.387 ± 1.566
GlyMet: 2.456 ± 0.814
GlyAsn: 1.228 ± 0.536
GlyPro: 1.965 ± 1.144
GlyGln: 1.719 ± 0.592
GlyArg: 4.422 ± 1.653
GlySer: 4.422 ± 1.375
GlyThr: 1.719 ± 1.168
GlyVal: 4.913 ± 0.691
GlyTrp: 0.983 ± 0.42
GlyTyr: 1.474 ± 0.365
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.491 ± 0.469
HisCys: 0.0 ± 0.0
HisAsp: 1.228 ± 0.712
HisGlu: 0.983 ± 0.383
HisPhe: 0.737 ± 0.29
HisGly: 0.983 ± 0.508
HisHis: 0.737 ± 0.427
HisIle: 0.983 ± 0.57
HisLys: 2.456 ± 0.597
HisLeu: 2.456 ± 0.697
HisMet: 0.983 ± 0.533
HisAsn: 0.983 ± 0.327
HisPro: 1.474 ± 0.349
HisGln: 0.737 ± 0.29
HisArg: 1.719 ± 0.721
HisSer: 0.983 ± 0.383
HisThr: 0.491 ± 0.437
HisVal: 1.474 ± 0.291
HisTrp: 0.491 ± 0.285
HisTyr: 0.737 ± 0.292
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.211 ± 0.767
IleCys: 1.228 ± 0.712
IleAsp: 4.422 ± 1.6
IleGlu: 3.93 ± 0.842
IlePhe: 1.965 ± 0.479
IleGly: 4.422 ± 1.182
IleHis: 1.965 ± 0.485
IleIle: 3.93 ± 1.232
IleLys: 6.387 ± 0.874
IleLeu: 6.387 ± 1.283
IleMet: 1.228 ± 0.379
IleAsn: 4.422 ± 0.41
IlePro: 3.93 ± 1.566
IleGln: 2.948 ± 0.753
IleArg: 3.193 ± 0.773
IleSer: 6.878 ± 1.47
IleThr: 4.913 ± 1.405
IleVal: 3.193 ± 0.669
IleTrp: 1.719 ± 0.831
IleTyr: 2.948 ± 0.658
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.685 ± 2.103
LysCys: 0.737 ± 0.292
LysAsp: 3.93 ± 0.838
LysGlu: 4.422 ± 1.392
LysPhe: 1.965 ± 1.054
LysGly: 3.93 ± 1.34
LysHis: 1.965 ± 0.692
LysIle: 4.422 ± 1.169
LysLys: 3.685 ± 1.148
LysLeu: 5.65 ± 1.045
LysMet: 2.456 ± 0.686
LysAsn: 4.176 ± 0.701
LysPro: 1.719 ± 0.592
LysGln: 3.193 ± 0.624
LysArg: 3.93 ± 1.031
LysSer: 4.667 ± 1.168
LysThr: 4.667 ± 0.699
LysVal: 4.176 ± 1.111
LysTrp: 1.228 ± 0.444
LysTyr: 2.211 ± 0.589
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.404 ± 1.834
LeuCys: 1.228 ± 0.851
LeuAsp: 4.176 ± 0.548
LeuGlu: 5.895 ± 1.074
LeuPhe: 3.93 ± 0.83
LeuGly: 5.895 ± 1.029
LeuHis: 1.965 ± 0.639
LeuIle: 6.141 ± 0.963
LeuLys: 5.404 ± 1.202
LeuLeu: 7.124 ± 1.517
LeuMet: 3.193 ± 0.801
LeuAsn: 3.93 ± 1.082
LeuPro: 3.193 ± 1.107
LeuGln: 2.456 ± 0.71
LeuArg: 7.124 ± 1.197
LeuSer: 9.334 ± 2.167
LeuThr: 5.895 ± 1.499
LeuVal: 4.913 ± 1.045
LeuTrp: 1.228 ± 0.552
LeuTyr: 3.93 ± 0.927
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.719 ± 1.069
MetCys: 1.474 ± 0.855
MetAsp: 2.948 ± 1.246
MetGlu: 2.211 ± 0.946
MetPhe: 2.211 ± 0.819
MetGly: 2.948 ± 0.902
MetHis: 0.983 ± 0.371
MetIle: 3.439 ± 0.55
MetLys: 1.965 ± 0.911
MetLeu: 1.965 ± 0.469
MetMet: 1.965 ± 0.466
MetAsn: 0.737 ± 0.345
MetPro: 0.246 ± 0.142
MetGln: 0.983 ± 0.57
MetArg: 3.193 ± 0.75
MetSer: 3.685 ± 1.13
MetThr: 3.685 ± 0.744
MetVal: 0.983 ± 0.379
MetTrp: 0.491 ± 0.311
MetTyr: 0.983 ± 0.336
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.685 ± 0.551
AsnCys: 0.491 ± 0.581
AsnAsp: 1.719 ± 0.384
AsnGlu: 2.948 ± 1.454
AsnPhe: 0.983 ± 0.713
AsnGly: 3.685 ± 0.625
AsnHis: 0.491 ± 0.285
AsnIle: 3.193 ± 1.124
AsnLys: 2.702 ± 1.01
AsnLeu: 3.93 ± 0.856
AsnMet: 1.474 ± 0.666
AsnAsn: 1.228 ± 0.653
AsnPro: 3.439 ± 0.783
AsnGln: 0.737 ± 0.581
AsnArg: 3.439 ± 1.442
AsnSer: 3.685 ± 0.924
AsnThr: 2.211 ± 0.394
AsnVal: 2.948 ± 1.143
AsnTrp: 0.491 ± 0.285
AsnTyr: 1.228 ± 0.41
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.228 ± 0.723
ProCys: 0.246 ± 0.315
ProAsp: 2.456 ± 0.91
ProGlu: 3.193 ± 0.567
ProPhe: 1.228 ± 0.606
ProGly: 1.965 ± 0.732
ProHis: 0.491 ± 0.254
ProIle: 1.965 ± 0.733
ProLys: 2.948 ± 0.964
ProLeu: 5.65 ± 0.997
ProMet: 0.983 ± 0.336
ProAsn: 2.456 ± 1.016
ProPro: 1.719 ± 0.596
ProGln: 0.983 ± 0.57
ProArg: 2.456 ± 0.758
ProSer: 2.702 ± 0.527
ProThr: 3.685 ± 0.74
ProVal: 2.456 ± 0.682
ProTrp: 0.246 ± 0.142
ProTyr: 1.474 ± 0.522
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.211 ± 0.91
GlnCys: 0.737 ± 0.56
GlnAsp: 2.211 ± 0.961
GlnGlu: 1.228 ± 1.302
GlnPhe: 0.491 ± 0.439
GlnGly: 1.474 ± 0.803
GlnHis: 0.983 ± 0.418
GlnIle: 1.965 ± 0.655
GlnLys: 2.456 ± 0.796
GlnLeu: 2.702 ± 0.543
GlnMet: 0.983 ± 0.69
GlnAsn: 0.983 ± 0.377
GlnPro: 1.228 ± 0.567
GlnGln: 0.0 ± 0.0
GlnArg: 2.211 ± 0.584
GlnSer: 2.211 ± 0.393
GlnThr: 2.456 ± 0.463
GlnVal: 1.228 ± 0.355
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.737 ± 0.348
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.93 ± 1.742
ArgCys: 0.983 ± 0.398
ArgAsp: 3.439 ± 0.782
ArgGlu: 4.176 ± 0.559
ArgPhe: 1.965 ± 0.907
ArgGly: 3.93 ± 0.486
ArgHis: 1.228 ± 0.528
ArgIle: 2.948 ± 1.916
ArgLys: 3.685 ± 0.955
ArgLeu: 5.895 ± 1.047
ArgMet: 1.719 ± 0.606
ArgAsn: 2.211 ± 0.632
ArgPro: 1.228 ± 0.723
ArgGln: 1.965 ± 0.781
ArgArg: 3.193 ± 0.868
ArgSer: 5.895 ± 1.317
ArgThr: 2.702 ± 0.957
ArgVal: 3.93 ± 0.977
ArgTrp: 1.474 ± 0.647
ArgTyr: 1.719 ± 0.502
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.913 ± 2.012
SerCys: 1.965 ± 0.435
SerAsp: 5.895 ± 0.734
SerGlu: 6.141 ± 0.994
SerPhe: 1.965 ± 0.68
SerGly: 2.948 ± 0.659
SerHis: 0.983 ± 0.383
SerIle: 5.895 ± 1.453
SerLys: 5.895 ± 1.323
SerLeu: 7.369 ± 0.833
SerMet: 2.702 ± 1.013
SerAsn: 3.93 ± 0.731
SerPro: 2.702 ± 0.612
SerGln: 0.983 ± 0.511
SerArg: 5.65 ± 0.935
SerSer: 7.369 ± 1.817
SerThr: 5.404 ± 1.067
SerVal: 6.141 ± 2.127
SerTrp: 1.228 ± 0.528
SerTyr: 3.685 ± 1.022
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.93 ± 0.496
ThrCys: 0.246 ± 0.142
ThrAsp: 3.439 ± 1.092
ThrGlu: 2.702 ± 0.86
ThrPhe: 2.211 ± 0.761
ThrGly: 3.685 ± 1.031
ThrHis: 0.737 ± 0.731
ThrIle: 4.913 ± 1.149
ThrLys: 4.422 ± 0.924
ThrLeu: 5.158 ± 0.762
ThrMet: 2.702 ± 0.612
ThrAsn: 3.685 ± 0.817
ThrPro: 3.193 ± 0.523
ThrGln: 2.948 ± 0.73
ThrArg: 2.948 ± 0.549
ThrSer: 5.404 ± 0.483
ThrThr: 2.702 ± 0.792
ThrVal: 2.948 ± 0.698
ThrTrp: 1.228 ± 0.408
ThrTyr: 2.456 ± 0.573
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.456 ± 1.062
ValCys: 1.719 ± 0.854
ValAsp: 4.176 ± 0.742
ValGlu: 3.193 ± 0.805
ValPhe: 1.965 ± 0.73
ValGly: 2.456 ± 1.27
ValHis: 1.228 ± 0.708
ValIle: 6.878 ± 0.678
ValLys: 4.176 ± 0.942
ValLeu: 3.93 ± 0.908
ValMet: 3.685 ± 1.067
ValAsn: 2.948 ± 0.976
ValPro: 2.211 ± 0.465
ValGln: 1.228 ± 0.984
ValArg: 1.965 ± 1.227
ValSer: 4.913 ± 0.976
ValThr: 3.193 ± 0.941
ValVal: 4.422 ± 1.217
ValTrp: 0.491 ± 0.266
ValTyr: 1.719 ± 0.758
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.737 ± 0.341
TrpCys: 0.0 ± 0.0
TrpAsp: 0.737 ± 0.292
TrpGlu: 0.737 ± 0.29
TrpPhe: 0.983 ± 0.42
TrpGly: 0.737 ± 0.427
TrpHis: 0.0 ± 0.0
TrpIle: 0.737 ± 0.362
TrpLys: 0.491 ± 0.266
TrpLeu: 1.719 ± 0.609
TrpMet: 1.474 ± 0.625
TrpAsn: 0.983 ± 0.383
TrpPro: 0.0 ± 0.0
TrpGln: 0.491 ± 0.311
TrpArg: 1.474 ± 0.447
TrpSer: 1.228 ± 0.653
TrpThr: 1.228 ± 0.369
TrpVal: 0.737 ± 0.286
TrpTrp: 0.246 ± 0.142
TrpTyr: 0.491 ± 0.427
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.211 ± 0.846
TyrCys: 0.737 ± 0.427
TyrAsp: 1.719 ± 0.49
TyrGlu: 0.737 ± 0.357
TyrPhe: 0.737 ± 0.29
TyrGly: 2.211 ± 0.696
TyrHis: 1.719 ± 0.396
TyrIle: 1.474 ± 0.799
TyrLys: 1.965 ± 0.466
TyrLeu: 5.404 ± 0.782
TyrMet: 0.737 ± 0.427
TyrAsn: 2.456 ± 0.521
TyrPro: 1.474 ± 0.454
TyrGln: 2.211 ± 0.833
TyrArg: 1.228 ± 0.26
TyrSer: 1.719 ± 0.701
TyrThr: 2.456 ± 0.837
TyrVal: 2.456 ± 1.425
TyrTrp: 0.246 ± 0.142
TyrTyr: 2.456 ± 0.796
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (4072 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
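One way to read "bootstrapping at the protein level" is that whole proteins, not individual residues, are resampled with replacement and the statistic is recomputed on each resample. The sketch below follows that reading for a single dipeptide. The function names, the use of the sample standard deviation as the reported error, and the fixed seed are all assumptions; the page does not describe the exact procedure.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (e.g. "LS") in per mille, pooled over proteins."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the proteome, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(resample, pair))
    return statistics.stdev(estimates)
```

With only 8 proteins in this proteome, each resample can differ a lot from the original set, which may explain why some errors in the table are close to the values themselves.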

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski