Amino acid dipepetide frequency for Lettuce big-vein associated virus (isolate Japan/Kagawa) (LBVaV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.177AlaAla: 4.177 ± 1.655
0.0AlaCys: 0.0 ± 0.0
4.734AlaAsp: 4.734 ± 1.606
2.506AlaGlu: 2.506 ± 1.352
2.506AlaPhe: 2.506 ± 1.107
3.62AlaGly: 3.62 ± 1.913
1.392AlaHis: 1.392 ± 0.476
3.063AlaIle: 3.063 ± 0.637
3.342AlaLys: 3.342 ± 1.441
4.177AlaLeu: 4.177 ± 1.449
3.063AlaMet: 3.063 ± 0.457
1.671AlaAsn: 1.671 ± 1.052
1.671AlaPro: 1.671 ± 1.28
1.392AlaGln: 1.392 ± 0.724
4.456AlaArg: 4.456 ± 0.999
4.734AlaSer: 4.734 ± 1.047
3.063AlaThr: 3.063 ± 0.577
3.899AlaVal: 3.899 ± 1.073
0.835AlaTrp: 0.835 ± 0.495
2.506AlaTyr: 2.506 ± 0.797
0.0AlaXaa: 0.0 ± 0.0
Cys
1.949CysAla: 1.949 ± 0.755
0.0CysCys: 0.0 ± 0.0
0.278CysAsp: 0.278 ± 0.151
1.114CysGlu: 1.114 ± 0.588
1.114CysPhe: 1.114 ± 0.391
1.392CysGly: 1.392 ± 0.376
0.557CysHis: 0.557 ± 0.302
0.557CysIle: 0.557 ± 0.294
0.835CysLys: 0.835 ± 0.381
1.949CysLeu: 1.949 ± 1.056
0.278CysMet: 0.278 ± 0.315
0.0CysAsn: 0.0 ± 0.0
0.557CysPro: 0.557 ± 0.631
1.114CysGln: 1.114 ± 0.604
0.835CysArg: 0.835 ± 0.375
1.114CysSer: 1.114 ± 0.536
0.557CysThr: 0.557 ± 0.302
0.835CysVal: 0.835 ± 0.453
0.835CysTrp: 0.835 ± 0.565
0.557CysTyr: 0.557 ± 0.294
0.0CysXaa: 0.0 ± 0.0
Asp
1.671AspAla: 1.671 ± 1.795
0.557AspCys: 0.557 ± 0.268
3.899AspAsp: 3.899 ± 0.735
4.177AspGlu: 4.177 ± 1.37
1.671AspPhe: 1.671 ± 0.391
4.456AspGly: 4.456 ± 1.069
0.835AspHis: 0.835 ± 0.379
5.569AspIle: 5.569 ± 1.018
3.063AspLys: 3.063 ± 0.51
3.899AspLeu: 3.899 ± 0.883
2.506AspMet: 2.506 ± 0.618
2.228AspAsn: 2.228 ± 0.599
3.063AspPro: 3.063 ± 1.296
1.392AspGln: 1.392 ± 0.434
1.392AspArg: 1.392 ± 0.412
4.177AspSer: 4.177 ± 1.389
4.177AspThr: 4.177 ± 1.044
3.342AspVal: 3.342 ± 1.175
0.0AspTrp: 0.0 ± 0.0
2.228AspTyr: 2.228 ± 0.419
0.0AspXaa: 0.0 ± 0.0
Glu
4.734GluAla: 4.734 ± 2.114
0.835GluCys: 0.835 ± 0.359
3.899GluAsp: 3.899 ± 1.035
5.569GluGlu: 5.569 ± 0.536
2.785GluPhe: 2.785 ± 0.382
5.013GluGly: 5.013 ± 1.463
2.228GluHis: 2.228 ± 0.504
5.013GluIle: 5.013 ± 1.143
4.177GluLys: 4.177 ± 1.188
4.456GluLeu: 4.456 ± 1.128
3.063GluMet: 3.063 ± 0.665
2.785GluAsn: 2.785 ± 0.825
0.835GluPro: 0.835 ± 0.375
0.835GluGln: 0.835 ± 0.379
4.456GluArg: 4.456 ± 0.734
4.177GluSer: 4.177 ± 1.563
5.013GluThr: 5.013 ± 1.467
4.456GluVal: 4.456 ± 0.464
0.835GluTrp: 0.835 ± 0.453
1.671GluTyr: 1.671 ± 0.391
0.0GluXaa: 0.0 ± 0.0
Phe
1.392PheAla: 1.392 ± 0.638
0.557PheCys: 0.557 ± 0.313
2.228PheAsp: 2.228 ± 0.894
1.949PheGlu: 1.949 ± 0.592
0.557PhePhe: 0.557 ± 0.302
1.114PheGly: 1.114 ± 0.336
1.114PheHis: 1.114 ± 0.536
3.063PheIle: 3.063 ± 0.863
2.228PheLys: 2.228 ± 0.664
5.569PheLeu: 5.569 ± 1.322
1.949PheMet: 1.949 ± 0.804
1.114PheAsn: 1.114 ± 0.41
3.063PhePro: 3.063 ± 0.849
1.392PheGln: 1.392 ± 0.539
2.506PheArg: 2.506 ± 0.779
3.62PheSer: 3.62 ± 1.505
1.949PheThr: 1.949 ± 0.936
2.228PheVal: 2.228 ± 0.517
0.0PheTrp: 0.0 ± 0.0
1.114PheTyr: 1.114 ± 0.475
0.0PheXaa: 0.0 ± 0.0
Gly
3.62GlyAla: 3.62 ± 1.084
1.114GlyCys: 1.114 ± 0.439
4.456GlyAsp: 4.456 ± 0.95
3.899GlyGlu: 3.899 ± 0.673
4.734GlyPhe: 4.734 ± 0.955
3.899GlyGly: 3.899 ± 1.141
1.392GlyHis: 1.392 ± 0.754
4.177GlyIle: 4.177 ± 1.028
4.456GlyLys: 4.456 ± 1.235
6.405GlyLeu: 6.405 ± 1.142
2.506GlyMet: 2.506 ± 0.365
2.506GlyAsn: 2.506 ± 0.678
1.671GlyPro: 1.671 ± 0.622
2.506GlyGln: 2.506 ± 0.732
4.177GlyArg: 4.177 ± 1.198
5.291GlySer: 5.291 ± 2.084
5.848GlyThr: 5.848 ± 0.888
3.342GlyVal: 3.342 ± 0.651
1.949GlyTrp: 1.949 ± 0.509
2.228GlyTyr: 2.228 ± 1.131
0.0GlyXaa: 0.0 ± 0.0
His
0.557HisAla: 0.557 ± 0.268
0.278HisCys: 0.278 ± 0.315
0.278HisAsp: 0.278 ± 0.151
1.671HisGlu: 1.671 ± 0.639
1.114HisPhe: 1.114 ± 0.425
0.557HisGly: 0.557 ± 0.399
2.228HisHis: 2.228 ± 0.46
2.506HisIle: 2.506 ± 0.878
1.114HisLys: 1.114 ± 0.425
3.063HisLeu: 3.063 ± 0.685
0.835HisMet: 0.835 ± 0.301
0.278HisAsn: 0.278 ± 0.442
1.671HisPro: 1.671 ± 0.53
0.0HisGln: 0.0 ± 0.0
1.392HisArg: 1.392 ± 0.512
0.835HisSer: 0.835 ± 0.512
1.114HisThr: 1.114 ± 0.41
2.785HisVal: 2.785 ± 2.175
0.278HisTrp: 0.278 ± 0.315
0.557HisTyr: 0.557 ± 0.302
0.0HisXaa: 0.0 ± 0.0
Ile
5.848IleAla: 5.848 ± 0.941
1.114IleCys: 1.114 ± 0.335
3.063IleAsp: 3.063 ± 0.561
4.177IleGlu: 4.177 ± 0.709
2.228IlePhe: 2.228 ± 0.886
6.962IleGly: 6.962 ± 1.042
0.835IleHis: 0.835 ± 0.375
3.62IleIle: 3.62 ± 0.611
4.734IleLys: 4.734 ± 1.346
5.291IleLeu: 5.291 ± 0.825
2.506IleMet: 2.506 ± 0.851
2.228IleAsn: 2.228 ± 0.504
3.62IlePro: 3.62 ± 0.748
1.671IleGln: 1.671 ± 0.42
5.569IleArg: 5.569 ± 2.064
6.405IleSer: 6.405 ± 2.173
3.899IleThr: 3.899 ± 0.751
2.785IleVal: 2.785 ± 0.678
0.0IleTrp: 0.0 ± 0.0
2.228IleTyr: 2.228 ± 0.512
0.0IleXaa: 0.0 ± 0.0
Lys
2.785LysAla: 2.785 ± 1.493
0.557LysCys: 0.557 ± 0.313
2.228LysAsp: 2.228 ± 1.041
4.734LysGlu: 4.734 ± 1.369
2.785LysPhe: 2.785 ± 0.663
4.734LysGly: 4.734 ± 0.889
1.114LysHis: 1.114 ± 0.54
4.734LysIle: 4.734 ± 0.831
5.013LysLys: 5.013 ± 1.197
6.126LysLeu: 6.126 ± 1.681
2.506LysMet: 2.506 ± 0.527
3.063LysAsn: 3.063 ± 0.566
1.949LysPro: 1.949 ± 0.713
2.506LysGln: 2.506 ± 1.454
2.506LysArg: 2.506 ± 0.675
4.734LysSer: 4.734 ± 1.229
3.62LysThr: 3.62 ± 0.677
4.456LysVal: 4.456 ± 1.779
0.835LysTrp: 0.835 ± 0.375
1.949LysTyr: 1.949 ± 0.571
0.0LysXaa: 0.0 ± 0.0
Leu
5.291LeuAla: 5.291 ± 0.634
2.228LeuCys: 2.228 ± 0.928
3.62LeuAsp: 3.62 ± 0.948
6.126LeuGlu: 6.126 ± 1.391
2.785LeuPhe: 2.785 ± 0.955
7.519LeuGly: 7.519 ± 1.542
0.557LeuHis: 0.557 ± 0.302
5.291LeuIle: 5.291 ± 0.784
6.126LeuLys: 6.126 ± 1.432
8.911LeuLeu: 8.911 ± 2.357
3.62LeuMet: 3.62 ± 1.034
3.063LeuAsn: 3.063 ± 0.968
4.734LeuPro: 4.734 ± 0.994
3.063LeuGln: 3.063 ± 0.558
5.848LeuArg: 5.848 ± 1.036
8.076LeuSer: 8.076 ± 1.609
5.848LeuThr: 5.848 ± 1.416
3.62LeuVal: 3.62 ± 0.926
0.835LeuTrp: 0.835 ± 0.565
2.506LeuTyr: 2.506 ± 0.701
0.0LeuXaa: 0.0 ± 0.0
Met
2.506MetAla: 2.506 ± 0.954
1.392MetCys: 1.392 ± 0.754
1.949MetAsp: 1.949 ± 0.843
1.671MetGlu: 1.671 ± 0.428
1.949MetPhe: 1.949 ± 0.756
1.949MetGly: 1.949 ± 0.495
1.392MetHis: 1.392 ± 0.827
2.506MetIle: 2.506 ± 1.073
3.62MetLys: 3.62 ± 1.319
2.506MetLeu: 2.506 ± 0.718
0.835MetMet: 0.835 ± 0.453
1.114MetAsn: 1.114 ± 0.432
1.114MetPro: 1.114 ± 0.379
0.557MetGln: 0.557 ± 0.313
2.506MetArg: 2.506 ± 0.746
3.342MetSer: 3.342 ± 0.756
3.063MetThr: 3.063 ± 0.682
2.506MetVal: 2.506 ± 0.667
0.835MetTrp: 0.835 ± 0.453
2.228MetTyr: 2.228 ± 0.945
0.0MetXaa: 0.0 ± 0.0
Asn
1.949AsnAla: 1.949 ± 0.912
1.392AsnCys: 1.392 ± 0.512
1.392AsnAsp: 1.392 ± 0.37
1.949AsnGlu: 1.949 ± 0.73
1.114AsnPhe: 1.114 ± 0.336
1.114AsnGly: 1.114 ± 0.983
0.835AsnHis: 0.835 ± 0.453
2.785AsnIle: 2.785 ± 1.094
2.228AsnLys: 2.228 ± 0.876
3.899AsnLeu: 3.899 ± 0.673
1.114AsnMet: 1.114 ± 0.425
1.114AsnAsn: 1.114 ± 0.627
1.392AsnPro: 1.392 ± 0.555
1.114AsnGln: 1.114 ± 0.432
1.392AsnArg: 1.392 ± 0.539
2.506AsnSer: 2.506 ± 0.942
2.506AsnThr: 2.506 ± 0.72
2.506AsnVal: 2.506 ± 0.337
0.278AsnTrp: 0.278 ± 0.151
1.949AsnTyr: 1.949 ± 0.519
0.0AsnXaa: 0.0 ± 0.0
Pro
1.392ProAla: 1.392 ± 0.749
0.278ProCys: 0.278 ± 0.151
2.785ProAsp: 2.785 ± 0.771
1.949ProGlu: 1.949 ± 0.771
1.671ProPhe: 1.671 ± 0.431
2.506ProGly: 2.506 ± 0.667
0.835ProHis: 0.835 ± 0.379
3.063ProIle: 3.063 ± 1.045
1.114ProLys: 1.114 ± 0.827
3.62ProLeu: 3.62 ± 0.597
1.671ProMet: 1.671 ± 0.599
2.506ProAsn: 2.506 ± 0.759
3.342ProPro: 3.342 ± 0.893
1.392ProGln: 1.392 ± 0.37
3.063ProArg: 3.063 ± 1.103
3.899ProSer: 3.899 ± 1.805
3.62ProThr: 3.62 ± 1.116
2.785ProVal: 2.785 ± 1.252
0.278ProTrp: 0.278 ± 0.473
0.835ProTyr: 0.835 ± 0.862
0.0ProXaa: 0.0 ± 0.0
Gln
0.835GlnAla: 0.835 ± 0.342
0.278GlnCys: 0.278 ± 0.151
1.671GlnAsp: 1.671 ± 0.758
1.671GlnGlu: 1.671 ± 0.482
1.114GlnPhe: 1.114 ± 0.425
2.228GlnGly: 2.228 ± 0.722
0.557GlnHis: 0.557 ± 0.599
1.671GlnIle: 1.671 ± 0.639
0.835GlnLys: 0.835 ± 0.651
1.949GlnLeu: 1.949 ± 0.594
1.114GlnMet: 1.114 ± 0.379
0.835GlnAsn: 0.835 ± 0.342
0.278GlnPro: 0.278 ± 0.473
0.278GlnGln: 0.278 ± 0.151
1.671GlnArg: 1.671 ± 0.438
1.392GlnSer: 1.392 ± 0.428
2.506GlnThr: 2.506 ± 1.06
1.949GlnVal: 1.949 ± 0.529
0.557GlnTrp: 0.557 ± 0.383
1.114GlnTyr: 1.114 ± 0.604
0.0GlnXaa: 0.0 ± 0.0
Arg
3.62ArgAla: 3.62 ± 1.237
1.949ArgCys: 1.949 ± 0.84
2.785ArgAsp: 2.785 ± 0.745
5.013ArgGlu: 5.013 ± 1.013
3.063ArgPhe: 3.063 ± 0.354
3.62ArgGly: 3.62 ± 0.932
0.835ArgHis: 0.835 ± 0.359
5.848ArgIle: 5.848 ± 1.419
4.177ArgLys: 4.177 ± 1.288
4.456ArgLeu: 4.456 ± 0.881
3.62ArgMet: 3.62 ± 0.937
0.835ArgAsn: 0.835 ± 0.347
1.671ArgPro: 1.671 ± 0.703
1.671ArgGln: 1.671 ± 0.581
3.342ArgArg: 3.342 ± 0.939
3.62ArgSer: 3.62 ± 0.939
2.506ArgThr: 2.506 ± 1.074
3.899ArgVal: 3.899 ± 0.902
0.557ArgTrp: 0.557 ± 0.399
2.785ArgTyr: 2.785 ± 0.529
0.0ArgXaa: 0.0 ± 0.0
Ser
4.456SerAla: 4.456 ± 1.322
0.557SerCys: 0.557 ± 0.268
6.405SerAsp: 6.405 ± 1.123
6.683SerGlu: 6.683 ± 2.365
1.949SerPhe: 1.949 ± 0.539
4.734SerGly: 4.734 ± 1.288
1.392SerHis: 1.392 ± 0.833
4.734SerIle: 4.734 ± 0.667
4.456SerLys: 4.456 ± 0.506
8.354SerLeu: 8.354 ± 1.541
2.785SerMet: 2.785 ± 0.85
2.785SerAsn: 2.785 ± 0.826
2.506SerPro: 2.506 ± 0.845
1.114SerGln: 1.114 ± 0.391
5.291SerArg: 5.291 ± 0.99
6.962SerSer: 6.962 ± 2.426
4.456SerThr: 4.456 ± 0.943
7.24SerVal: 7.24 ± 2.22
0.835SerTrp: 0.835 ± 0.381
2.506SerTyr: 2.506 ± 0.481
0.0SerXaa: 0.0 ± 0.0
Thr
2.228ThrAla: 2.228 ± 1.148
0.557ThrCys: 0.557 ± 0.294
1.114ThrAsp: 1.114 ± 0.425
5.291ThrGlu: 5.291 ± 1.278
2.785ThrPhe: 2.785 ± 1.023
5.569ThrGly: 5.569 ± 1.382
1.949ThrHis: 1.949 ± 0.771
4.734ThrIle: 4.734 ± 1.118
4.456ThrLys: 4.456 ± 1.252
5.291ThrLeu: 5.291 ± 1.244
1.392ThrMet: 1.392 ± 0.37
3.063ThrAsn: 3.063 ± 0.94
3.063ThrPro: 3.063 ± 0.561
0.835ThrGln: 0.835 ± 0.342
4.177ThrArg: 4.177 ± 0.987
6.126ThrSer: 6.126 ± 1.706
2.506ThrThr: 2.506 ± 1.207
3.342ThrVal: 3.342 ± 0.751
1.392ThrTrp: 1.392 ± 0.622
1.671ThrTyr: 1.671 ± 0.401
0.0ThrXaa: 0.0 ± 0.0
Val
5.013ValAla: 5.013 ± 1.548
1.671ValCys: 1.671 ± 0.599
4.734ValAsp: 4.734 ± 1.047
3.342ValGlu: 3.342 ± 1.038
1.949ValPhe: 1.949 ± 0.354
5.013ValGly: 5.013 ± 1.101
1.392ValHis: 1.392 ± 0.412
2.228ValIle: 2.228 ± 0.551
3.063ValLys: 3.063 ± 1.587
6.405ValLeu: 6.405 ± 0.458
1.671ValMet: 1.671 ± 0.599
1.671ValAsn: 1.671 ± 0.954
3.899ValPro: 3.899 ± 1.373
0.557ValGln: 0.557 ± 0.302
2.506ValArg: 2.506 ± 1.509
5.291ValSer: 5.291 ± 1.134
3.063ValThr: 3.063 ± 0.807
4.177ValVal: 4.177 ± 1.023
0.557ValTrp: 0.557 ± 0.399
3.342ValTyr: 3.342 ± 1.173
0.0ValXaa: 0.0 ± 0.0
Trp
0.278TrpAla: 0.278 ± 0.473
0.0TrpCys: 0.0 ± 0.0
1.671TrpAsp: 1.671 ± 0.881
0.835TrpGlu: 0.835 ± 0.463
0.0TrpPhe: 0.0 ± 0.0
1.392TrpGly: 1.392 ± 0.548
0.0TrpHis: 0.0 ± 0.0
1.114TrpIle: 1.114 ± 0.391
1.114TrpLys: 1.114 ± 0.604
0.835TrpLeu: 0.835 ± 0.453
0.557TrpMet: 0.557 ± 0.383
0.835TrpAsn: 0.835 ± 0.381
0.278TrpPro: 0.278 ± 0.151
0.0TrpGln: 0.0 ± 0.0
0.835TrpArg: 0.835 ± 0.495
1.114TrpSer: 1.114 ± 0.536
0.557TrpThr: 0.557 ± 0.313
0.557TrpVal: 0.557 ± 0.614
0.278TrpTrp: 0.278 ± 0.151
0.278TrpTyr: 0.278 ± 0.151
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.506TyrAla: 2.506 ± 0.798
1.114TyrCys: 1.114 ± 0.336
1.114TyrAsp: 1.114 ± 0.683
2.785TyrGlu: 2.785 ± 0.666
0.835TyrPhe: 0.835 ± 0.375
2.785TyrGly: 2.785 ± 0.79
1.671TyrHis: 1.671 ± 0.431
2.506TyrIle: 2.506 ± 0.895
2.785TyrLys: 2.785 ± 0.722
2.506TyrLeu: 2.506 ± 0.593
1.671TyrMet: 1.671 ± 0.438
0.835TyrAsn: 0.835 ± 0.453
2.228TyrPro: 2.228 ± 0.599
1.114TyrGln: 1.114 ± 0.685
1.949TyrArg: 1.949 ± 0.405
2.785TyrSer: 2.785 ± 0.857
1.671TyrThr: 1.671 ± 0.42
0.835TyrVal: 0.835 ± 0.453
0.557TyrTrp: 0.557 ± 0.294
1.114TyrTyr: 1.114 ± 0.536
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3592 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski