Amino acid dipepetide frequency for Cotton leafroll dwarf virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.553AlaAla: 7.553 ± 1.148
0.985AlaCys: 0.985 ± 0.67
2.956AlaAsp: 2.956 ± 0.522
5.911AlaGlu: 5.911 ± 0.928
2.627AlaPhe: 2.627 ± 0.442
3.941AlaGly: 3.941 ± 0.727
1.314AlaHis: 1.314 ± 0.274
2.956AlaIle: 2.956 ± 1.062
3.612AlaLys: 3.612 ± 1.165
7.225AlaLeu: 7.225 ± 2.182
1.314AlaMet: 1.314 ± 0.752
0.657AlaAsn: 0.657 ± 0.282
4.269AlaPro: 4.269 ± 1.137
2.299AlaGln: 2.299 ± 0.99
2.956AlaArg: 2.956 ± 0.835
8.539AlaSer: 8.539 ± 0.998
4.598AlaThr: 4.598 ± 0.96
4.269AlaVal: 4.269 ± 0.823
0.657AlaTrp: 0.657 ± 0.597
1.642AlaTyr: 1.642 ± 0.541
0.0AlaXaa: 0.0 ± 0.0
Cys
0.328CysAla: 0.328 ± 0.223
0.0CysCys: 0.0 ± 0.0
0.657CysAsp: 0.657 ± 0.255
0.657CysGlu: 0.657 ± 0.415
0.328CysPhe: 0.328 ± 0.223
0.657CysGly: 0.657 ± 0.447
0.328CysHis: 0.328 ± 0.223
1.642CysIle: 1.642 ± 0.692
0.985CysLys: 0.985 ± 0.659
0.328CysLeu: 0.328 ± 0.223
1.314CysMet: 1.314 ± 0.305
0.328CysAsn: 0.328 ± 0.342
2.299CysPro: 2.299 ± 0.411
0.985CysGln: 0.985 ± 0.667
0.985CysArg: 0.985 ± 0.313
0.657CysSer: 0.657 ± 0.447
0.985CysThr: 0.985 ± 0.404
0.657CysVal: 0.657 ± 0.282
0.328CysTrp: 0.328 ± 0.223
0.657CysTyr: 0.657 ± 0.282
0.0CysXaa: 0.0 ± 0.0
Asp
2.956AspAla: 2.956 ± 0.871
0.985AspCys: 0.985 ± 0.394
2.627AspAsp: 2.627 ± 1.344
3.612AspGlu: 3.612 ± 0.865
2.956AspPhe: 2.956 ± 0.913
5.255AspGly: 5.255 ± 0.986
0.328AspHis: 0.328 ± 0.379
0.0AspIle: 0.0 ± 0.0
0.985AspLys: 0.985 ± 0.895
3.612AspLeu: 3.612 ± 0.62
0.328AspMet: 0.328 ± 0.219
0.985AspAsn: 0.985 ± 0.895
1.97AspPro: 1.97 ± 0.785
2.299AspGln: 2.299 ± 1.189
3.941AspArg: 3.941 ± 1.019
3.284AspSer: 3.284 ± 0.741
1.97AspThr: 1.97 ± 0.456
1.314AspVal: 1.314 ± 0.51
0.985AspTrp: 0.985 ± 0.375
1.314AspTyr: 1.314 ± 0.432
0.0AspXaa: 0.0 ± 0.0
Glu
4.269GluAla: 4.269 ± 0.672
0.328GluCys: 0.328 ± 0.342
5.911GluAsp: 5.911 ± 2.17
2.956GluGlu: 2.956 ± 0.981
3.612GluPhe: 3.612 ± 0.484
3.284GluGly: 3.284 ± 0.542
0.657GluHis: 0.657 ± 0.282
0.328GluIle: 0.328 ± 0.223
3.612GluLys: 3.612 ± 1.035
3.284GluLeu: 3.284 ± 0.571
0.657GluMet: 0.657 ± 0.255
1.642GluAsn: 1.642 ± 0.449
2.956GluPro: 2.956 ± 0.522
2.299GluGln: 2.299 ± 0.433
2.956GluArg: 2.956 ± 0.702
5.911GluSer: 5.911 ± 0.553
3.612GluThr: 3.612 ± 0.658
4.598GluVal: 4.598 ± 0.576
1.642GluTrp: 1.642 ± 0.445
3.284GluTyr: 3.284 ± 0.971
0.0GluXaa: 0.0 ± 0.0
Phe
4.598PheAla: 4.598 ± 0.967
0.657PheCys: 0.657 ± 0.342
2.956PheAsp: 2.956 ± 0.424
2.299PheGlu: 2.299 ± 0.457
2.299PhePhe: 2.299 ± 0.856
2.956PheGly: 2.956 ± 0.736
2.956PheHis: 2.956 ± 0.238
1.642PheIle: 1.642 ± 0.942
2.299PheLys: 2.299 ± 0.919
7.225PheLeu: 7.225 ± 1.553
0.328PheMet: 0.328 ± 0.342
0.985PheAsn: 0.985 ± 0.313
1.314PhePro: 1.314 ± 0.466
1.314PheGln: 1.314 ± 0.624
1.642PheArg: 1.642 ± 0.315
3.941PheSer: 3.941 ± 0.628
1.314PheThr: 1.314 ± 0.565
4.598PheVal: 4.598 ± 0.768
0.657PheTrp: 0.657 ± 0.415
1.314PheTyr: 1.314 ± 0.624
0.0PheXaa: 0.0 ± 0.0
Gly
3.612GlyAla: 3.612 ± 0.408
0.985GlyCys: 0.985 ± 0.21
3.284GlyAsp: 3.284 ± 0.581
4.926GlyGlu: 4.926 ± 1.028
4.598GlyPhe: 4.598 ± 1.163
7.882GlyGly: 7.882 ± 1.179
0.657GlyHis: 0.657 ± 0.415
2.299GlyIle: 2.299 ± 0.979
5.583GlyLys: 5.583 ± 0.848
4.598GlyLeu: 4.598 ± 1.256
0.328GlyMet: 0.328 ± 0.379
3.612GlyAsn: 3.612 ± 1.375
3.612GlyPro: 3.612 ± 1.128
1.314GlyGln: 1.314 ± 0.588
6.568GlyArg: 6.568 ± 2.022
7.882GlySer: 7.882 ± 1.626
2.956GlyThr: 2.956 ± 0.524
2.627GlyVal: 2.627 ± 0.512
0.985GlyTrp: 0.985 ± 0.375
3.284GlyTyr: 3.284 ± 0.439
0.0GlyXaa: 0.0 ± 0.0
His
0.657HisAla: 0.657 ± 0.342
0.985HisCys: 0.985 ± 0.394
1.314HisAsp: 1.314 ± 0.451
0.657HisGlu: 0.657 ± 0.449
1.97HisPhe: 1.97 ± 0.417
1.314HisGly: 1.314 ± 0.274
0.0HisHis: 0.0 ± 0.0
1.314HisIle: 1.314 ± 0.274
0.985HisLys: 0.985 ± 0.21
1.97HisLeu: 1.97 ± 0.575
0.0HisMet: 0.0 ± 0.0
1.314HisAsn: 1.314 ± 0.749
1.97HisPro: 1.97 ± 0.428
0.0HisGln: 0.0 ± 0.0
0.657HisArg: 0.657 ± 0.342
0.985HisSer: 0.985 ± 0.489
0.328HisThr: 0.328 ± 0.379
3.284HisVal: 3.284 ± 0.799
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
2.299IleAla: 2.299 ± 0.941
1.97IleCys: 1.97 ± 0.691
0.328IleAsp: 0.328 ± 0.298
0.985IleGlu: 0.985 ± 0.509
0.328IlePhe: 0.328 ± 0.223
0.985IleGly: 0.985 ± 0.74
0.657IleHis: 0.657 ± 0.376
0.657IleIle: 0.657 ± 0.685
1.314IleLys: 1.314 ± 0.793
4.926IleLeu: 4.926 ± 0.83
0.328IleMet: 0.328 ± 0.223
2.627IleAsn: 2.627 ± 1.15
3.612IlePro: 3.612 ± 0.573
1.314IleGln: 1.314 ± 0.351
4.269IleArg: 4.269 ± 1.549
6.568IleSer: 6.568 ± 0.878
1.97IleThr: 1.97 ± 0.711
0.657IleVal: 0.657 ± 0.282
0.0IleTrp: 0.0 ± 0.0
0.328IleTyr: 0.328 ± 0.298
0.0IleXaa: 0.0 ± 0.0
Lys
5.911LysAla: 5.911 ± 1.108
0.657LysCys: 0.657 ± 0.282
2.956LysAsp: 2.956 ± 0.444
1.97LysGlu: 1.97 ± 0.447
1.642LysPhe: 1.642 ± 0.445
3.941LysGly: 3.941 ± 0.48
0.657LysHis: 0.657 ± 0.597
2.299LysIle: 2.299 ± 0.594
1.314LysLys: 1.314 ± 0.416
4.269LysLeu: 4.269 ± 0.672
1.642LysMet: 1.642 ± 0.296
1.314LysAsn: 1.314 ± 0.56
2.627LysPro: 2.627 ± 0.553
3.612LysGln: 3.612 ± 0.627
1.97LysArg: 1.97 ± 0.73
4.269LysSer: 4.269 ± 0.983
3.284LysThr: 3.284 ± 0.561
2.299LysVal: 2.299 ± 0.586
1.642LysTrp: 1.642 ± 0.528
0.985LysTyr: 0.985 ± 0.461
0.328LysXaa: 0.328 ± 0.298
Leu
4.269LeuAla: 4.269 ± 0.807
2.299LeuCys: 2.299 ± 0.89
2.299LeuAsp: 2.299 ± 0.597
5.911LeuGlu: 5.911 ± 0.94
5.911LeuPhe: 5.911 ± 1.488
1.97LeuGly: 1.97 ± 0.428
1.97LeuHis: 1.97 ± 0.691
2.299LeuIle: 2.299 ± 0.58
3.284LeuLys: 3.284 ± 0.461
7.553LeuLeu: 7.553 ± 2.113
0.985LeuMet: 0.985 ± 0.404
3.612LeuAsn: 3.612 ± 0.648
5.583LeuPro: 5.583 ± 1.623
2.956LeuGln: 2.956 ± 0.669
4.269LeuArg: 4.269 ± 0.453
10.837LeuSer: 10.837 ± 1.058
5.911LeuThr: 5.911 ± 0.94
8.539LeuVal: 8.539 ± 2.266
3.284LeuTrp: 3.284 ± 1.114
4.269LeuTyr: 4.269 ± 0.764
0.0LeuXaa: 0.0 ± 0.0
Met
1.314MetAla: 1.314 ± 0.466
0.0MetCys: 0.0 ± 0.0
1.642MetAsp: 1.642 ± 0.747
1.314MetGlu: 1.314 ± 0.764
0.0MetPhe: 0.0 ± 0.0
0.985MetGly: 0.985 ± 0.475
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.985MetLys: 0.985 ± 0.21
1.314MetLeu: 1.314 ± 0.351
0.985MetMet: 0.985 ± 0.21
0.985MetAsn: 0.985 ± 0.667
0.657MetPro: 0.657 ± 0.456
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
1.97MetSer: 1.97 ± 1.048
0.657MetThr: 0.657 ± 0.376
2.299MetVal: 2.299 ± 0.429
0.328MetTrp: 0.328 ± 0.298
0.328MetTyr: 0.328 ± 0.298
0.0MetXaa: 0.0 ± 0.0
Asn
1.642AsnAla: 1.642 ± 0.589
0.328AsnCys: 0.328 ± 0.379
0.657AsnAsp: 0.657 ± 0.255
0.985AsnGlu: 0.985 ± 0.404
1.314AsnPhe: 1.314 ± 0.274
5.911AsnGly: 5.911 ± 1.709
0.657AsnHis: 0.657 ± 0.282
1.314AsnIle: 1.314 ± 1.009
2.627AsnLys: 2.627 ± 0.817
2.956AsnLeu: 2.956 ± 0.534
0.0AsnMet: 0.0 ± 0.0
1.97AsnAsn: 1.97 ± 0.554
2.627AsnPro: 2.627 ± 1.33
1.642AsnGln: 1.642 ± 0.455
3.941AsnArg: 3.941 ± 0.725
4.269AsnSer: 4.269 ± 0.855
3.941AsnThr: 3.941 ± 0.573
0.0AsnVal: 0.0 ± 0.0
1.642AsnTrp: 1.642 ± 0.398
2.956AsnTyr: 2.956 ± 0.424
0.0AsnXaa: 0.0 ± 0.0
Pro
2.956ProAla: 2.956 ± 0.448
0.657ProCys: 0.657 ± 0.282
2.956ProAsp: 2.956 ± 0.754
2.627ProGlu: 2.627 ± 0.874
2.299ProPhe: 2.299 ± 0.65
5.583ProGly: 5.583 ± 1.254
1.642ProHis: 1.642 ± 0.445
1.642ProIle: 1.642 ± 0.398
3.284ProLys: 3.284 ± 0.846
5.255ProLeu: 5.255 ± 1.055
0.657ProMet: 0.657 ± 0.282
1.314ProAsn: 1.314 ± 0.767
9.524ProPro: 9.524 ± 2.921
4.926ProGln: 4.926 ± 1.001
5.911ProArg: 5.911 ± 1.807
8.539ProSer: 8.539 ± 1.305
2.627ProThr: 2.627 ± 0.512
4.269ProVal: 4.269 ± 0.604
0.657ProTrp: 0.657 ± 0.255
0.657ProTyr: 0.657 ± 0.376
0.0ProXaa: 0.0 ± 0.0
Gln
3.941GlnAla: 3.941 ± 0.845
0.328GlnCys: 0.328 ± 0.298
0.328GlnAsp: 0.328 ± 0.379
3.612GlnGlu: 3.612 ± 0.834
3.284GlnPhe: 3.284 ± 1.027
1.314GlnGly: 1.314 ± 0.274
0.985GlnHis: 0.985 ± 0.353
0.985GlnIle: 0.985 ± 0.475
3.284GlnLys: 3.284 ± 0.894
2.956GlnLeu: 2.956 ± 1.113
0.328GlnMet: 0.328 ± 0.223
2.627GlnAsn: 2.627 ± 0.605
3.284GlnPro: 3.284 ± 0.723
1.642GlnGln: 1.642 ± 0.839
1.97GlnArg: 1.97 ± 0.364
2.299GlnSer: 2.299 ± 0.4
1.642GlnThr: 1.642 ± 0.61
1.642GlnVal: 1.642 ± 0.607
0.657GlnTrp: 0.657 ± 0.759
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
4.598ArgAla: 4.598 ± 1.675
1.314ArgCys: 1.314 ± 0.56
1.314ArgAsp: 1.314 ± 0.51
4.269ArgGlu: 4.269 ± 0.453
3.612ArgPhe: 3.612 ± 1.328
3.612ArgGly: 3.612 ± 0.834
1.642ArgHis: 1.642 ± 0.758
4.269ArgIle: 4.269 ± 1.464
1.314ArgLys: 1.314 ± 0.558
6.568ArgLeu: 6.568 ± 1.475
0.657ArgMet: 0.657 ± 0.804
4.598ArgAsn: 4.598 ± 1.35
4.926ArgPro: 4.926 ± 1.054
0.657ArgGln: 0.657 ± 0.449
13.136ArgArg: 13.136 ± 5.701
5.255ArgSer: 5.255 ± 1.278
2.299ArgThr: 2.299 ± 0.927
2.299ArgVal: 2.299 ± 0.876
1.314ArgTrp: 1.314 ± 0.644
0.328ArgTyr: 0.328 ± 0.223
0.0ArgXaa: 0.0 ± 0.0
Ser
6.897SerAla: 6.897 ± 1.929
0.657SerCys: 0.657 ± 0.342
2.627SerAsp: 2.627 ± 1.27
6.897SerGlu: 6.897 ± 1.447
4.598SerPhe: 4.598 ± 0.591
9.524SerGly: 9.524 ± 0.789
1.314SerHis: 1.314 ± 0.274
5.255SerIle: 5.255 ± 1.218
5.583SerLys: 5.583 ± 0.85
9.852SerLeu: 9.852 ± 2.189
2.627SerMet: 2.627 ± 0.839
3.941SerAsn: 3.941 ± 0.598
4.926SerPro: 4.926 ± 0.746
2.956SerGln: 2.956 ± 0.998
6.24SerArg: 6.24 ± 1.782
17.077SerSer: 17.077 ± 3.713
6.897SerThr: 6.897 ± 1.304
4.926SerVal: 4.926 ± 1.132
2.627SerTrp: 2.627 ± 0.846
3.284SerTyr: 3.284 ± 0.944
0.0SerXaa: 0.0 ± 0.0
Thr
3.612ThrAla: 3.612 ± 0.908
1.642ThrCys: 1.642 ± 0.398
1.97ThrAsp: 1.97 ± 0.509
1.97ThrGlu: 1.97 ± 0.494
1.97ThrPhe: 1.97 ± 1.108
3.284ThrGly: 3.284 ± 1.022
0.657ThrHis: 0.657 ± 0.376
4.269ThrIle: 4.269 ± 0.979
3.941ThrLys: 3.941 ± 0.654
4.598ThrLeu: 4.598 ± 0.757
0.985ThrMet: 0.985 ± 0.353
2.299ThrAsn: 2.299 ± 0.744
3.612ThrPro: 3.612 ± 1.275
2.627ThrGln: 2.627 ± 0.311
1.97ThrArg: 1.97 ± 0.644
4.926ThrSer: 4.926 ± 0.83
5.255ThrThr: 5.255 ± 1.97
4.926ThrVal: 4.926 ± 0.66
0.657ThrTrp: 0.657 ± 0.282
0.328ThrTyr: 0.328 ± 0.298
0.0ThrXaa: 0.0 ± 0.0
Val
5.255ValAla: 5.255 ± 1.325
0.0ValCys: 0.0 ± 0.0
4.269ValAsp: 4.269 ± 0.752
3.612ValGlu: 3.612 ± 0.636
2.627ValPhe: 2.627 ± 0.333
5.583ValGly: 5.583 ± 1.11
1.314ValHis: 1.314 ± 0.466
2.627ValIle: 2.627 ± 0.543
1.314ValLys: 1.314 ± 0.274
4.598ValLeu: 4.598 ± 1.118
1.314ValMet: 1.314 ± 0.432
1.97ValAsn: 1.97 ± 0.358
5.583ValPro: 5.583 ± 1.262
1.97ValGln: 1.97 ± 0.456
1.97ValArg: 1.97 ± 0.767
5.583ValSer: 5.583 ± 1.247
2.627ValThr: 2.627 ± 0.427
5.911ValVal: 5.911 ± 1.525
1.97ValTrp: 1.97 ± 0.847
0.657ValTyr: 0.657 ± 0.255
0.0ValXaa: 0.0 ± 0.0
Trp
1.642TrpAla: 1.642 ± 0.692
0.0TrpCys: 0.0 ± 0.0
0.328TrpAsp: 0.328 ± 0.298
0.328TrpGlu: 0.328 ± 0.298
0.328TrpPhe: 0.328 ± 0.298
1.97TrpGly: 1.97 ± 0.624
0.657TrpHis: 0.657 ± 0.449
0.328TrpIle: 0.328 ± 0.379
0.657TrpLys: 0.657 ± 0.282
1.642TrpLeu: 1.642 ± 0.572
0.328TrpMet: 0.328 ± 0.223
2.627TrpAsn: 2.627 ± 0.57
1.314TrpPro: 1.314 ± 0.624
0.985TrpGln: 0.985 ± 0.43
2.627TrpArg: 2.627 ± 0.344
2.627TrpSer: 2.627 ± 0.98
0.985TrpThr: 0.985 ± 0.404
0.328TrpVal: 0.328 ± 0.223
0.0TrpTrp: 0.0 ± 0.0
0.657TrpTyr: 0.657 ± 0.282
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.97TyrAla: 1.97 ± 0.86
0.328TyrCys: 0.328 ± 0.298
0.328TyrAsp: 0.328 ± 0.298
2.299TyrGlu: 2.299 ± 0.693
0.985TyrPhe: 0.985 ± 0.313
1.642TyrGly: 1.642 ± 0.271
1.314TyrHis: 1.314 ± 0.451
0.0TyrIle: 0.0 ± 0.0
2.627TyrLys: 2.627 ± 1.12
3.284TyrLeu: 3.284 ± 0.936
0.328TyrMet: 0.328 ± 0.342
1.97TyrAsn: 1.97 ± 0.66
1.314TyrPro: 1.314 ± 0.336
1.314TyrGln: 1.314 ± 0.592
0.328TyrArg: 0.328 ± 0.223
3.284TyrSer: 3.284 ± 0.484
1.642TyrThr: 1.642 ± 0.528
0.985TyrVal: 0.985 ± 0.43
0.328TyrTrp: 0.328 ± 0.379
0.657TyrTyr: 0.657 ± 0.342
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.328XaaVal: 0.328 ± 0.298
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3046 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski