Amino acid dipepetide frequency for Blackberry chlorotic ringspot virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.79AlaAla: 6.79 ± 2.47
1.509AlaCys: 1.509 ± 0.589
4.527AlaAsp: 4.527 ± 1.448
2.263AlaGlu: 2.263 ± 0.419
3.018AlaPhe: 3.018 ± 1.296
2.641AlaGly: 2.641 ± 0.632
1.132AlaHis: 1.132 ± 0.449
2.641AlaIle: 2.641 ± 0.934
4.149AlaLys: 4.149 ± 2.601
6.79AlaLeu: 6.79 ± 1.694
3.395AlaMet: 3.395 ± 1.419
2.641AlaAsn: 2.641 ± 1.289
2.641AlaPro: 2.641 ± 0.934
0.754AlaGln: 0.754 ± 0.489
2.263AlaArg: 2.263 ± 0.54
5.281AlaSer: 5.281 ± 1.172
4.527AlaThr: 4.527 ± 0.75
5.281AlaVal: 5.281 ± 1.169
0.754AlaTrp: 0.754 ± 0.489
1.132AlaTyr: 1.132 ± 0.531
0.0AlaXaa: 0.0 ± 0.0
Cys
2.641CysAla: 2.641 ± 0.816
0.754CysCys: 0.754 ± 0.489
1.886CysAsp: 1.886 ± 0.823
1.132CysGlu: 1.132 ± 0.373
0.754CysPhe: 0.754 ± 0.503
1.886CysGly: 1.886 ± 0.823
0.754CysHis: 0.754 ± 0.489
0.754CysIle: 0.754 ± 0.295
0.0CysLys: 0.0 ± 0.0
2.641CysLeu: 2.641 ± 0.973
0.0CysMet: 0.0 ± 0.0
0.754CysAsn: 0.754 ± 0.503
1.509CysPro: 1.509 ± 0.74
0.377CysGln: 0.377 ± 0.245
0.377CysArg: 0.377 ± 0.303
1.132CysSer: 1.132 ± 0.655
1.132CysThr: 1.132 ± 0.647
3.395CysVal: 3.395 ± 1.069
0.0CysTrp: 0.0 ± 0.0
0.754CysTyr: 0.754 ± 0.489
0.0CysXaa: 0.0 ± 0.0
Asp
5.658AspAla: 5.658 ± 1.047
1.886AspCys: 1.886 ± 0.551
5.281AspAsp: 5.281 ± 2.214
4.904AspGlu: 4.904 ± 1.076
4.149AspPhe: 4.149 ± 0.765
5.658AspGly: 5.658 ± 2.222
1.132AspHis: 1.132 ± 0.724
3.395AspIle: 3.395 ± 0.644
4.904AspLys: 4.904 ± 1.362
6.035AspLeu: 6.035 ± 1.74
0.754AspMet: 0.754 ± 0.606
1.886AspAsn: 1.886 ± 1.061
2.641AspPro: 2.641 ± 0.75
2.263AspGln: 2.263 ± 0.971
1.886AspArg: 1.886 ± 0.889
6.035AspSer: 6.035 ± 1.264
3.018AspThr: 3.018 ± 1.294
7.922AspVal: 7.922 ± 1.602
0.754AspTrp: 0.754 ± 0.51
3.018AspTyr: 3.018 ± 1.664
0.0AspXaa: 0.0 ± 0.0
Glu
6.035GluAla: 6.035 ± 2.442
1.886GluCys: 1.886 ± 0.606
3.772GluAsp: 3.772 ± 0.959
4.149GluGlu: 4.149 ± 0.715
1.509GluPhe: 1.509 ± 0.595
1.132GluGly: 1.132 ± 0.655
2.263GluHis: 2.263 ± 0.629
4.149GluIle: 4.149 ± 0.943
4.527GluLys: 4.527 ± 1.406
5.658GluLeu: 5.658 ± 1.462
1.132GluMet: 1.132 ± 0.531
1.886GluAsn: 1.886 ± 0.418
2.263GluPro: 2.263 ± 1.084
0.377GluGln: 0.377 ± 0.245
3.772GluArg: 3.772 ± 1.424
2.641GluSer: 2.641 ± 2.051
4.149GluThr: 4.149 ± 1.051
5.281GluVal: 5.281 ± 1.518
1.132GluTrp: 1.132 ± 0.546
1.132GluTyr: 1.132 ± 0.546
0.0GluXaa: 0.0 ± 0.0
Phe
1.886PheAla: 1.886 ± 0.495
0.377PheCys: 0.377 ± 0.245
3.018PheAsp: 3.018 ± 1.28
3.395PheGlu: 3.395 ± 0.991
1.132PhePhe: 1.132 ± 0.546
2.263PheGly: 2.263 ± 0.492
0.754PheHis: 0.754 ± 0.295
2.263PheIle: 2.263 ± 0.765
2.263PheLys: 2.263 ± 1.091
4.527PheLeu: 4.527 ± 1.474
0.377PheMet: 0.377 ± 0.245
3.772PheAsn: 3.772 ± 1.474
2.263PhePro: 2.263 ± 0.456
1.509PheGln: 1.509 ± 0.43
4.149PheArg: 4.149 ± 1.004
2.263PheSer: 2.263 ± 0.765
1.509PheThr: 1.509 ± 0.938
3.772PheVal: 3.772 ± 1.466
0.377PheTrp: 0.377 ± 0.534
1.132PheTyr: 1.132 ± 0.449
0.0PheXaa: 0.0 ± 0.0
Gly
0.377GlyAla: 0.377 ± 0.245
1.132GlyCys: 1.132 ± 0.583
4.149GlyAsp: 4.149 ± 0.32
1.886GlyGlu: 1.886 ± 1.524
2.263GlyPhe: 2.263 ± 0.492
1.886GlyGly: 1.886 ± 0.86
0.754GlyHis: 0.754 ± 0.489
1.886GlyIle: 1.886 ± 1.063
3.395GlyLys: 3.395 ± 0.499
4.904GlyLeu: 4.904 ± 1.402
0.754GlyMet: 0.754 ± 0.511
2.263GlyAsn: 2.263 ± 0.884
1.132GlyPro: 1.132 ± 0.546
1.132GlyGln: 1.132 ± 1.008
3.772GlyArg: 3.772 ± 0.679
3.395GlySer: 3.395 ± 1.117
1.886GlyThr: 1.886 ± 0.579
4.904GlyVal: 4.904 ± 0.733
0.377GlyTrp: 0.377 ± 0.554
0.754GlyTyr: 0.754 ± 0.295
0.0GlyXaa: 0.0 ± 0.0
His
1.132HisAla: 1.132 ± 0.531
0.377HisCys: 0.377 ± 0.245
1.886HisAsp: 1.886 ± 0.719
1.132HisGlu: 1.132 ± 0.449
0.754HisPhe: 0.754 ± 0.295
0.377HisGly: 0.377 ± 0.303
1.132HisHis: 1.132 ± 0.909
1.886HisIle: 1.886 ± 0.938
1.509HisLys: 1.509 ± 0.369
1.132HisLeu: 1.132 ± 0.467
1.886HisMet: 1.886 ± 0.525
1.132HisAsn: 1.132 ± 0.546
0.754HisPro: 0.754 ± 0.517
0.754HisGln: 0.754 ± 0.569
1.132HisArg: 1.132 ± 0.734
2.641HisSer: 2.641 ± 0.92
1.509HisThr: 1.509 ± 0.589
2.263HisVal: 2.263 ± 0.629
0.0HisTrp: 0.0 ± 0.0
0.377HisTyr: 0.377 ± 0.303
0.0HisXaa: 0.0 ± 0.0
Ile
4.149IleAla: 4.149 ± 1.639
1.132IleCys: 1.132 ± 0.909
4.149IleAsp: 4.149 ± 1.367
1.886IleGlu: 1.886 ± 0.504
1.509IlePhe: 1.509 ± 0.804
3.395IleGly: 3.395 ± 1.462
0.754IleHis: 0.754 ± 0.813
1.886IleIle: 1.886 ± 0.572
4.527IleLys: 4.527 ± 0.978
2.641IleLeu: 2.641 ± 0.844
0.377IleMet: 0.377 ± 0.245
1.886IleAsn: 1.886 ± 0.606
4.904IlePro: 4.904 ± 1.232
2.263IleGln: 2.263 ± 1.492
3.395IleArg: 3.395 ± 1.05
4.904IleSer: 4.904 ± 0.695
2.641IleThr: 2.641 ± 0.6
4.527IleVal: 4.527 ± 1.796
0.0IleTrp: 0.0 ± 0.0
1.132IleTyr: 1.132 ± 0.734
0.0IleXaa: 0.0 ± 0.0
Lys
2.641LysAla: 2.641 ± 0.887
0.754LysCys: 0.754 ± 0.295
2.641LysAsp: 2.641 ± 1.365
5.281LysGlu: 5.281 ± 1.57
4.904LysPhe: 4.904 ± 1.792
3.395LysGly: 3.395 ± 1.117
0.754LysHis: 0.754 ± 0.469
3.395LysIle: 3.395 ± 1.272
2.641LysLys: 2.641 ± 1.003
5.281LysLeu: 5.281 ± 0.984
1.509LysMet: 1.509 ± 0.683
3.018LysAsn: 3.018 ± 0.83
3.772LysPro: 3.772 ± 2.582
1.886LysGln: 1.886 ± 1.068
2.263LysArg: 2.263 ± 0.807
3.772LysSer: 3.772 ± 0.858
6.035LysThr: 6.035 ± 1.172
5.281LysVal: 5.281 ± 2.615
0.754LysTrp: 0.754 ± 0.295
1.886LysTyr: 1.886 ± 0.719
0.0LysXaa: 0.0 ± 0.0
Leu
5.281LeuAla: 5.281 ± 0.738
3.018LeuCys: 3.018 ± 0.735
5.658LeuAsp: 5.658 ± 0.471
4.904LeuGlu: 4.904 ± 1.485
2.641LeuPhe: 2.641 ± 0.895
2.641LeuGly: 2.641 ± 0.619
1.132LeuHis: 1.132 ± 0.546
4.527LeuIle: 4.527 ± 0.457
6.79LeuLys: 6.79 ± 1.075
6.79LeuLeu: 6.79 ± 0.715
3.772LeuMet: 3.772 ± 0.83
4.904LeuAsn: 4.904 ± 1.353
6.79LeuPro: 6.79 ± 1.475
2.263LeuGln: 2.263 ± 0.711
4.904LeuArg: 4.904 ± 1.416
8.299LeuSer: 8.299 ± 2.256
4.904LeuThr: 4.904 ± 1.615
5.658LeuVal: 5.658 ± 0.976
0.377LeuTrp: 0.377 ± 0.303
1.886LeuTyr: 1.886 ± 1.381
0.0LeuXaa: 0.0 ± 0.0
Met
1.132MetAla: 1.132 ± 0.531
0.377MetCys: 0.377 ± 0.303
2.263MetAsp: 2.263 ± 0.898
1.886MetGlu: 1.886 ± 0.875
1.132MetPhe: 1.132 ± 0.467
0.377MetGly: 0.377 ± 0.526
0.377MetHis: 0.377 ± 0.303
2.641MetIle: 2.641 ± 0.619
1.132MetLys: 1.132 ± 0.734
1.886MetLeu: 1.886 ± 0.537
1.509MetMet: 1.509 ± 0.483
0.754MetAsn: 0.754 ± 0.295
1.509MetPro: 1.509 ± 0.804
0.377MetGln: 0.377 ± 0.303
1.132MetArg: 1.132 ± 0.531
1.886MetSer: 1.886 ± 1.66
3.395MetThr: 3.395 ± 1.16
1.132MetVal: 1.132 ± 0.449
0.754MetTrp: 0.754 ± 1.053
0.754MetTyr: 0.754 ± 0.511
0.0MetXaa: 0.0 ± 0.0
Asn
2.641AsnAla: 2.641 ± 0.865
2.263AsnCys: 2.263 ± 0.993
2.641AsnAsp: 2.641 ± 1.026
3.018AsnGlu: 3.018 ± 0.482
2.263AsnPhe: 2.263 ± 0.492
1.886AsnGly: 1.886 ± 1.061
1.132AsnHis: 1.132 ± 0.449
1.509AsnIle: 1.509 ± 0.483
1.886AsnLys: 1.886 ± 0.579
6.413AsnLeu: 6.413 ± 1.415
1.132AsnMet: 1.132 ± 0.54
0.754AsnAsn: 0.754 ± 0.489
3.772AsnPro: 3.772 ± 0.803
1.509AsnGln: 1.509 ± 0.507
3.018AsnArg: 3.018 ± 1.28
2.263AsnSer: 2.263 ± 0.767
3.395AsnThr: 3.395 ± 1.443
4.527AsnVal: 4.527 ± 2.248
0.377AsnTrp: 0.377 ± 0.245
0.754AsnTyr: 0.754 ± 0.489
0.0AsnXaa: 0.0 ± 0.0
Pro
4.149ProAla: 4.149 ± 1.885
0.754ProCys: 0.754 ± 0.517
3.772ProAsp: 3.772 ± 1.209
4.527ProGlu: 4.527 ± 2.519
2.263ProPhe: 2.263 ± 0.789
2.641ProGly: 2.641 ± 1.033
1.886ProHis: 1.886 ± 0.495
4.527ProIle: 4.527 ± 0.899
3.772ProLys: 3.772 ± 2.861
3.395ProLeu: 3.395 ± 0.939
1.509ProMet: 1.509 ± 0.468
2.263ProAsn: 2.263 ± 0.973
1.132ProPro: 1.132 ± 1.0
1.132ProGln: 1.132 ± 0.404
3.395ProArg: 3.395 ± 1.086
3.018ProSer: 3.018 ± 0.556
3.018ProThr: 3.018 ± 1.25
3.395ProVal: 3.395 ± 1.887
0.0ProTrp: 0.0 ± 0.0
1.132ProTyr: 1.132 ± 0.546
0.0ProXaa: 0.0 ± 0.0
Gln
0.377GlnAla: 0.377 ± 0.534
0.0GlnCys: 0.0 ± 0.0
0.754GlnAsp: 0.754 ± 0.511
1.132GlnGlu: 1.132 ± 0.546
0.754GlnPhe: 0.754 ± 0.606
1.132GlnGly: 1.132 ± 0.449
0.0GlnHis: 0.0 ± 0.0
2.263GlnIle: 2.263 ± 0.791
0.754GlnLys: 0.754 ± 0.295
1.132GlnLeu: 1.132 ± 1.073
0.0GlnMet: 0.0 ± 0.0
0.377GlnAsn: 0.377 ± 0.303
1.886GlnPro: 1.886 ± 1.495
0.0GlnGln: 0.0 ± 0.0
3.772GlnArg: 3.772 ± 2.05
2.263GlnSer: 2.263 ± 1.426
0.754GlnThr: 0.754 ± 0.517
3.395GlnVal: 3.395 ± 0.73
0.377GlnTrp: 0.377 ± 0.245
2.641GlnTyr: 2.641 ± 0.92
0.0GlnXaa: 0.0 ± 0.0
Arg
4.904ArgAla: 4.904 ± 1.424
1.886ArgCys: 1.886 ± 0.579
5.281ArgAsp: 5.281 ± 1.397
2.263ArgGlu: 2.263 ± 1.126
1.509ArgPhe: 1.509 ± 0.483
2.263ArgGly: 2.263 ± 1.0
0.754ArgHis: 0.754 ± 0.489
2.641ArgIle: 2.641 ± 1.026
4.904ArgLys: 4.904 ± 1.362
7.167ArgLeu: 7.167 ± 1.725
1.509ArgMet: 1.509 ± 0.978
5.281ArgAsn: 5.281 ± 1.578
1.132ArgPro: 1.132 ± 0.724
0.754ArgGln: 0.754 ± 0.503
4.149ArgArg: 4.149 ± 1.342
4.904ArgSer: 4.904 ± 1.831
2.641ArgThr: 2.641 ± 0.632
4.527ArgVal: 4.527 ± 1.406
0.754ArgTrp: 0.754 ± 0.503
1.886ArgTyr: 1.886 ± 0.93
0.0ArgXaa: 0.0 ± 0.0
Ser
4.149SerAla: 4.149 ± 1.65
2.641SerCys: 2.641 ± 1.036
7.167SerAsp: 7.167 ± 1.44
3.018SerGlu: 3.018 ± 0.556
3.772SerPhe: 3.772 ± 1.135
4.149SerGly: 4.149 ± 1.214
1.886SerHis: 1.886 ± 0.719
3.395SerIle: 3.395 ± 0.651
4.149SerLys: 4.149 ± 1.691
7.544SerLeu: 7.544 ± 1.589
1.509SerMet: 1.509 ± 0.863
5.658SerAsn: 5.658 ± 2.022
2.641SerPro: 2.641 ± 1.574
1.509SerGln: 1.509 ± 0.492
4.904SerArg: 4.904 ± 0.831
6.79SerSer: 6.79 ± 1.143
1.886SerThr: 1.886 ± 0.791
4.904SerVal: 4.904 ± 1.232
1.886SerTrp: 1.886 ± 0.823
1.886SerTyr: 1.886 ± 0.719
0.0SerXaa: 0.0 ± 0.0
Thr
2.641ThrAla: 2.641 ± 0.864
0.377ThrCys: 0.377 ± 0.245
3.772ThrAsp: 3.772 ± 1.242
3.395ThrGlu: 3.395 ± 1.605
3.018ThrPhe: 3.018 ± 0.731
1.886ThrGly: 1.886 ± 0.579
3.395ThrHis: 3.395 ± 1.069
3.772ThrIle: 3.772 ± 0.994
3.772ThrLys: 3.772 ± 2.166
6.035ThrLeu: 6.035 ± 1.683
2.641ThrMet: 2.641 ± 0.701
3.018ThrAsn: 3.018 ± 0.934
1.509ThrPro: 1.509 ± 0.59
0.754ThrGln: 0.754 ± 0.51
4.149ThrArg: 4.149 ± 1.239
2.641ThrSer: 2.641 ± 1.003
4.527ThrThr: 4.527 ± 1.401
3.772ThrVal: 3.772 ± 1.108
1.132ThrTrp: 1.132 ± 0.373
1.886ThrTyr: 1.886 ± 0.719
0.0ThrXaa: 0.0 ± 0.0
Val
6.035ValAla: 6.035 ± 1.079
0.754ValCys: 0.754 ± 0.295
8.676ValAsp: 8.676 ± 1.539
7.922ValGlu: 7.922 ± 0.904
3.395ValPhe: 3.395 ± 0.644
2.641ValGly: 2.641 ± 0.4
2.263ValHis: 2.263 ± 0.611
3.018ValIle: 3.018 ± 0.855
4.527ValLys: 4.527 ± 1.147
4.149ValLeu: 4.149 ± 1.558
1.132ValMet: 1.132 ± 0.449
4.527ValAsn: 4.527 ± 1.486
8.299ValPro: 8.299 ± 2.147
2.263ValGln: 2.263 ± 0.789
4.904ValArg: 4.904 ± 0.695
7.167ValSer: 7.167 ± 0.971
5.281ValThr: 5.281 ± 0.82
6.035ValVal: 6.035 ± 1.132
1.509ValTrp: 1.509 ± 0.9
2.641ValTyr: 2.641 ± 0.663
0.0ValXaa: 0.0 ± 0.0
Trp
0.754TrpAla: 0.754 ± 0.503
0.754TrpCys: 0.754 ± 0.503
0.377TrpAsp: 0.377 ± 0.245
0.377TrpGlu: 0.377 ± 0.245
1.132TrpPhe: 1.132 ± 0.449
0.377TrpGly: 0.377 ± 0.303
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
1.132TrpLys: 1.132 ± 0.724
0.377TrpLeu: 0.377 ± 0.245
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.377TrpPro: 0.377 ± 0.526
0.0TrpGln: 0.0 ± 0.0
1.132TrpArg: 1.132 ± 0.449
0.754TrpSer: 0.754 ± 0.503
0.754TrpThr: 0.754 ± 0.511
2.263TrpVal: 2.263 ± 1.295
0.0TrpTrp: 0.0 ± 0.0
0.754TrpTyr: 0.754 ± 0.469
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.754TyrAla: 0.754 ± 0.489
0.0TyrCys: 0.0 ± 0.0
1.509TyrAsp: 1.509 ± 0.589
0.377TyrGlu: 0.377 ± 0.245
1.132TyrPhe: 1.132 ± 0.546
0.377TyrGly: 0.377 ± 0.526
1.509TyrHis: 1.509 ± 0.589
1.509TyrIle: 1.509 ± 0.682
1.132TyrLys: 1.132 ± 0.546
2.641TyrLeu: 2.641 ± 0.632
1.132TyrMet: 1.132 ± 0.449
0.754TyrAsn: 0.754 ± 0.489
0.754TyrPro: 0.754 ± 0.813
1.886TyrGln: 1.886 ± 0.727
2.641TyrArg: 2.641 ± 0.699
3.395TyrSer: 3.395 ± 1.347
1.132TyrThr: 1.132 ± 0.373
4.904TyrVal: 4.904 ± 1.37
0.0TyrTrp: 0.0 ± 0.0
0.754TyrTyr: 0.754 ± 0.295
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2652 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski