Skip to content

v1.4.0

Compare
Choose a tag to compare
@ZiMengSheng ZiMengSheng released this 15 Jan 04:22
· 27 commits to release-1.4 since this release
c70f410

What's Changed

  • koordlet: resource qos config supports SYSTEM qos by @saintube in #1556
  • koord-manager: Add configurable mutating pod update and normalize webhook name by @FillZpp in #1562
  • koord-manager: only enable necessary plugin in the ut of resource calculator by @zwzhang0107 in #1564
  • koord-manager: allow quotas that are not bound to pods to become parent nodes by @xulinfei1996 in #1553
  • apis: add cold memory collection API by @BUPT-wxq in #1561
  • koord-scheduler: fix quota onPodUpdate by @shaloulcy in #1565
  • koord-scheduler: skip checking for nodes whose nodemetrics update time has expired by @lucming in #1563
  • Fix webhook patch manifests for pod update mutation by @FillZpp in #1575
  • proposal: Eviction Arbitration Mechanism in Descheduler by @baowj-678 in #1454
  • koordlet: fix lose precision when show result of cpuspress by @lucming in #1572
  • koord-descheduler: Add Arbitration Mechanism config by @baowj-678 in #1571
  • koord-scheduler: fix missing IsParent when rebuild tree by @shaloulcy in #1576
  • koordlet: fix error in perf collector by @bowen-intel in #1569
  • koordlet: fix http server with default pprof by @FillZpp in #1577
  • apis: optimize extension APIs layout by @eahydra in #1579
  • apis: add NUMA-aware topology policies by @eahydra in #1581
  • ci: initialize e2e workflow by @saintube in #1573
  • koordlet: revise nri server start by @saintube in #1578
  • koordlet: fix NUMA info reporting by @saintube in #1587
  • koord-scheduler: add numa topology manager by @eahydra in #1584
  • apis: update ResourceStatus to support NUMA-aware scheduling by @eahydra in #1588
  • proposal: Node Resource Amplification by @zqzten in #1549
  • koord-manager: improve batch resource calculation with system metrics by @saintube in #1559
  • chore(deps): bump helm/kind-action from 1.5.0 to 1.8.0 by @dependabot in #1590
  • apis: add cpu normalization ratio and cpu basic info annotations by @saintube in #1591
  • koord-scheduler: move nodes to GroupQuotaManager by @shaloulcy in #1595
  • koord-scheduler: export koordinator-root-quota as CRD object by @tan90github in #1555
  • koord-scheduler: deviceshare plugin skips handling nodes without device cr by @lucming in #1594
  • koord-scheduler: support NUMA-aware scheduling by @eahydra in #1592
  • apis: add elastic quota profile by @shaloulcy in #1596
  • apis: remove omitempty tag of Device's field health by @eahydra in #1601
  • koord-manager: cleanup node.alloc when device cr have been deleted by @lucming in #1597
  • apis: add HostApplications for out-of-band application by @zwzhang0107 in #1508
  • koord-scheduler: fix runtime quota when quota is not allow lent resource by @shaloulcy in #1583
  • koord-scheduler: supports plugins to adjust reservation reusable resources by @eahydra in #1600
  • Add multi quota tree by @shaloulcy in #1603
  • koord-scheduler: ignore non-existing resources in NUMA Node by @eahydra in #1607
  • koord-scheduler: update NodeNUMAResource plugin args by @eahydra in #1610
  • koord-scheduler: support reservation score normalization by @eahydra in #1614
  • koord-scheduler: update Node capacity by profile by @shaloulcy in #1612
  • koord-scheduler: optimize NUMA Topology Manager interface by @eahydra in #1611
  • apis: mark device health defaults to false by @eahydra in #1619
  • chore(deps): bump actions/checkout from 3 to 4 by @dependabot in #1618
  • chore(deps): bump actions/setup-go from 3 to 4 by @dependabot in #1589
  • koord-manager: add quota profile controller by @shaloulcy in #1605
  • koord-scheduler: numa-aware scheduling supports scoring by @eahydra in #1609
  • koord-manager: fix role for elasticquotaprofile by @shaloulcy in #1620
  • koord-descheduler: add Arbitrator by @baowj-678 in #1586
  • koord-scheduler: add annotation about namespaces that limit quota usage by @xulinfei1996 in #1566
  • apis: introduce api for node resource amplification by @zqzten in #1622
  • koordlet: report CPUBasicInfo, adjust normalized cpu cgroups by @saintube in #1604
  • koord-manager: support cpu normalization by @saintube in #1599
  • koord-scheduler: support required CPU bind policy by @eahydra in #1628
  • ci: fix koord-manager role and ignore errors in e2e job by @saintube in #1630
  • koord-scheduler: update quota and pod handle for elasticquotaprofile by @shaloulcy in #1621
  • ci: remove quota profile rbac role for koord-scheduler by @shaloulcy in #1632
  • koordlet: report be cpu share pool and set cpuset if specified by @zwzhang0107 in #1626
  • koord-descheduler: add some SortFns for arbitration by @baowj-678 in #1625
  • ci: fix quota webhook verb by @shaloulcy in #1635
  • koord-scheduler: ElasticQuota plugin implements EnqueueExtensions by @eahydra in #1637
  • koord-scheduler: add ElasticQuotaIgnoreOverhead feature-gate by @eahydra in #1636
  • koord-scheduler: framework extension executes handler after handling scheduling errors by @eahydra in #1633
  • koordlet: report NRT zone resources mergeable by @saintube in #1640
  • chore(deps): bump docker/setup-buildx-action from 2 to 3 by @dependabot in #1648
  • koordlet: support qos for out-of-band applications by @zwzhang0107 in #1638
  • koord-scheduler: NodeNUMAResource only consider allocatable when generating hints by @eahydra in #1647
  • koordlet: add libpfm4&perf group by @bowen-intel in #1554
  • koordlet: revise BECPUEvict, RdtResctrl, CPUSetAllocator and system collector by @saintube in #1641
  • koordlet: add perf cgroup unsupported by @zwzhang0107 in #1653
  • koord-manager: fix LessThanOrEqual in elastic quota by @shaloulcy in #1660
  • koord-scheduler: revise NodeNUMAResource by @eahydra in #1657
  • Add new adopter by @pheianox in #1642
  • koordlet: fix resource qos rules missing LSE by @saintube in #1663
  • koord-manager: add NUMA-level batch resources by @saintube in #1631
  • Makefile: fix libpfm4 make test by @bowen-intel in #1661
  • Makefile: exit for macOS by @bowen-intel in #1664
  • koord-scheduler: add ElasticQuota Guaranteed usage by @shaloulcy in #1645
  • koord-scheduler: export min/max getters of QuotaInfo as public by @eahydra in #1666
  • add DisableDefaultQuota featuregate by @shaloulcy in #1665
  • koord-manager: calculate batch resource with kubelet reserved by @zwzhang0107 in #1667
  • chore(deps): bump docker/login-action from 2 to 3 by @dependabot in #1650
  • koord-scheduler: optimize NodeNUMAResource scoring by @eahydra in #1669
  • chore(deps): bump docker/build-push-action from 4 to 5 by @dependabot in #1649
  • koord-scheduler: LoadAware supports node amplification ratios by @jasonliu747 in #1673
  • koord-scheduler: update elastic quota endpoint by @jasonliu747 in #1674
  • koordlet: add cold memory collection and reporting by @BUPT-wxq in #1574
  • koord-scheduler: inplace update quota min and max by @shaloulcy in #1672
  • koord-scheduler: NodeNUMAResource supports amplification ratios by @eahydra in #1671
  • koord-scheduler: add e2e case for multi quota tree by @shaloulcy in #1675
  • koord-scheduler: fix root quota when deleted by @shaloulcy in #1678
  • koord-manager: fix quota profile resource ratio by @shaloulcy in #1679
  • koord-scheduler: NodeNUMAResource supports filtering/scoring with node-level amplification ratios by @eahydra in #1677
  • koord-scheduler: NodeNUMAResource scoring fix and tweak with node-level amplification ratios by @zqzten in #1683
  • e2e: add basic quota e2e cases by @shaloulcy in #1685
  • koord-scheduler: fix NodeNUMAResource allocate CPUSets on mixed CPUSet/Share NUMA Node by @eahydra in #1682
  • koordlet: revise logs in be suppress and batch resource by @saintube in #1684
  • Fix runtime proxy compatibility with CRI interface version by @hormes in #1593
  • refactor codes to help readability by @hormes in #1687
  • koord-descheduler: add arbitration to migration controller by @baowj-678 in #1651
  • koord-scheduler: add decorator for quota by @shaloulcy in #1686
  • koord-scheduler: optimize elastic quota log by @shaloulcy in #1690
  • webhook: revise the validation of Pod resources by @eahydra in #1693
  • scheduler: optimize reservation BeforePreFilter performance by @eahydra in #1695
  • dockerfile: support multiarch image build by @saintube in #1697
  • ci: add more slo controllers e2e test by @saintube in #1688
  • scheduler: modify coscheduling Less function with considering childScheduleCycle by @buptcozy in #1702
  • proposal: change node resource amplification from api to scheduling level by @zqzten in #1700
  • koord-scheduler: Add non-preemptible feature in Elastic Quota Management by @tan90github in #1668
  • chore(deps): bump goreleaser/goreleaser-action from 4 to 5 by @dependabot in #1643
  • scheduler: quota debug api support multi tree by @shaloulcy in #1706
  • ci: support loading all components image in E2E by @saintube in #1705
  • scheduler: make ReservationInfo.AddAssignedPod idempotent by @eahydra in #1709
  • webhook: add force update label for elastic quota by @shaloulcy in #1708
  • manager: add GetNodeAllocatable in quota-controller by @shaloulcy in #1710
  • koordlet: fix metric name typo by @saintube in #1712
  • scheduler: use asynclog to reduce klog performance overhead by @eahydra in #1694
  • scheduler: reservation senses node affinity by @eahydra in #1713
  • scheduler: optimize memory allocations by @eahydra in #1714
  • koord-manager: enhance batch resource configuration and updating by @saintube in #1703
  • scheduler: add elastic quota metric by @shaloulcy in #1716
  • scheduler: reservation transformer ends early with reservation affinity by @eahydra in #1717
  • apis: make PCIEID of DeviceTopology as string type by @ZiMengSheng in #1718
  • koordlet: cold memory fix usageWithHotPage and add usageWithPageCache by @BUPT-wxq in #1699
  • chores: update outdated dingtak qr code by @zwzhang0107 in #1719
  • descheduler: check nodemetrics cr is expired or not when descheduling by @lucming in #1721
  • pkg/koordlet: update some log level to silence log by @zxh326 in #1723
  • ci: disable codeql in github action workflow by @jasonliu747 in #1731
  • scheduler: numa-aware scheduling supports selecting numa node by score by @eahydra in #1726
  • scheduler: revise numa-aware hints generator by @eahydra in #1732
  • apis: add host application metric to node metric crd by @zwzhang0107 in #1725
  • koordlet: report host application resource usage for calculating batc… by @zwzhang0107 in #1733
  • scheduler: update reservationInfo ownerMatchers if reservation updated by @eahydra in #1740
  • scheduler: leaky resources in ReservationInfo when pod failed to bind by @eahydra in #1742
  • apis: add core sched apis by @saintube in #1720
  • scheduler: fix missing update quota when pod is in DeletedFinalStateUnknown state by @qinfustu in #1737
  • koordlet: fix prediction restore for node priorities by @saintube in #1749
  • slo-controller: supppor third party allocator for batch resource calculation by @zwzhang0107 in #1739
  • koord-manager: get node batch allocatable from extended resource by @zwzhang0107 in #1752
  • proposal: CPU Normalization by @zqzten in #1598
  • scheduler: Add non-preemptible Request in quotaInfo's QuotaCalculateInfo by @xulinfei1996 in #1755
  • koord-manager: improve batch resource calculation by @saintube in #1751
  • api: add koordinator.sh/gpu.shared by @ZiMengSheng in #1763
  • feat(deps): bump github.com/golang/protobuf from 1.5.2 to 1.5.3 by @dependabot in #1623
  • koordlet: support pouch container by @zwzhang0107 in #1761
  • scheduler: verify quota according to resource dimensions required by pod by @qinfustu in #1753
  • proposal: add device allocation apis by @eahydra in #1529
  • apis: add device allocate hints by @eahydra in #1766
  • chore: fix koordinator typo by @ikaven1024 in #1769
  • koordlet: add page cache collector unit test and mend default cold memory config by @BUPT-wxq in #1738
  • scheduler: DeviceShare supports allocating by hints by @eahydra in #1768
  • chore(deps): bump actions/setup-go from 4 to 5 by @dependabot in #1771
  • koordlet: report gpu device topology by @eahydra in #1775
  • koordlet: support hugepage reporting by @peiqiaoWang in #1744
  • scheduler: fix redundant argument when create a unschedulable status. by @bogo-y in #1782
  • scheduler: DeviceShare supports numa-aware scheduling by @eahydra in #1774
  • scheduler: fix DeviceShare concurrent map read and map write by @jasonliu747 in #1788
  • scheduler: Coscheduling skips check schedule cycle if pod has nominated node by @xulinfei1996 in #1790
  • scheduler: DeviceShare skips numa-aware if no device requests by @jasonliu747 in #1791
  • koordlet: fix lscpu for arm64 by @zwzhang0107 in #1792
  • koordlet: revise system qos and numa aware cpuset by @saintube in #1793
  • koordlet: fix lscpu for arm ut by @zwzhang0107 in #1794
  • colocationprofile: support mutating pod labels and annotations with mapping by @saintube in #1781
  • chores: add args for Makefile by @zwzhang0107 in #1795
  • scheduler: Reservation supports preempting permitted pods by @eahydra in #1800
  • scheduler: fix the calculation method in PreFilterExtension by @eahydra in #1802
  • scheduler: FilterReservation/ScoreReservation perceives preemptible states by @eahydra in #1803
  • scheduler: support DeviceHandler to selectively generate requests and replicas by @eahydra in #1806
  • scheduler: optimize DeviceHandler interface by @eahydra in #1808
  • scheduler: enhance ReservationNominator to support preemption by @eahydra in #1804
  • scheduler: fix preemption panic by @eahydra in #1810
  • koordlet: support core sched cookie management by @saintube in #1722
  • scheduler: change reservation default allocate policy to aligned by @eahydra in #1814
  • scheduler: remove useless code from Reservation plugin by @eahydra in #1815
  • chores: replace shimo doc to dingtalk by @kangclzjc in #1813
  • scheduler: add monitor to detect scheduling deadlock by @eahydra in #1817
  • feat(deps): bump go.uber.org/atomic from 1.10.0 to 1.11.0 by @dependabot in #1778
  • koordlet: fix be cpuset for numaResource by @saintube in #1818
  • chore: only require security updates and exclude version updates by @jasonliu747 in #1823
  • chores: update bi-weekly meeting notes link by @ZiMengSheng in #1824
  • slo-controller: Introduce node resource amplification plugin by @zqzten in #1796
  • scheduler: refine log in device share plugin by @jasonliu747 in #1826
  • scheduler: fix skip logic in reservation plugin by @jasonliu747 in #1827
  • chore: fix some unlinted errors by @saintube in #1811
  • webhook: add node webhook for resource amplification by @hormes in #1785
  • koordlet: fix RDT cache id issue by @bowen-intel in #1789
  • docs: support NetworkQoS based on external plugin such as terway-qos by @lucming in #1779
  • koordlet: fix memory leak in perfGroup by @bowen-intel in #1828
  • koordlet: fix typo in function isBECPUUsageHighEnough by @j4ckstraw in #1830
  • koordlet: fix by adding lock during podmap update by @xujihui1985 in #1838

New Contributors

Full Changelog: v1.3.0...v1.4.0