v1.4.0
ZiMengSheng
released this
15 Jan 04:22
·
27 commits
to release-1.4
since this release
What's Changed
- koordlet: resource qos config supports SYSTEM qos by @saintube in #1556
- koord-manager: Add configurable mutating pod update and normalize webhook name by @FillZpp in #1562
- koord-manager: only enable necessary plugin in the ut of resource calculator by @zwzhang0107 in #1564
- koord-manager: allow quotas that are not bound to pods to become parent nodes by @xulinfei1996 in #1553
- apis: add cold memory collection API by @BUPT-wxq in #1561
- koord-scheduler: fix quota onPodUpdate by @shaloulcy in #1565
- koord-scheduler: skip checking for nodes whose nodemetrics update time has expired by @lucming in #1563
- Fix webhook patch manifests for pod update mutation by @FillZpp in #1575
- proposal: Eviction Arbitration Mechanism in Descheduler by @baowj-678 in #1454
- koordlet: fix lose precision when show result of cpuspress by @lucming in #1572
- koord-descheduler: Add Arbitration Mechanism config by @baowj-678 in #1571
- koord-scheduler: fix missing IsParent when rebuild tree by @shaloulcy in #1576
- koordlet: fix error in perf collector by @bowen-intel in #1569
- koordlet: fix http server with default pprof by @FillZpp in #1577
- apis: optimize extension APIs layout by @eahydra in #1579
- apis: add NUMA-aware topology policies by @eahydra in #1581
- ci: initialize e2e workflow by @saintube in #1573
- koordlet: revise nri server start by @saintube in #1578
- koordlet: fix NUMA info reporting by @saintube in #1587
- koord-scheduler: add numa topology manager by @eahydra in #1584
- apis: update ResourceStatus to support NUMA-aware scheduling by @eahydra in #1588
- proposal: Node Resource Amplification by @zqzten in #1549
- koord-manager: improve batch resource calculation with system metrics by @saintube in #1559
- chore(deps): bump helm/kind-action from 1.5.0 to 1.8.0 by @dependabot in #1590
- apis: add cpu normalization ratio and cpu basic info annotations by @saintube in #1591
- koord-scheduler: move nodes to GroupQuotaManager by @shaloulcy in #1595
- koord-scheduler: export koordinator-root-quota as CRD object by @tan90github in #1555
- koord-scheduler: deviceshare plugin skips handling nodes without device cr by @lucming in #1594
- koord-scheduler: support NUMA-aware scheduling by @eahydra in #1592
- apis: add elastic quota profile by @shaloulcy in #1596
- apis: remove omitempty tag of Device's field health by @eahydra in #1601
- koord-manager: cleanup node.alloc when device cr have been deleted by @lucming in #1597
- apis: add HostApplications for out-of-band application by @zwzhang0107 in #1508
- koord-scheduler: fix runtime quota when quota is not allow lent resource by @shaloulcy in #1583
- koord-scheduler: supports plugins to adjust reservation reusable resources by @eahydra in #1600
- Add multi quota tree by @shaloulcy in #1603
- koord-scheduler: ignore non-existing resources in NUMA Node by @eahydra in #1607
- koord-scheduler: update NodeNUMAResource plugin args by @eahydra in #1610
- koord-scheduler: support reservation score normalization by @eahydra in #1614
- koord-scheduler: update Node capacity by profile by @shaloulcy in #1612
- koord-scheduler: optimize NUMA Topology Manager interface by @eahydra in #1611
- apis: mark device health defaults to false by @eahydra in #1619
- chore(deps): bump actions/checkout from 3 to 4 by @dependabot in #1618
- chore(deps): bump actions/setup-go from 3 to 4 by @dependabot in #1589
- koord-manager: add quota profile controller by @shaloulcy in #1605
- koord-scheduler: numa-aware scheduling supports scoring by @eahydra in #1609
- koord-manager: fix role for elasticquotaprofile by @shaloulcy in #1620
- koord-descheduler: add Arbitrator by @baowj-678 in #1586
- koord-scheduler: add annotation about namespaces that limit quota usage by @xulinfei1996 in #1566
- apis: introduce api for node resource amplification by @zqzten in #1622
- koordlet: report CPUBasicInfo, adjust normalized cpu cgroups by @saintube in #1604
- koord-manager: support cpu normalization by @saintube in #1599
- koord-scheduler: support required CPU bind policy by @eahydra in #1628
- ci: fix koord-manager role and ignore errors in e2e job by @saintube in #1630
- koord-scheduler: update quota and pod handle for elasticquotaprofile by @shaloulcy in #1621
- ci: remove quota profile rbac role for koord-scheduler by @shaloulcy in #1632
- koordlet: report be cpu share pool and set cpuset if specified by @zwzhang0107 in #1626
- koord-descheduler: add some SortFns for arbitration by @baowj-678 in #1625
- ci: fix quota webhook verb by @shaloulcy in #1635
- koord-scheduler: ElasticQuota plugin implements EnqueueExtensions by @eahydra in #1637
- koord-scheduler: add ElasticQuotaIgnoreOverhead feature-gate by @eahydra in #1636
- koord-scheduler: framework extension executes handler after handling scheduling errors by @eahydra in #1633
- koordlet: report NRT zone resources mergeable by @saintube in #1640
- chore(deps): bump docker/setup-buildx-action from 2 to 3 by @dependabot in #1648
- koordlet: support qos for out-of-band applications by @zwzhang0107 in #1638
- koord-scheduler: NodeNUMAResource only consider allocatable when generating hints by @eahydra in #1647
- koordlet: add libpfm4&perf group by @bowen-intel in #1554
- koordlet: revise BECPUEvict, RdtResctrl, CPUSetAllocator and system collector by @saintube in #1641
- koordlet: add perf cgroup unsupported by @zwzhang0107 in #1653
- koord-manager: fix LessThanOrEqual in elastic quota by @shaloulcy in #1660
- koord-scheduler: revise NodeNUMAResource by @eahydra in #1657
- Add new adopter by @pheianox in #1642
- koordlet: fix resource qos rules missing LSE by @saintube in #1663
- koord-manager: add NUMA-level batch resources by @saintube in #1631
- Makefile: fix libpfm4 make test by @bowen-intel in #1661
- Makefile: exit for macOS by @bowen-intel in #1664
- koord-scheduler: add ElasticQuota Guaranteed usage by @shaloulcy in #1645
- koord-scheduler: export min/max getters of QuotaInfo as public by @eahydra in #1666
- add DisableDefaultQuota featuregate by @shaloulcy in #1665
- koord-manager: calculate batch resource with kubelet reserved by @zwzhang0107 in #1667
- chore(deps): bump docker/login-action from 2 to 3 by @dependabot in #1650
- koord-scheduler: optimize NodeNUMAResource scoring by @eahydra in #1669
- chore(deps): bump docker/build-push-action from 4 to 5 by @dependabot in #1649
- koord-scheduler: LoadAware supports node amplification ratios by @jasonliu747 in #1673
- koord-scheduler: update elastic quota endpoint by @jasonliu747 in #1674
- koordlet: add cold memory collection and reporting by @BUPT-wxq in #1574
- koord-scheduler: inplace update quota min and max by @shaloulcy in #1672
- koord-scheduler: NodeNUMAResource supports amplification ratios by @eahydra in #1671
- koord-scheduler: add e2e case for multi quota tree by @shaloulcy in #1675
- koord-scheduler: fix root quota when deleted by @shaloulcy in #1678
- koord-manager: fix quota profile resource ratio by @shaloulcy in #1679
- koord-scheduler: NodeNUMAResource supports filtering/scoring with node-level amplification ratios by @eahydra in #1677
- koord-scheduler: NodeNUMAResource scoring fix and tweak with node-level amplification ratios by @zqzten in #1683
- e2e: add basic quota e2e cases by @shaloulcy in #1685
- koord-scheduler: fix NodeNUMAResource allocate CPUSets on mixed CPUSet/Share NUMA Node by @eahydra in #1682
- koordlet: revise logs in be suppress and batch resource by @saintube in #1684
- Fix runtime proxy compatibility with CRI interface version by @hormes in #1593
- refactor codes to help readability by @hormes in #1687
- koord-descheduler: add arbitration to migration controller by @baowj-678 in #1651
- koord-scheduler: add decorator for quota by @shaloulcy in #1686
- koord-scheduler: optimize elastic quota log by @shaloulcy in #1690
- webhook: revise the validation of Pod resources by @eahydra in #1693
- scheduler: optimize reservation BeforePreFilter performance by @eahydra in #1695
- dockerfile: support multiarch image build by @saintube in #1697
- ci: add more slo controllers e2e test by @saintube in #1688
- scheduler: modify coscheduling Less function with considering childScheduleCycle by @buptcozy in #1702
- proposal: change node resource amplification from api to scheduling level by @zqzten in #1700
- koord-scheduler: Add non-preemptible feature in Elastic Quota Management by @tan90github in #1668
- chore(deps): bump goreleaser/goreleaser-action from 4 to 5 by @dependabot in #1643
- scheduler: quota debug api support multi tree by @shaloulcy in #1706
- ci: support loading all components image in E2E by @saintube in #1705
- scheduler: make ReservationInfo.AddAssignedPod idempotent by @eahydra in #1709
- webhook: add force update label for elastic quota by @shaloulcy in #1708
- manager: add GetNodeAllocatable in quota-controller by @shaloulcy in #1710
- koordlet: fix metric name typo by @saintube in #1712
- scheduler: use asynclog to reduce klog performance overhead by @eahydra in #1694
- scheduler: reservation senses node affinity by @eahydra in #1713
- scheduler: optimize memory allocations by @eahydra in #1714
- koord-manager: enhance batch resource configuration and updating by @saintube in #1703
- scheduler: add elastic quota metric by @shaloulcy in #1716
- scheduler: reservation transformer ends early with reservation affinity by @eahydra in #1717
- apis: make PCIEID of DeviceTopology as string type by @ZiMengSheng in #1718
- koordlet: cold memory fix usageWithHotPage and add usageWithPageCache by @BUPT-wxq in #1699
- chores: update outdated dingtak qr code by @zwzhang0107 in #1719
- descheduler: check nodemetrics cr is expired or not when descheduling by @lucming in #1721
- pkg/koordlet: update some log level to silence log by @zxh326 in #1723
- ci: disable codeql in github action workflow by @jasonliu747 in #1731
- scheduler: numa-aware scheduling supports selecting numa node by score by @eahydra in #1726
- scheduler: revise numa-aware hints generator by @eahydra in #1732
- apis: add host application metric to node metric crd by @zwzhang0107 in #1725
- koordlet: report host application resource usage for calculating batc… by @zwzhang0107 in #1733
- scheduler: update reservationInfo ownerMatchers if reservation updated by @eahydra in #1740
- scheduler: leaky resources in ReservationInfo when pod failed to bind by @eahydra in #1742
- apis: add core sched apis by @saintube in #1720
- scheduler: fix missing update quota when pod is in DeletedFinalStateUnknown state by @qinfustu in #1737
- koordlet: fix prediction restore for node priorities by @saintube in #1749
- slo-controller: supppor third party allocator for batch resource calculation by @zwzhang0107 in #1739
- koord-manager: get node batch allocatable from extended resource by @zwzhang0107 in #1752
- proposal: CPU Normalization by @zqzten in #1598
- scheduler: Add non-preemptible Request in quotaInfo's QuotaCalculateInfo by @xulinfei1996 in #1755
- koord-manager: improve batch resource calculation by @saintube in #1751
- api: add koordinator.sh/gpu.shared by @ZiMengSheng in #1763
- feat(deps): bump github.com/golang/protobuf from 1.5.2 to 1.5.3 by @dependabot in #1623
- koordlet: support pouch container by @zwzhang0107 in #1761
- scheduler: verify quota according to resource dimensions required by pod by @qinfustu in #1753
- proposal: add device allocation apis by @eahydra in #1529
- apis: add device allocate hints by @eahydra in #1766
- chore: fix koordinator typo by @ikaven1024 in #1769
- koordlet: add page cache collector unit test and mend default cold memory config by @BUPT-wxq in #1738
- scheduler: DeviceShare supports allocating by hints by @eahydra in #1768
- chore(deps): bump actions/setup-go from 4 to 5 by @dependabot in #1771
- koordlet: report gpu device topology by @eahydra in #1775
- koordlet: support hugepage reporting by @peiqiaoWang in #1744
- scheduler: fix redundant argument when create a unschedulable status. by @bogo-y in #1782
- scheduler: DeviceShare supports numa-aware scheduling by @eahydra in #1774
- scheduler: fix DeviceShare concurrent map read and map write by @jasonliu747 in #1788
- scheduler: Coscheduling skips check schedule cycle if pod has nominated node by @xulinfei1996 in #1790
- scheduler: DeviceShare skips numa-aware if no device requests by @jasonliu747 in #1791
- koordlet: fix lscpu for arm64 by @zwzhang0107 in #1792
- koordlet: revise system qos and numa aware cpuset by @saintube in #1793
- koordlet: fix lscpu for arm ut by @zwzhang0107 in #1794
- colocationprofile: support mutating pod labels and annotations with mapping by @saintube in #1781
- chores: add args for Makefile by @zwzhang0107 in #1795
- scheduler: Reservation supports preempting permitted pods by @eahydra in #1800
- scheduler: fix the calculation method in PreFilterExtension by @eahydra in #1802
- scheduler: FilterReservation/ScoreReservation perceives preemptible states by @eahydra in #1803
- scheduler: support DeviceHandler to selectively generate requests and replicas by @eahydra in #1806
- scheduler: optimize DeviceHandler interface by @eahydra in #1808
- scheduler: enhance ReservationNominator to support preemption by @eahydra in #1804
- scheduler: fix preemption panic by @eahydra in #1810
- koordlet: support core sched cookie management by @saintube in #1722
- scheduler: change reservation default allocate policy to aligned by @eahydra in #1814
- scheduler: remove useless code from Reservation plugin by @eahydra in #1815
- chores: replace shimo doc to dingtalk by @kangclzjc in #1813
- scheduler: add monitor to detect scheduling deadlock by @eahydra in #1817
- feat(deps): bump go.uber.org/atomic from 1.10.0 to 1.11.0 by @dependabot in #1778
- koordlet: fix be cpuset for numaResource by @saintube in #1818
- chore: only require security updates and exclude version updates by @jasonliu747 in #1823
- chores: update bi-weekly meeting notes link by @ZiMengSheng in #1824
- slo-controller: Introduce node resource amplification plugin by @zqzten in #1796
- scheduler: refine log in device share plugin by @jasonliu747 in #1826
- scheduler: fix skip logic in reservation plugin by @jasonliu747 in #1827
- chore: fix some unlinted errors by @saintube in #1811
- webhook: add node webhook for resource amplification by @hormes in #1785
- koordlet: fix RDT cache id issue by @bowen-intel in #1789
- docs: support NetworkQoS based on external plugin such as terway-qos by @lucming in #1779
- koordlet: fix memory leak in perfGroup by @bowen-intel in #1828
- koordlet: fix typo in function isBECPUUsageHighEnough by @j4ckstraw in #1830
- koordlet: fix by adding lock during podmap update by @xujihui1985 in #1838
New Contributors
- @shaloulcy made their first contribution in #1565
- @baowj-678 made their first contribution in #1454
- @zqzten made their first contribution in #1549
- @tan90github made their first contribution in #1555
- @pheianox made their first contribution in #1642
- @zxh326 made their first contribution in #1723
- @qinfustu made their first contribution in #1737
- @ikaven1024 made their first contribution in #1769
- @peiqiaoWang made their first contribution in #1744
- @bogo-y made their first contribution in #1782
- @xujihui1985 made their first contribution in #1838
Full Changelog: v1.3.0...v1.4.0